SIRAS Phasing with Iodide
Performing the Calculation with Mlphare from CCP4
Argand Diagram displaying the contribution of anomalous scattering to the phasing calculation. Taken from a figure from  Crystallography 101 
SIRAS Phasing:A sample scriptis provided below for a typical SIRAS phasing calculation.  This particular example is taken from the crystal structure of proteinase K.  All steps on the path are provided from preparing the Scalepack files to displaying the map with O. The general path is:

scalepack2mtz -> 
                truncate -> 
                            cad -> 
                            scaleit -> 
                            mlphare -> 
                                        dm -> 
                                            fft -> 
                                                mapman -> O
 

Example log files are given for each step, along with comments about how to judge the quality of the output. 

Documentation for CCP4 programs:
CCP4 v4.0 Program Documentation locally on /joule2/programs
CCP4 v4.0 Program Documentation from the U.K. site

References for CCP4 programs:
Collaborative Computational Project , Number 4. 1994. "The CCP4 Suite: Programs for Protein Crystallography". Acta Cryst. D50, 760-763 


scalepack2mtz.com
SCRIPT
COMMENTS
#!/bin/csh -f 
#
scalepack2mtz hklin ~/data/prok1.sca\
hklout mtz/nat_i.mtz << eof
SYMM 96
ANOMALOUS NO
END
eof
#
#
scalepack2mtz hklin ~/data/prok_io5.sca\
hklout mtz/io_i.mtz << e
of
SYMM 96
ANOMALOUS YES
END
eof
#
CCP4 document for scalepack2mtz

The native scalepack data set, native.sca, is converted to mtz format first, followed by iodide derivative. 

SYMM -This keyword is compulsory and can be given as the space group name or number. Here we are using space group number 96, otherwise known as P43212. 

ANOMALOUS -Specify whether or not input file contains anomalous data. It is set to NO for the native, and YES for the derivatives. 

---------Logfile from scalepack2mtz---------- 
For each data set check the log file to see that the number of columns, reflections, unit cell, and space group are what you intended. 


truncate.com
SCRIPT
COMMENTS
#!/bin/csh -f 
truncate HKLIN  mtz/nat_i.mtz \
HKLOUT mtz/nat_f.mtz \
<< EOF_truncate > log/truncate_nat.log
TITLE     NATIVE TRUNCATE OUTPUT
LABOUT  F=Fnat sigF=sigFnat
NRESIDUES 272
EOF_truncate
#
truncate HKLIN  mtz/io_i.mtz \
HKLOUT mtz/io_f.mtz \
<< EOF_truncate > log/truncate_io.log
TITLE     NATIVE TRUNCATE OUTPUT
LABOUT  F=Fio sigF=sigFio DANO=Dio SIGDANO=sigDio
NRESIDUES 272
EOF_truncate
CCP4 document for truncate

The native and derivative data sets are processed with truncate. Truncate converts the intensities into structure factors by taking the square root. Hence, the input files have the extension _i.mtz and the output files have the extension _f.mtz. It also puts the data sets on an absolute scale. This makes it convenient for estimating heavy atom occupancies. 

LABOUT -Output labels for each column of data. These should be meaningful and easy to remember. 

NRESIDUES -The number of residues in the asymmetric unit. Used to put the data sets on an absolute scale. 

---------Logfile from truncate---------- 

Truncate outputs some useful statistics. The Wilson B-factor is calculated and plotted: 

For a wilson plot B = - gradient 
SCALE = exp( - intercept).
Least squares straight line gives: B = 21.434 SCALE = 216.53
where F(absolute)**2 = SCALE*F(observed)**2*EXP(-B*2*SINTH**2/L**2) 
There are also statistics to help you determine whether the data is anisotropic and if there are any systematic absences due to lattice centering.

cad.com
SCRIPT
COMMENTS
#!/bin/csh -f
cad     hklin1  mtz/nat_f.mtz \
        hklin2  mtz/io_f.mtz \
        hklout  mtz/derivs_unscaled.mtz \
<<eof-cad
RESOLUTION OVERALL 90.0  1.8
SYMMETRY 96
TITLE native and Iodide derivative
SCAL  FILE 1  1.0
LABIN  FILE 1  E1=Fnat    E2=sigFnat
CTYP   FILE 1  E1=F       E2=Q
LABOUT FILE 1  E1=Fnat    E2=sigFnat

LABIN FILE 2  E1=Fio  E2=sigFio E3=Dio E4=sigDio
CTYP  FILE 2  E1=F    E2=Q      E3=D   E4=Q
LABOUT FILE 2 E1=Fio  E2=sigFio E3=Dio E4=sigDio

END
eof-cad

CCP4 document for cad

The native and iodide derivative data sets are collected into one single file with cad. 

LABIN- Input labels for each column of data. 

LABOUT -Output labels for each column of data. These should be the same as LABIN. 
 
 

---------Logfile from cad---------- 

Check for normal termination.

scaleit.com
SCRIPT
COMMENTS
#!/bin/csh -f
#
scaleit  hklin mtz/derivs_unscaled.mtz \
hklout mtz/derivs.mtz << eof >log/scaleit.log
TITLE Scaling native and derivatives
GRAPH H K L MODF
REFINE ANISOTROPIC
LABI FP=Fnat SIGFP=sigFnat -
FPH1=Fio SIGFPH1=sigFio -
DPH1=Dio SIGDPH1=sigDio
eof

cat log/scaleit.log
grep acceptable log/scaleit.log

CCP4 document for scaleit

The derivative is scaled to the native data set using an overall scale factor and B factor. 

Grep for the word "acceptable" from the log file to see what threshold should be used for the exclude statement in mlphare.com. 

---------Logfile from scaleit---------- 

Scaleit outputs some useful statistics. Check the isomorphous R-factor.  Plot the graphs of "Analysis vs. resolution" by typing xloggraph log/scaleit.log.

mlphare.com
SCRIPT
COMMENTS
#!/bin/csh -f
#

mlphare HKLIN mtz/derivs.mtz\
HKLOUT mtz/derivs_ph.mtz\
<<END-phare >log/mlphare.log
TITLE SIRAS  phasing from  Iodide sites
SYMM 96
ANGLE 10
THRESHOLD 2.5 0.5
PRINT AVF AVE
RESO 20 1.8
CYCLES 15
LABI FP=Fnat SIGFP=sigFnat -
FPH1=Fio  SIGFPH1=sigFio DPH1=Dio SIGDPH1=sigDio
LABO ALLIN  PHIB=PHIO FOM=FOMO
HLOUT HLA=HLA HLB=HLB HLC=HLC HLD=HLD
EXCLUDE DISO 292
EXCLUDE DANO 58
RUN
DERIV     I-  iodide
DCYCLE PHASE ALL REFCYC ALL KBOV ALL

ATOM1 I- -0.0559 0.2496 0.2872 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

ATOM2 I-  0.2518 0.1552 0.3514 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

ATOM3 I-  0.5913 0.2789 0.1858 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

ATOM4 I- -0.0435 0.2228 0.4155 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

ATOM5 I- -0.0148 0.4773 0.0318 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

ATOM6 I- -0.0910 0.0769 0.2634 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

ATOM7 I-  0.0191 0.2712 0.3975 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

ATOM8 I- 0.4869 -0.1185 0.4652 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

ATOM9 I- 0.5524 -0.0890 0.1452 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

ATOM10 I- 0.172 -0.0129 0.0698 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

ATOM11 I- -0.138 0.1329 0.3132 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

ATOM12 I- 0.1027 -0.1027 0.250 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

ATOM13 I- 0.0206 0.0992 0.0891 1.0 1.0 BFAC 25.0
ATREF X ALL Y ALL Z ALL OCC 1 3 5 7 9 11 13 15-
AOCC 1 3 5 7 9 11 13 15 BFAC 2 4 6 8 10 12 14

END-phare
 

CCP4 document for mlphare

Plug in the iodide positions from ShelxD. Refine heavy atom positions for 15 cycles and calculate phases using both isomorphous and anomalous differences. Because ocupancy and Bfactor parameters are coupled, refinement cycles should alternate between occupancy and bfactor refinement as implemented in this script. Refinement of both occupancy and Bfactor during the same cycle can be unstable. 

I usually include the highest resolution possible in the phasing calculation.  Although the statitistics will be poor for the highest shells, Dm will improve the phases with solvent flattening.

---------Logfile from mlphare---------- 

Mlphare outputs some useful statistics. Check that the refinement parameter and anomalous refinement parameter are converging toward 1.0 with each cycle of refinement.  If not, then further cycles of refinement may be necessary (not usually the case.) 

---------------------------------------------
 Refinement Parameter           =   0.997258
 Anomalous Refinement Parameter =   0.860128
 --------------------------------------------
 

Check the phasing power (PhP_a and PhP_c), these should be 1.0 or greater for resolution shells below 3.0 Angstrom resolution.  Check that RCullis is under 1.0
========================================================
 4. Lack of closure analyses

 This includes the isomorphous difference between FP and FPH,
 the lack of closure - difference between FP+FHcalc and FPH,
 calclated for Mean differences, and Rms differences
 each using 3 different weighting schemes:
  a) division by the number of reflections in the range:
  b) a weighted sum using Weight 1( see definition below):
  c) a weighted sum using Weight 2( see definition below):
 These are used to calculate PHASING POWER, defined as:
 (Mean FH / Lack of closure) and  a "Cullis Rfactor"
 extended from the old definition to be:
 (Lack_of_closure / Isomorphous_difference)

 Unweighted MEAN lack-of-closure analysis
 ========================================
1/resol^2
 Resolution(Angstroms)
 Number_acentric_reflections
 Isomorphous_difference_acentric
 Lack_of_closure_acentric
 Phasing_power_acentric
 Cullis_R_acentric(?<1.0)
 Number_centric_reflections
 Isomorphous_difference_centric
 Lack_of_closure_centric
 Phasing_power_centric
 Cullis_R_centric(?<1.0) 
1/res^2 Res Nref_a DISO_a LOC_a PhP_a CullR_a Nref_c DISO_c LOC_cPhP_c CullR_c
0.013 8.83  112 55.2 104.1 2.14 1.89  106  72.1 130.1 1.40 1.81
0.031 5.67  397 62.2  88.5 2.10 1.42  192  78.6 118.1 1.53 1.50
0.057 4.17  879 77.6  69.5 2.20 0.90  297  96.7  85.3 1.52 0.88
0.092 3.30 1539 85.2  68.5 1.59 0.80  365 110.6  84.3 1.20 0.76
0.134 2.73 2393 68.5  55.2 1.45 0.81  464  90.3  71.1 0.95 0.79
0.184 2.33 3430 54.0  44.7 1.16 0.83  550  69.4  54.1 0.89 0.78
0.242 2.03 4644 46.8  41.8 0.78 0.89  615  60.2  50.7 0.58 0.84
0.309 1.80 5832 33.8  31.1 0.62 0.92  641  41.2  36.1 0.50 0.88

TOTAL     19226 51.7  45.5 1.18 0.88 3230  72.8  64.9 1.04 0.89

 *******************************************************
 Analysis of anomalous phasing for Derivative  1
 *******************************************************
  ========================================================
 5. Lack of closure analyses on anomalous differences

 This includes the anomalous difference, the expected
 calculated anomalous lack of closure, the anomalous
 lack of closure (i.e. the difference between DPH and
 DPHcalc, and a "Cullis Rfactor" extended from the old
 definition to be:
 (Lack_of_closure / Anomalous_difference)

 Unweighted MEAN anomalous lack-of-closure analysis
 ==================================================

TABLE: Anomalous lack of closure  analysis  v resln using MEANS-  Phasing    cycle - deriv   1:

 Number_acentric_reflections
 Anomalous_difference_obs
 Anomalous_difference_calc
  Lack_of_ano_closure
 Cullis_R_anomalous $$
 1/resol^2 Nref_a DANO_obs DANO_calc LOC_ano CullR_ano 
   0.013     112    25.1    22.6     13.41    0.53
   0.031     397    19.5    20.1     11.23    0.57
   0.057     879    18.5    18.2     10.22    0.55
   0.092    1537    16.0    13.5     10.51    0.66
   0.134    2393    13.8    10.8      8.38    0.61
   0.184    3427    11.3     7.6      7.60    0.67
   0.242    4607     9.8     5.0      8.08    0.82
   0.309    5803     7.6     3.2      6.66    0.88
   TOTAL   19155    11.1     7.3      7.99    0.72
 


dm.com
SCRIPT
COMMENTS
dm  hklin mtz/derivs_ph.mtz\
hklout mtz/dm.mtz \
<< my-data >log/dm.log
SOLC 0.40
MODE SOLV HIST MULT
RESOLUTION 20 1.8
COMBINE omit
NCYCLE AUTO
#SCHEME RES FROM 2.0
LABIN FP=Fnat SIGFP=sigFnat -
PHIO=PHIO FOMO=FOMO -
HLA=HLA HLB=HLB HLC=HLC HLD=HLD
LABOUT PHIDM=PHI1 FOMDM=W1 -
HLADM=HLADM HLBDM=HLBDM -
HLCDM=HLCDM HLDDM=HLDDM
my-data
 
CCP4 document for Dm

If your native data set extends to higher resolution than the derivative, then use phase extension to the highest resolution possible.  High resolution phases are important for success in ARP/wARP automated chain tracing.
 


fourier.com
SCRIPT
COMMENTS
#!/bin/csh -f
#calculate fourier map for viewing
#
fft HKLIN mtz/dm.mtz \
MAPOUT map/prok.map << END_fft
RESO 20 1.8
LIST 100
FFTSPG 1
TITL  prok Fourier map phase with I-.
BINMAPOUT
XYZLIM 0 1 0 1 0 .5
LABI F1=Fnat SIG1=sigFnat PHI=PHI1 W=W1
SCALE F1 1.0 0.0
END_fft
#
#
extend MAPIN map/prok.map \
MAPOUT map/prok.ext << END-extend
XYZLIM 0 1  0 1.0  0 1
SYMMETRY 96
END-extend

rm fftkw.abcoeffs

al_mapman -b mapsize 10000000<<eof
re m1 map/prok.ext ccp4
no m1
map m1 map/prok.omap
bo sk m1 1.3 0.7 100
bo co map/prok.odb skl 5
quit
eof

 

CCP4 document for fft

CCP4 document for extend

convert map to dsn6 format for display in O.  Also calculate a bones representation of the density for viewing with O.

Here is a convenient macro for viewing your map with O.
s-a-i /pdb/pdb2prk.ent
helix
pdb
mol helix
zone ; end
sym_setup helix 67.93  67.93 102.34 90.00 90.0 90.0 p43212
sym_cell
 

map_file  prok.omap
map_obj 2fofc
map_param
30 30 30
1.3 slate_blue ; ;
map_active
map_draw
 

read_fo prok.odb
bo_set skl bones 100 1,3
bone_draw

menu @mapmove on
 

Back to Phasing 

[Overview] ·[Facilities] · [People] · [Services] ·[Lectures] · [BioLinks] · [Stats] ·[Search]