SmotifTF Version 0.05

Template-free modeling algorithm.

SYNOPSIS

SmotifTF carries out template-free structure prediction using a dynamic
library of supersecondary structure fragments obtained from a set of
remotely related PDB structures.

This README provides the information required for downloading, installing
and running the software package. For more information on how to run the
program, use "perldoc SmotifTF" after installation.

DOWNLOAD

Download SmotifTF package from CPAN:

INSTALLATION

To install the SmotifTF package, use one of the following methods:

1. Manually: install where standard Perl modules are stored

    tar -zxvf SmotifTF-version.tar.gz
    cd SmotifTF-version/
    perl Makefile.PL
    make
    make test
    make install

2. Install in a custom location (/home/user/MyPerlLib)

    tar -zxvf SmotifTF-version.tar.gz
    cd SmotifTF-version/
    perl Makefile.PL PREFIX=/home/user/MyPerlLib/
    make
    make test
    make install

   Do not forget to add the following line:

    use lib "$ENV{HOME}/MyPerlLib/share/perl5/"

   in ./ and ./

3. Using a CPAN client: as root, type:

    perl -MCPAN -e shell
    > install SmotifTF

4. Using a CPAN client and installing in a custom location
   (/home/user/MyPerlLib)

    perl -MCPAN -e shell
    > o conf makepl_arg PREFIX=/home/user/MyPerlLib/
    > install SmotifTF

   Do not forget to add the following line:

    use lib "$ENV{HOME}/MyPerlLib/share/perl5/"

   in ./ and ./

PRE-REQUISITES

The Smotif-based modeling algorithm requires the query protein sequence
as input.

Software/data required:

1. Psipred (
2. HHSuite (
3. Psiblast and Delta-blast (
4. Modeller (version 9.14
5. Local PDB directory (central or user-designated) - updated (

Download and install the above-mentioned software and data according to
their instructions.

Note: Psipred and Psiblast require legacy blast, and Delta-blast is a part
of the Blast+ package.

DATABASES REQUIRED:

1. PDBAA blast database is required (
2. HHsuite databases NR20 and PDB70 are required (

SET UP CONFIGURATION FILE

The configuration file, smotiftf_config.ini, has all the information
regarding the required library files and other pre-requisite software.
Set all the paths and executables in this file correctly.

Set the environment variable in your .bashrc file:

    export SMOTIFTF_CONFIG_FILE=/home/user/MyPerlLib/share/perl5/SmotifTF-version/smotiftf_config.ini

MODELING ALGORITHM STEPS

----------------------------------------------------
| First run the Pre-requisites:                    |
|   Psipred, HHblits+HHsearch, Psiblast,           |
|   Delta-blast                                    |
|                                                  |
|   Single-core job                                |
|   Usage: perl --step=all                         |
|     --sequence_file=1zzz.fasta --dir=1zzz        |
----------------------------------------------------

----------------------------------------------------
| Step 1:                                          |
|   Compare Smotifs                                |
|                                                  |
|   Multi-core / cluster job                       |
|   Usage: perl --step=1 --pdb=1zzz                |
----------------------------------------------------

----------------------------------------------------
| Step 2:                                          |
|   Rank Smotifs                                   |
|                                                  |
|   Multi-core / cluster job                       |
|   Usage: perl --step=2 --pdb=1zzz                |
----------------------------------------------------

----------------------------------------------------
| Step 3:                                          |
|   Enumerate all possible combinations of         |
|   Smotifs (about a million models)               |
|                                                  |
|   Multi-core / cluster job                       |
|   Usage: perl --step=3 --pdb=1zzz                |
----------------------------------------------------

----------------------------------------------------
| Step 4:                                          |
|   Rank enumerated structures using a             |
|   composite energy function                      |
|                                                  |
|   Single-core job                                |
|   Usage: perl --step=4 --pdb=1zzz                |
----------------------------------------------------

----------------------------------------------------
| Step 5:                                          |
|   Run Modeller to generate top 5 complete        |
|   models                                         |
|                                                  |
|   Single-core job                                |
|   Usage: perl --step=5 --pdb=1zzz                |
----------------------------------------------------

HOW TO RUN SMOTIFTF:

1. The two perl scripts needed to run SmotifTF are: and
   If installed locally, the correct path name to the SmotifTF perl
   library must be provided in both scripts.

2. Create a subdirectory with a dummy pdb file name (e.g. 1abc or 1zzz).

3. Put the query fasta file (1zzz.fasta) in this directory.

4. Run the pre-requisites first. This runs Psipred, HHblits+HHsearch,
   Psiblast and Delta-blast. The input is the query sequence in fasta
   format and the outputs are (a) a dynamic database of Smotifs and
   (b) the putative Smotifs in the query protein. These are used in the
   subsequent modeling steps. Follow the instructions given in

   For more information about the pre-requisites use:

    perl -help

   Usage:

    perl --step=all --sequence_file=1zzz.fasta --dir=1zzz

5. After the pre-requisites are completed, run steps 1 to 5 sequentially,
   as given above. Output from previous steps is often required in
   subsequent steps, so wait for each step to complete without errors
   before going to the next step. Follow the instructions given in

   For more information use:

    perl -help

   Usage:

    perl --step=[1-5] --pdb=1zzz

6. To run steps 1-5 together use:

    perl --step=all --pdb=1zzz

7. Use multiple cores or clusters, as available, for steps 1 & 3 above.
   These are computationally intensive steps.

Results:

The top 5 models are stored in the subdirectory (1abc or 1zzz) as:
Model.1.pdb, Model.2.pdb, Model.3.pdb, Model.4.pdb & Model.5.pdb

HOW TO TEST SMOTIFTF PACKAGE

A sample fasta sequence (4uzx.fasta) is provided with the distribution
and can be used to test the SmotifTF software installation.
The fasta file can be found at:

    /home/user/MyPerlLib/share/perl5/SmotifTF-version/t/Data/4uzx.fasta

Steps to perform the test:

Create a directory named 4uzx

    mkdir 4uzx

Copy the fasta file into the directory

    cp /home/user/MyPerlLib/share/perl5/SmotifTF-version/t/Data/4uzx.fasta 4uzx/

Run pre-requisites

    perl --step=all --sequence_file=4uzx.fasta --dir=4uzx

Run modeling algorithm

    perl --step=all --pdb=4uzx

REFERENCE

Vallat BK, Fiser A.
Modularity of protein folds as a tool for template-free modeling of
sequences. Manuscript under review.

AUTHORS

Brinda Vallat, Carlos Madrid, Andras Fiser C<< <andras at> >>

SUPPORT AND DOCUMENTATION

After installing, you can find documentation for using the software with
the perldoc command.

    perldoc SmotifTF

You can also look for information at:

    RT, CPAN's request tracker (report bugs here)

    AnnoCPAN, Annotated CPAN documentation

    CPAN Ratings

    Search CPAN

LICENSE AND COPYRIGHT

Copyright (C) 2015 Fiserlab Members

This program is free software; you can redistribute it and/or modify it
under the terms of the Artistic License (2.0). You may obtain a copy of
the full license at:

L<>

Any use, modification, and distribution of the Standard or Modified
Versions is governed by this Artistic License. By using, modifying or
distributing the Package, you accept this license. Do not use, modify,
or distribute the Package, if you do not accept this license.

If your Modified Version has been derived from a Modified Version made
by someone other than you, you are nevertheless required to ensure that
your Modified Version complies with the requirements of this license.

This license does not grant you the right to use any trademark, service
mark, tradename, or logo of the Copyright Holder.

This license includes the non-exclusive, worldwide, free-of-charge patent
license to make, have made, use, offer to sell, sell, import and
otherwise transfer the Package with respect to any patent claims
licensable by the Copyright Holder that are necessarily infringed by the
Package. If you institute patent litigation (including a cross-claim or
counterclaim) against any party alleging that the Package constitutes
direct or contributory patent infringement, then this Artistic License
to you shall terminate on the date that such litigation is filed.

Disclaimer of Warranty: THE PACKAGE IS PROVIDED BY THE COPYRIGHT HOLDER
AND CONTRIBUTORS "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES.
THE IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR
PURPOSE, OR NON-INFRINGEMENT ARE DISCLAIMED TO THE EXTENT PERMITTED BY
YOUR LOCAL LAW. UNLESS REQUIRED BY LAW, NO COPYRIGHT HOLDER OR
CONTRIBUTOR WILL BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, OR
CONSEQUENTIAL DAMAGES ARISING IN ANY WAY OUT OF THE USE OF THE PACKAGE,
EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
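
EXAMPLE: RUNNING STEPS 1 TO 5 IN ORDER

The instruction under HOW TO RUN SMOTIFTF to run steps 1 to 5
sequentially, waiting for each to finish without errors, could be
wrapped in a small shell function along the following lines. This is
only a sketch and not part of the distribution: the function name
run_steps and the DRY_RUN variable are hypothetical, the path to the
modeling script must be supplied by the user because it is not named
here, and the sketch assumes the script exits with a non-zero status
when a step fails. Only the --step and --pdb options come from this
README.

```shell
#!/bin/sh
# Hypothetical wrapper (not shipped with SmotifTF): run modeling
# steps 1-5 one after another and stop at the first failure.
#
# Usage: run_steps /path/to/modeling_script 1zzz
#   $1  path to the SmotifTF modeling script (user-supplied)
#   $2  name of the working subdirectory / dummy pdb id
#
# Set DRY_RUN=1 to print the commands instead of executing them.
run_steps() {
    script=$1
    pdb=$2
    for step in 1 2 3 4 5; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            # Only show what would be executed.
            echo "perl $script --step=$step --pdb=$pdb"
        else
            echo "Running step $step for $pdb" >&2
            # Assumption: a failed step returns a non-zero exit status.
            perl "$script" --step="$step" --pdb="$pdb" || {
                echo "Step $step failed; stopping." >&2
                return 1
            }
        fi
    done
}
```

Calling the function with DRY_RUN=1 first is a cheap way to check the
five command lines before committing cluster time to steps 1 and 3.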