sequencher4.5序列拼接软件使用说明
发布时间:2024-11-25
发布时间:2024-11-25
生物圈http:///Tour Guide for Windows and Macintosh Introduction
SEQUENCHER has been developed to work with a wide range of sequencing
applications. For example, SEQUENCHER can be used to:
Create assemblies for shotgun or EST sequencing projects Edit contigs while viewing all relevant trace data Assemble multiple sequences to a user-defined Reference Sequence Detect and annotate polymorphisms Align cDNAs to their genomic sequence using the Large Gap algorithm Discover heterozygous peaks Create difference reports for SNP discovery Display restriction maps, ORF maps, protein translations Automatically trim poor quality and vector sequences
Macintosh and PC Support
SEQUENCHER is available for both Macintosh and PC platforms. The demo CD provides software for both environments.
Unlimited Trial
With the SEQUENCHER demo, you can use your own data and enjoy an unlimited evaluation period. The only functions not available are copying data to other
applications, saving, exporting, printing, and reporting. Some sample data have been included to get you started, but once you have tried SEQUENCHER with the sample files, please try using your own data so you can see precisely what SEQUENCHER can do for you.
What You Will Learn in This Tutorial
The purpose of this tutorial is to guide you through SEQUENCHER‘s core assembly and editing functions. Additional application-specific tutorials are included on your CD in PDF format. In this tutorial, you will
1. Install the SEQUENCHER demo
2. Create a new project
3. Import data
4. Trim sequences
5. Assemble a contig
6. View contig assembly
7. Edit assembled chromatograms
8. Find heterozygotes
9. Work with a Reference Sequence
10. Translate sequences to amino acids
生物圈
11. Annotate a sequence
12. Create a Variance Table and Report
13. Create a Translated Variance Table
14. What else can I do with SEQUENCHER? http:///
Once you have mastered these techniques, you will be ready to explore
SEQUENCHER ’s other powerful features.
Conventions Used in this Guide
Menu items or keys that you are to select are in bold. The purple text provides step-by-step instructions for running through the tour guide and the black text provides additional information. Greater than symbols define menu > submenu commands. Before you start
Check that you have the appropriate hardware and disk space.
Additional Requirements will vary with your project.
MACINTOSH RECOMMENDED REQUIREMENTS:
10.3.9 and higher 512 MB RAM 150 MB hard disk space
WINDOWS MINIMUM REQUIREMENTS:
Windows 2000 and higher 512 MB RAM 150 MB hard disk space
1. Install SEQUENCHER Demo
Insert the demo CD into your machine. Double-click on the Sequencher 4.9 Demo installer icon.
The installer will create a Sequencher 4.9 Demo folder in your Applications (Mac) or Program Files > Gene Codes (Windows) folder.
2. Create a New Project
Launch SEQUENCHER by double-clicking on the Sequencher Demo icon.
On your Macintosh, this will be in the opened Sequencher Demo folder. On a
PC there is a Sequencher Demo icon on your desktop.
When SEQUENCHER is launched, a dialog will alert you that you are running in Demo Mode. After you hit the OK command button on the alert dialog,
生物圈http:///SEQUENCHER will present you with an empty Project window. This is where you
import, manipulate, and display sequence fragments and assembled contigs.
New Project Window
3. Import Data
To import data, select Import > Folder of Sequences... from the File menu.
Browse to the Applications (Mac) or Program Files > Gene Codes (Windows)
then Sequencher 4.9 Demo > Sample Data > Demo Sample Data > Demo
Sequences folder.
Click on the Choose (Mac) or OK (Windows) button at the bottom window.
When prompted to import the 9 files, select the Import All Files in Folder
command button. The Project now contains the 9 sequences.
Imported Sequences
These imported files, with their associated quality scores, are just one example of the wide variety of file types SEQUENCHER accepts for import. Note that the Quality column displays the % quality for each of the imported sequences—the percent of bases that are above the low quality threshold as set in the Confidence User
Preference pane.
4. Trim Sequences
SEQUENCHER has tools that allow you to trim imported sequences based on several different criteria: ambiguous data, data that have low confidence scores, or data
contaminated with vector sequence. The trimmed data are fully recoverable within the SEQUENCHER project. To trim the low confidence sequence:
生物圈
http:///From the menu bar, choose Select > Select All to highlight all sequences if
they aren’t already highlighted.
From the menu bar, choose Sequence > Trim Ends... SEQUENCHER displays the Ends Trimming window for the default trimming parameters.
SEQUENCHER recognizes poor quality sequence based on a number of criteria. The confidence score, provided in these samples, is one of the most sensitive. Increasing the stringency of the trim criteria further increases the quality of your data.
Select the Change Trim Criteria button. Uncheck all but the three criteria checked below, and adjust the values of the
two confidence trims to match.
Click OK to return to the overview for Ends Trimming.
The Ends Trimming window displays how much poor quality data will be trimmed based on the defined criteria. You have the additional option to individually deselect a fragment for trimming on either the 5' or 3' end by removing the "X" from the
appropriate box below the trim graphic.
Ends Trimming
Click the Trim Checked Items button at the top of the window, and then click the Trim button when asked for confirmation.
生物圈http:///The data that you have removed are completely recoverable. SEQUENCHER always stores two copies of every imported Sequence, the original sequence and the data as you have edited it in SEQUENCHER.
Close the Ends Trimming window and return to the Project window by
clicking on the close control in the upper corner of the window.
5. Assemble a Contig
Several alignment algorithms are provided with SEQUENCHER to accommodate the wide variety of assembly applications. For this example, you will use the Dirty Data algorithm because it is best suited for data that may include the occasional
ambiguities or miss-calls generated by automated sequencers. The Assembly
Strategies and the Assemble by Name tutorials in the Sequencher 4.9 Demo\Tutorials
folder explain the other assembly options.
生物圈
http:///Click on the Assembly Parameters button at the top of the Project window. Accept the defaults for the Assembly Algorithm, Minimum Match Percentage, and Minimum Overlap parameters. They should be Dirty Data, 85% and 20
bases, respectively.
Optimize gap placement by selecting Use ReAligner and Prefer 3’ Gap
Placement, if not already selected.
Select OK.
Once you have returned to the Project window, you are ready to begin assembly.
All of the sequence fragments should be selected. If they are not, select them
now using Select > Select All.
Click on the Assemble Automatically button at the top of the Project window. Click Close to dismiss the dialog.
6. View Contig Assembly
The Contig User Preference defines sorting criteria for new assemblies. The default preference sorts the fragments according to position, 5' to 3', within the contig.
SEQUENCHER provides a number of alternative sorting options.
Double-click on the Contig[0001] icon to open the contig Overview window. Click on the Sort button, select the by Strand radio button, and click on OK to
sort the fragments by strand.
Contig Overview
Click on the Sort button, select the by Position radio button, and click OK to
return to the original sorting order.
The Overview contains three sections. The top section displays a schematic of how the fragments are assembled in this contig. The arrows indicate the direction of the fragment in relation to the assembly.
SEQUENCHER provides a Selection Marquee that allows you to navigate from
within the Overview into the Bases window.
The next section provides coverage information. For instance, the consensus called
around base 1206 has less coverage than the surrounding consensus bases.
Below the coverage bar is the open reading frame map. Three bars marked with green
flags and red lines, representing start and stop codons respectively.
Click and drag on the Selection Marquee in the Overview so that it selects the
region around base 1,206..
Select the Bases button at the top of the window to open the Contig Editor and view the base sequences that assemble at this position.
7. Edit Assembled Chromatograms
The Contig Editor provides the tools for checking and editing sequences. It is divided into four quadrants. You can modify the appearance of the Contig Editor from the View menu and in your User Preferences.
Select View > Display Color Bases.
Under Window > User Preferences, change Display > Contig so the Font is
Courier New and the Size is 18.
Close the User Preferences window.
Contig Editor
The two upper panels show the individual fragment names to the left with their
sequences to the right. The Agent Box contains descriptive information about your sequences and your selection. The lower right panel displays consensus information including ambiguities <+> and disagreements < > if there are any. The blue shading describes the confidence, low, medium, and high, with the lighter shades defining higher confidence. Note that you may toggle off this shading by selecting View > Display Base Confidences.
To begin the editing process, move your selection to base one in the consensus. The Select menu provides several tools to navigate to areas of interest in the contig. From the Select menu, choose the command Next Ambiguous Base. The
Agent box reads "4 frag bases selected at consensus position 83," and "Select
Next Ambiguous Base = spacebar".
You have now moved your position in the consensus to the position of the first
ambiguous base. An ambiguous base in the consensus is any that includes: 1) A
contributing fragment base that is not an A, C, G, or T, 2) Disagreements between other fragment bases, and 3) All fragment bases that contribute to the consensus have low quality scores. Position 83 is flagged, because the base called in three of the
fragments is an "N".
To view the chromatograms at this position, while your selection is still on the ambiguous consensus position, select the Show Chromatograms button at the
top of the window.
Assembled Trace Windows
Each chromatogram window displays the current version of the base calls in black and above a line that separates them from the original base calls. At the left of the trace window, SEQUENCHER provides tools that allow you to manipulate how the traces are displayed. The volume control bars allow you to adjust the chromatogram peak height. The A, C, G, and T buttons allow you to turn off the display of the signal from any or all of the bases. Additional controls to format the traces are available through the Window menu under User Preferences... Display pane.
The base call at position 83 is ambiguous in the forward direction, probably
because the irregular spacing challenged the original base caller. To change
the "N" to "C", with your selection still on the consensus base, type "C". Your
edit will correct the consensus and every sequence in the contig at that
position.
Note that SEQUENCHER displays the edited base call in a contrasting color. You are now ready to continue editing your sequence. You can move in your contig from one region of ambiguity to the next using SEQUENCHER ’s navigational tools.
From the Select menu, choose Next Ambiguous Base or use the spacebar to
execute the previous Select command.
SEQUENCHER jumps to the next ambiguity in the contig.
8. Call Heterozygous bases
The next ambiguous base in the consensus is at position 126. One forward sequence calls a T and three forward sequences call a C. A quick look at the chromatogram, however, shows that the forward and the reverse sequences have both C and T peaks
at position 126. This sequence is from a mixed population.
Raise the slider for the 1-Reverse sequence to amplify the signal. Type "Y" in the consensus line at position 126.
In SEQUENCHER, you can manually call each heterozygote one at a time, or you can use the Call Secondary Peaks... function to automatically find the heterozygous bases.
From the Sequence menu, select Call Secondary Peaks...
Change the Minimum lower peak height to 35%.
Click on the Only make changes that result in an ambiguity checkbox to turn it
on.
Click OK.
SEQUENCHER displays a dialog so that you can confirm the changes to 41 base calls.
Click Continue. With your cursor still in the consensus, click the spacebar to select the next
ambiguous base position.
The next ambiguous base position is 320. At this position, you will find that all of the contributing sequences have bold magenta "R"s. The color indicates that the
sequences have been edited, in this case automatically by the Call Secondary Peaks… function. The "R" is the IUPAC abbreviation for a mix of puRines, or "A or G". The "+" under the consensus line flags the ambiguous position.
Continue selecting and editing until all the ambiguities are resolved.
Note that not all of the automatically called bases will be true heterozygotes nor will this function capture all heterozygotes, but it will find all bases that have secondary peaks that are at least 35% of the primary peak.
Close the Contig and Chromatogram windows by clicking on the close buttons in the corners of each.
9. Translate Sequences to Amino Acids
There are numerous potential applications for the Reference Sequence. The Reference Sequence facilitates comparative sequence alignments, defines base numbering, and boosts assembly speed. In this Tour Guide, we will use the Reference Sequence
function to compare the assembled trace sequences to a known text sequence.
From the Project window, select File > Import > Sequencher Project… and
navigate to the Demo Sample Data folder from which you imported the trace files. Click on the file HepC Reference.SPF, then click on the Open button.
The new sequence, AB049090, is now in your project. You can tell that this sequence is already a Reference, because it has an "R" in the icon, and "Ref:" precedes "DNA Fragment" in the Kind column. You can make any sequence a Reference Sequence in SEQUENCHER from the Sequence menu. You can also right click or Ctrl+click on the sequence name to invoke the context sensitive menu. The Reference Sequence tutorial provides more information on the Reference functions.
You can use the same assembly parameters that we used for the autoseq fragments. Note that the current parameters are listed in the Project window just below the button
bar.
Choose Select > Select All and click on the Assemble to Reference button. Click Close to dismiss the Assembly Completed dialog.
The Reference Sequence is now incorporated into Contig[0001].
Double-click on the Contig[0001] icon to display the contig Overview.
Immediately you can see that the Reference contributes to the contig in a different manner than the non-Reference sequences. For example, the numbering of the
sequence, as it appears on the coverage bar, is negative until the Reference begins to contribute. Note the white space at the 3’ end of the coverage bar indicates that the Reference does not contribute coverage at all. The blue bar in the Reference is a
graphic representation of the CDS feature.
When the data that extends beyond the Reference is not of interest, you can trim the contig sequences to the Reference.
From the Contig menu, select the Trim to Reference Sequence command.
Contig[0001] now starts at base position 1. It is useful to view a Reference Sequence when editing a sample sequence. The Reference Sequence will guide you so you can critically examine the differences between the sample consensus and the Reference. Yet the Reference will not contribute to the consensus nor will the Reference be
affected by edits in the consensus.
10. Translate Sequences to Amino Acids
SEQUENCHER provides a variety of ways to display the translation of DNA
sequence. For instance, you can display one or all three of the reading frames below the sequence while editing.
Drag the Selection Marquee in the Overview to the 5’ end of the contig. Click on the Bases button. Place your cursor on base position 1 in the consensus line and click on it. Click on the [ ] button in the lower right corner of the Contig Editor.
SEQUENCHER displays the translated consensus.
The first click changes the [ ] to a [ 1 ], displaying the translation in the first reading frame. This button will toggle through each of the reading frames, followed by the display of all three and then concluding with the display of an [ r ] which displays
both the Reference Sequence translation and the consensus translation in the
Reference Frame.
Click on the upper translation button until it reaches a [ 1 ] to leave the
translation in the first reading frame.
To toggle the display of the Reference Sequence translation, click on the lower
translation button, with an R icon.
Contig With Protein Translation of Consensus and Reference Sequence
11. Annotate a Sequence
SEQUENCHER provides for sequence annotation by allowing you to create a feature for a single base or a range of bases.
If your cursor is not in the consensus, select a base in the consensus. From the menu bar, choose Select > Bases by Number…. In the Select Base dialog, enter 208 in both text boxes. Click OK.
At this position, the sample sequence has a Valine, GTG, and the Reference codes for a Methionine, ATG.
Choose Sequence > Mark Selection As Feature. From the drop down Feature Key menu, choose variation. In the Feature Name box, give your feature a brief name like "A-> G, Met->
Val".
The default Feature Style for a variation is red and underlined, but you may choose any style you wish.
Click OK. If the View > Display Features menu item is not checked, select it. Turn off Display Color Bases from the View menu.
The feature will now be displayed in both the Bases view and the Overview. Note that in the Overview, the red single base feature displays over the range of bases covered
in the blue CDS feature.
Feature in the Bases View
Feature in the Overview
12. Create a Variance Table and Report
After you have edited the sample sequence so that you are confident that the
consensus base calls are correct, you can generate a comparison report of how the sample differs from the Reference.
Close the Contig Editor window. With Contig[0001] selected, choose Contig > Compare Consensus to
Reference.
SEQUENCHER displays a table listing the differences between the consensus
sequence that you edited and the Reference Sequence that you imported. Note that base 208 is still red and underlined.
You can also modify the look of this table.
Click on the symbol that looks like an open elevator button in the bottom left
corner to expand the width of the columns.
From the View menu, choose Colors As Backgrounds.
This report lists the differences between the consensus of one contig and the
Reference, but it is also possible to create this report for hundreds of contigs, when
they share the same Reference.
Variance Table
The Variance Table also acts as a link to the original data.
Double-click on the cell at base position 208.
SEQUENCHER rearranges the windows and opens the Contig and Chromatogram Editors for that base position.
Use the arrow keys on your keyboard to navigate to bases of interest.
Create a new variation feature from the Variance Table at position 338 by
selecting that cell in the Variance Table and then executing the Sequence >
Mark Selection As Feature command.
Click OK.
SEQUENCHER defaults to the variation feature key, because this was the last feature key used. The new feature name is "variation" and it is also red and underlined.
Click on the Reports button to open the Reports dialog
You have several reporting options available to you, so that you can share the
contents of the Variance Table outside of SEQUENCHER. Reporting is disabled in the Demo, but you can view sample reports in SEQUENCHER help. The image
below is the result of creating a Variance Table from the Row Selection for positions 153 – 338.
13. Create a Translated Variance Table
In addition to the Variance Table, which displays the results of the comparison of DNA sequences, Sequencher provides a Translated Variance Table, which displays the differences in the translation of DNA sequences. The following directs you to create the translated "sister" table to the currently open Variance Table, but you can also create a Translated Variance Table directly from a contig by selecting sequence names in the contig or by selecting contigs in a Project window.
While you are still in the Review mode of this Variance Table, click on the
Translation button on the button bar of the Variance Table window.
上一篇:vlookup函数最全详解
下一篇:1008调频对讲收音机制作