sequencher4.5序列拼接软件使用说明

发布时间:2024-11-25

生物圈http:///Tour Guide for Windows and Macintosh Introduction

SEQUENCHER has been developed to work with a wide range of sequencing

applications. For example, SEQUENCHER can be used to:

Create assemblies for shotgun or EST sequencing projects Edit contigs while viewing all relevant trace data Assemble multiple sequences to a user-defined Reference Sequence Detect and annotate polymorphisms Align cDNAs to their genomic sequence using the Large Gap algorithm Discover heterozygous peaks Create difference reports for SNP discovery Display restriction maps, ORF maps, protein translations Automatically trim poor quality and vector sequences

Macintosh and PC Support

SEQUENCHER is available for both Macintosh and PC platforms. The demo CD provides software for both environments.

Unlimited Trial

With the SEQUENCHER demo, you can use your own data and enjoy an unlimited evaluation period. The only functions not available are copying data to other

applications, saving, exporting, printing, and reporting. Some sample data have been included to get you started, but once you have tried SEQUENCHER with the sample files, please try using your own data so you can see precisely what SEQUENCHER can do for you.

What You Will Learn in This Tutorial

The purpose of this tutorial is to guide you through SEQUENCHER‘s core assembly and editing functions. Additional application-specific tutorials are included on your CD in PDF format. In this tutorial, you will

1. Install the SEQUENCHER demo

2. Create a new project

3. Import data

4. Trim sequences

5. Assemble a contig

6. View contig assembly

7. Edit assembled chromatograms

8. Find heterozygotes

9. Work with a Reference Sequence

10. Translate sequences to amino acids

生物圈

11. Annotate a sequence

12. Create a Variance Table and Report

13. Create a Translated Variance Table

14. What else can I do with SEQUENCHER? http:///

Once you have mastered these techniques, you will be ready to explore

SEQUENCHER ’s other powerful features.

Conventions Used in this Guide

Menu items or keys that you are to select are in bold. The purple text provides step-by-step instructions for running through the tour guide and the black text provides additional information. Greater than symbols define menu > submenu commands. Before you start

Check that you have the appropriate hardware and disk space.

Additional Requirements will vary with your project.

MACINTOSH RECOMMENDED REQUIREMENTS:

10.3.9 and higher 512 MB RAM 150 MB hard disk space

WINDOWS MINIMUM REQUIREMENTS:

Windows 2000 and higher 512 MB RAM 150 MB hard disk space

1. Install SEQUENCHER Demo

Insert the demo CD into your machine. Double-click on the Sequencher 4.9 Demo installer icon.

The installer will create a Sequencher 4.9 Demo folder in your Applications (Mac) or Program Files > Gene Codes (Windows) folder.

2. Create a New Project

Launch SEQUENCHER by double-clicking on the Sequencher Demo icon.

On your Macintosh, this will be in the opened Sequencher Demo folder. On a

PC there is a Sequencher Demo icon on your desktop.

When SEQUENCHER is launched, a dialog will alert you that you are running in Demo Mode. After you hit the OK command button on the alert dialog,

生物圈http:///SEQUENCHER will present you with an empty Project window. This is where you

import, manipulate, and display sequence fragments and assembled contigs.

New Project Window

3. Import Data

To import data, select Import > Folder of Sequences... from the File menu.

Browse to the Applications (Mac) or Program Files > Gene Codes (Windows)

then Sequencher 4.9 Demo > Sample Data > Demo Sample Data > Demo

Sequences folder.

Click on the Choose (Mac) or OK (Windows) button at the bottom window.

When prompted to import the 9 files, select the Import All Files in Folder

command button. The Project now contains the 9 sequences.

Imported Sequences

These imported files, with their associated quality scores, are just one example of the wide variety of file types SEQUENCHER accepts for import. Note that the Quality column displays the % quality for each of the imported sequences—the percent of bases that are above the low quality threshold as set in the Confidence User

Preference pane.

4. Trim Sequences

SEQUENCHER has tools that allow you to trim imported sequences based on several different criteria: ambiguous data, data that have low confidence scores, or data

contaminated with vector sequence. The trimmed data are fully recoverable within the SEQUENCHER project. To trim the low confidence sequence:

生物圈

http:///From the menu bar, choose Select > Select All to highlight all sequences if

they aren’t already highlighted.

From the menu bar, choose Sequence > Trim Ends... SEQUENCHER displays the Ends Trimming window for the default trimming parameters.

SEQUENCHER recognizes poor quality sequence based on a number of criteria. The confidence score, provided in these samples, is one of the most sensitive. Increasing the stringency of the trim criteria further increases the quality of your data.

Select the Change Trim Criteria button. Uncheck all but the three criteria checked below, and adjust the values of the

two confidence trims to match.

Click OK to return to the overview for Ends Trimming.

The Ends Trimming window displays how much poor quality data will be trimmed based on the defined criteria. You have the additional option to individually deselect a fragment for trimming on either the 5' or 3' end by removing the "X" from the

appropriate box below the trim graphic.

Ends Trimming

Click the Trim Checked Items button at the top of the window, and then click the Trim button when asked for confirmation.

生物圈http:///The data that you have removed are completely recoverable. SEQUENCHER always stores two copies of every imported Sequence, the original sequence and the data as you have edited it in SEQUENCHER.

Close the Ends Trimming window and return to the Project window by

clicking on the close control in the upper corner of the window.

5. Assemble a Contig

Several alignment algorithms are provided with SEQUENCHER to accommodate the wide variety of assembly applications. For this example, you will use the Dirty Data algorithm because it is best suited for data that may include the occasional

ambiguities or miss-calls generated by automated sequencers. The Assembly

Strategies and the Assemble by Name tutorials in the Sequencher 4.9 Demo\Tutorials

folder explain the other assembly options.

生物圈

http:///Click on the Assembly Parameters button at the top of the Project window. Accept the defaults for the Assembly Algorithm, Minimum Match Percentage, and Minimum Overlap parameters. They should be Dirty Data, 85% and 20

bases, respectively.

Optimize gap placement by selecting Use ReAligner and Prefer 3’ Gap

Placement, if not already selected.

Select OK.

Once you have returned to the Project window, you are ready to begin assembly.

All of the sequence fragments should be selected. If they are not, select them

now using Select > Select All.

Click on the Assemble Automatically button at the top of the Project window. Click Close to dismiss the dialog.

6. View Contig Assembly

The Contig User Preference defines sorting criteria for new assemblies. The default preference sorts the fragments according to position, 5' to 3', within the contig.

SEQUENCHER provides a number of alternative sorting options.

Double-click on the Contig[0001] icon to open the contig Overview window. Click on the Sort button, select the by Strand radio button, and click on OK to

sort the fragments by strand.

Contig Overview

Click on the Sort button, select the by Position radio button, and click OK to

return to the original sorting order.

The Overview contains three sections. The top section displays a schematic of how the fragments are assembled in this contig. The arrows indicate the direction of the fragment in relation to the assembly.

SEQUENCHER provides a Selection Marquee that allows you to navigate from

within the Overview into the Bases window.

The next section provides coverage information. For instance, the consensus called

around base 1206 has less coverage than the surrounding consensus bases.

Below the coverage bar is the open reading frame map. Three bars marked with green

flags and red lines, representing start and stop codons respectively.

Click and drag on the Selection Marquee in the Overview so that it selects the

region around base 1,206..

Select the Bases button at the top of the window to open the Contig Editor and view the base sequences that assemble at this position.

7. Edit Assembled Chromatograms

The Contig Editor provides the tools for checking and editing sequences. It is divided into four quadrants. You can modify the appearance of the Contig Editor from the View menu and in your User Preferences.

Select View > Display Color Bases.

Under Window > User Preferences, change Display > Contig so the Font is

Courier New and the Size is 18.

Close the User Preferences window.

Contig Editor

The two upper panels show the individual fragment names to the left with their

sequences to the right. The Agent Box contains descriptive information about your sequences and your selection. The lower right panel displays consensus information including ambiguities <+> and disagreements < > if there are any. The blue shading describes the confidence, low, medium, and high, with the lighter shades defining higher confidence. Note that you may toggle off this shading by selecting View > Display Base Confidences.

To begin the editing process, move your selection to base one in the consensus. The Select menu provides several tools to navigate to areas of interest in the contig. From the Select menu, choose the command Next Ambiguous Base. The

Agent box reads "4 frag bases selected at consensus position 83," and "Select

Next Ambiguous Base = spacebar".

You have now moved your position in the consensus to the position of the first

ambiguous base. An ambiguous base in the consensus is any that includes: 1) A

contributing fragment base that is not an A, C, G, or T, 2) Disagreements between other fragment bases, and 3) All fragment bases that contribute to the consensus have low quality scores. Position 83 is flagged, because the base called in three of the

fragments is an "N".

To view the chromatograms at this position, while your selection is still on the ambiguous consensus position, select the Show Chromatograms button at the

top of the window.

Assembled Trace Windows

Each chromatogram window displays the current version of the base calls in black and above a line that separates them from the original base calls. At the left of the trace window, SEQUENCHER provides tools that allow you to manipulate how the traces are displayed. The volume control bars allow you to adjust the chromatogram peak height. The A, C, G, and T buttons allow you to turn off the display of the signal from any or all of the bases. Additional controls to format the traces are available through the Window menu under User Preferences... Display pane.

The base call at position 83 is ambiguous in the forward direction, probably

because the irregular spacing challenged the original base caller. To change

the "N" to "C", with your selection still on the consensus base, type "C". Your

edit will correct the consensus and every sequence in the contig at that

position.

Note that SEQUENCHER displays the edited base call in a contrasting color. You are now ready to continue editing your sequence. You can move in your contig from one region of ambiguity to the next using SEQUENCHER ’s navigational tools.

From the Select menu, choose Next Ambiguous Base or use the spacebar to

execute the previous Select command.

SEQUENCHER jumps to the next ambiguity in the contig.

8. Call Heterozygous bases

The next ambiguous base in the consensus is at position 126. One forward sequence calls a T and three forward sequences call a C. A quick look at the chromatogram, however, shows that the forward and the reverse sequences have both C and T peaks

at position 126. This sequence is from a mixed population.

Raise the slider for the 1-Reverse sequence to amplify the signal. Type "Y" in the consensus line at position 126.

In SEQUENCHER, you can manually call each heterozygote one at a time, or you can use the Call Secondary Peaks... function to automatically find the heterozygous bases.

From the Sequence menu, select Call Secondary Peaks...

Change the Minimum lower peak height to 35%.

Click on the Only make changes that result in an ambiguity checkbox to turn it

on.

Click OK.

SEQUENCHER displays a dialog so that you can confirm the changes to 41 base calls.

Click Continue. With your cursor still in the consensus, click the spacebar to select the next

ambiguous base position.

The next ambiguous base position is 320. At this position, you will find that all of the contributing sequences have bold magenta "R"s. The color indicates that the

sequences have been edited, in this case automatically by the Call Secondary Peaks… function. The "R" is the IUPAC abbreviation for a mix of puRines, or "A or G". The "+" under the consensus line flags the ambiguous position.

Continue selecting and editing until all the ambiguities are resolved.

Note that not all of the automatically called bases will be true heterozygotes nor will this function capture all heterozygotes, but it will find all bases that have secondary peaks that are at least 35% of the primary peak.

Close the Contig and Chromatogram windows by clicking on the close buttons in the corners of each.

9. Translate Sequences to Amino Acids

There are numerous potential applications for the Reference Sequence. The Reference Sequence facilitates comparative sequence alignments, defines base numbering, and boosts assembly speed. In this Tour Guide, we will use the Reference Sequence

function to compare the assembled trace sequences to a known text sequence.

From the Project window, select File > Import > Sequencher Project… and

navigate to the Demo Sample Data folder from which you imported the trace files. Click on the file HepC Reference.SPF, then click on the Open button.

The new sequence, AB049090, is now in your project. You can tell that this sequence is already a Reference, because it has an "R" in the icon, and "Ref:" precedes "DNA Fragment" in the Kind column. You can make any sequence a Reference Sequence in SEQUENCHER from the Sequence menu. You can also right click or Ctrl+click on the sequence name to invoke the context sensitive menu. The Reference Sequence tutorial provides more information on the Reference functions.

You can use the same assembly parameters that we used for the autoseq fragments. Note that the current parameters are listed in the Project window just below the button

bar.

Choose Select > Select All and click on the Assemble to Reference button. Click Close to dismiss the Assembly Completed dialog.

The Reference Sequence is now incorporated into Contig[0001].

Double-click on the Contig[0001] icon to display the contig Overview.

Immediately you can see that the Reference contributes to the contig in a different manner than the non-Reference sequences. For example, the numbering of the

sequence, as it appears on the coverage bar, is negative until the Reference begins to contribute. Note the white space at the 3’ end of the coverage bar indicates that the Reference does not contribute coverage at all. The blue bar in the Reference is a

graphic representation of the CDS feature.

When the data that extends beyond the Reference is not of interest, you can trim the contig sequences to the Reference.

From the Contig menu, select the Trim to Reference Sequence command.

Contig[0001] now starts at base position 1. It is useful to view a Reference Sequence when editing a sample sequence. The Reference Sequence will guide you so you can critically examine the differences between the sample consensus and the Reference. Yet the Reference will not contribute to the consensus nor will the Reference be

affected by edits in the consensus.

10. Translate Sequences to Amino Acids

SEQUENCHER provides a variety of ways to display the translation of DNA

sequence. For instance, you can display one or all three of the reading frames below the sequence while editing.

Drag the Selection Marquee in the Overview to the 5’ end of the contig. Click on the Bases button. Place your cursor on base position 1 in the consensus line and click on it. Click on the [ ] button in the lower right corner of the Contig Editor.

SEQUENCHER displays the translated consensus.

The first click changes the [ ] to a [ 1 ], displaying the translation in the first reading frame. This button will toggle through each of the reading frames, followed by the display of all three and then concluding with the display of an [ r ] which displays

both the Reference Sequence translation and the consensus translation in the

Reference Frame.

Click on the upper translation button until it reaches a [ 1 ] to leave the

translation in the first reading frame.

To toggle the display of the Reference Sequence translation, click on the lower

translation button, with an R icon.

Contig With Protein Translation of Consensus and Reference Sequence

11. Annotate a Sequence

SEQUENCHER provides for sequence annotation by allowing you to create a feature for a single base or a range of bases.

If your cursor is not in the consensus, select a base in the consensus. From the menu bar, choose Select > Bases by Number…. In the Select Base dialog, enter 208 in both text boxes. Click OK.

At this position, the sample sequence has a Valine, GTG, and the Reference codes for a Methionine, ATG.

Choose Sequence > Mark Selection As Feature. From the drop down Feature Key menu, choose variation. In the Feature Name box, give your feature a brief name like "A-> G, Met->

Val".

The default Feature Style for a variation is red and underlined, but you may choose any style you wish.

Click OK. If the View > Display Features menu item is not checked, select it. Turn off Display Color Bases from the View menu.

The feature will now be displayed in both the Bases view and the Overview. Note that in the Overview, the red single base feature displays over the range of bases covered

in the blue CDS feature.

Feature in the Bases View

Feature in the Overview

12. Create a Variance Table and Report

After you have edited the sample sequence so that you are confident that the

consensus base calls are correct, you can generate a comparison report of how the sample differs from the Reference.

Close the Contig Editor window. With Contig[0001] selected, choose Contig > Compare Consensus to

Reference.

SEQUENCHER displays a table listing the differences between the consensus

sequence that you edited and the Reference Sequence that you imported. Note that base 208 is still red and underlined.

You can also modify the look of this table.

Click on the symbol that looks like an open elevator button in the bottom left

corner to expand the width of the columns.

From the View menu, choose Colors As Backgrounds.

This report lists the differences between the consensus of one contig and the

Reference, but it is also possible to create this report for hundreds of contigs, when

they share the same Reference.

Variance Table

The Variance Table also acts as a link to the original data.

Double-click on the cell at base position 208.

SEQUENCHER rearranges the windows and opens the Contig and Chromatogram Editors for that base position.

Use the arrow keys on your keyboard to navigate to bases of interest.

Create a new variation feature from the Variance Table at position 338 by

selecting that cell in the Variance Table and then executing the Sequence >

Mark Selection As Feature command.

Click OK.

SEQUENCHER defaults to the variation feature key, because this was the last feature key used. The new feature name is "variation" and it is also red and underlined.

Click on the Reports button to open the Reports dialog

You have several reporting options available to you, so that you can share the

contents of the Variance Table outside of SEQUENCHER. Reporting is disabled in the Demo, but you can view sample reports in SEQUENCHER help. The image

below is the result of creating a Variance Table from the Row Selection for positions 153 – 338.

13. Create a Translated Variance Table

In addition to the Variance Table, which displays the results of the comparison of DNA sequences, Sequencher provides a Translated Variance Table, which displays the differences in the translation of DNA sequences. The following directs you to create the translated "sister" table to the currently open Variance Table, but you can also create a Translated Variance Table directly from a contig by selecting sequence names in the contig or by selecting contigs in a Project window.

While you are still in the Review mode of this Variance Table, click on the

Translation button on the button bar of the Variance Table window.

sequencher4.5序列拼接软件使用说明.doc 将本文的Word文档下载到电脑

    精彩图片

    热门精选

    大家正在看

    × 游客快捷下载通道(下载后可以自由复制和排版)

    限时特价:7 元/份 原价:20元

    支付方式:

    开通VIP包月会员 特价:29元/月

    注:下载文档有可能“只有目录或者内容不全”等情况,请下载之前注意辨别,如果您已付费且无法下载或内容有问题,请联系我们协助你处理。
    微信:fanwen365 QQ:370150219