|
|
|
|
Working with Electronic Sequence Databases
Electronic Sequence Formats
Converting Electronic Sequences
A full listing of Conversion
Programs can be found at the Pasteur Institute. The most
commonly used of these is Readseq,
available at a number of locations:
Pasteur
(Advanced;
Java) | IUBio
| BIMAS | Download
(Java) | Baylor
Exercises
A. A Mammoth Undertaking
- Search Entrez
protein using the keyword "mammoth"
- What confounding results show up?
- Do you get useful results searching Entrez
nucleotide?
- Repeat the search at SRS. Are
there any differences?
B. Format Conversion
- Search Entrez
protein for accession number AY044919
- Search Entrez for
the biglycan (BGN) gene in the Indian elephant (Elephas maximus)
- Download one of the sequences in GenBank format
- Convert the sequence to FASTA format using ReadSeq
- Look for the OMIM entry for BGN
C. Tasmanian Taxonomy
- Search the Taxonomy
Browserfor the Tasmanian Tiger
- Click on the
Metatheria (Marsupial) link in the lineage of the tiger. Get the
nucleotide sequences
- Refine the Entrez query to get only cytochrome b sequences
- Download the set in FASTA format
- Browse through the lineage to get an outgroup
- See what other extinct
taxa are available
- See what LinkOut resources are available for
Drosophila melanogaster
D. Batch Entrez
- Construct a batch file of Indian elephant biglycan genes
- Retrieve the sequences using Batch
Entrez
- Using this batch
file, obtain biglycan genes for the Elephantidae
E. Looking for Lubber
- Search Entrez for "lubber"
- Is there a human homolog for the first protein listed?
- Use the BLink to see other promising areas of exploration
- Investigate the pheromone-related genes -- are there any Bombyx
homologs?
- In Entrez Structure, search for 1GM0
References
- GenBank, Benson et al., NAR
- LocusLink
- Database Resources
of NCBI, 2001
- Human Genome
Resources at NCBI Factsheet
- Gibbs AJ & McIntyre GA. 1970. The diagram, a method for comparing sequences. Its use with amino acid and nucleotide sequences.
Eur J Biochem 1970 16(1):1-11 PMID:
5456129
- Peter D. Karp, Suzanne
Paley, and Jingchun Zhu. Database verification studies of SWISS-PROT and GenBank.
Bioinformatics 2001 17: 526-532
|
|
|
|
|