Genomika: analýza a algoritmy - Cvičení 1

Připravit si základní sadu programů. Budeme je používat během celého semestru a budou se vám hodit během celé kariéry, připravte si je dobře:

Nejlépe by měly programy číst ze standardního vstupu a tamtéž poslat výstup. Tak, aby bylo možno použít konstrukce podobné těmto:

cat mm_pax6-plain.txt | reformat_DNA | statistic_DNA
cat mm_pax6.fasta | get_region 286 1596 | translate >mm_pax6_cds.aa

Příští týden zjistíme, kdo napsal programy nejlépe a ty se stanou naším zlatým standardem, tak do toho.

Data

Genetic code

Base1  = TTTTTTTTTTTTTTTTCCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAGGGGGGGGGGGGGGGG
Base2  = TTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGG
Base3  = TCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAG
AAs    = FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG
	
Standard Code = FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG
Vertebrate Mitochondrial Code = FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIMMTTTTNNKKSS**VVVVAAAADDEEGGGG
Yeast Mitochondrial Code = FFLLSSSSYY**CCWWTTTTPPPPHHQQRRRRIIMMTTTTNNKKSSRRVVVVAAAADDEEGGGG
Mold, Protozoan, Coelenterate Mitochondrial and Mycoplasma/Spiroplasma Code = FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG
Invertebrate Mitochondrial Code = FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIMMTTTTNNKKSSSSVVVVAAAADDEEGGGG
Ciliate, Dasycladacean and Hexamita Nuclear Code = FFLLSSSSYYQQCC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG
Echinoderm and Flatworm Mitochondrial Code = FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIIMTTTTNNNKSSSSVVVVAAAADDEEGGGG
Euplotid Nuclear Code = FFLLSSSSYY**CCCWLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG
Bacterial, Archaeal and Plant Plastid Code = FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG
Alternative Yeast Nuclear Code = FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG
Ascidian Mitochondrial Code = FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIMMTTTTNNKKSSGGVVVVAAAADDEEGGGG
Alternative Flatworm Mitochondrial Code = FFLLSSSSYYY*CCWWLLLLPPPPHHQQRRRRIIIMTTTTNNNKSSSSVVVVAAAADDEEGGGG
Blepharisma Nuclear Code = FFLLSSSSYY*QCC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG
Chlorophycean Mitochondrial Code = FFLLSSSSYY*LCC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG
Trematode Mitochondrial Code = FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIMMTTTTNNNKSSSSVVVVAAAADDEEGGGG
Scenedesmus obliquus mitochondrial Code = FFLLSS*SYY*LCC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG
Thraustochytrium Mitochondrial Code = FF*LSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG
Pterobranchia mitochondrial code = FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSSKVVVVAAAADDEEGGGG
Candidate Division SR1 and Gracilibacteria Code = FFLLSSSSYY**CCGWLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG
      

IUB Nucleotide Codes

Code Definition Mnemonic
AAdenineA
CCytosineC
GGuanineG
TThymineT
RAGpuRine
YCTpYrimidine
KGTKeto
MACaMino
SGCStrong
WATWeak
BCGTNot A
DAGTNot C
HACTNot G
VACGNot T
NAGCTaNy

Files:


Time-stamp: <2019-10-02 10:36:59 (hpaces)>