Name Last Update
Arabidopsis Loading commit data...
Drosophila Loading commit data...
Escherichia_coli Loading commit data...
Human Loading commit data...
Mouse Loading commit data...
Oryza Loading commit data...
Saccharomyces_cerevisae Loading commit data...
Zebrafish Loading commit data...
all_species Loading commit data...
Readme.txt Loading commit data...
dataset.zip Loading commit data...
human_pir.txt Loading commit data...
Origin data:

** Coding

Take exon sequences

Human: Ensembl 92 with ID in Swiss prot, 45 956
Mouse: Ensembl 92 with ID in Swiss prot, 23 715
Zebrafish: Ensembl 92 with ID in Uniparc (not enough in swiss prot), 41 760
Arabidopsis thaliana : Ensembl plants 38 with ID in Swiss prot, 19 228
Oryza sativa Japonica: Ensembl plants 38 with ID in Uniparc, 42 362
Drosophila (fruitfly): Ensembl 92 with ID in Uniparc, 13 928
Saccharomyces cerevisiae: Ensembl 92 with ID in Swiss prot, 6 684
E.coli : Ensembl 92 with ID in Swiss prot, 4083

** Non-coding

Human: RNAcentral (Gencode), 30 171
Mouse: RNAcentral (Gencode), 17 582
Zebrafish: RNAcentral (Rfam), 13 885
Arabidopsis: RNAcentral (Rfam, RefSeq, TAIR), 7 036
Oryza sativa: RNAcentral (Rfam), 6 076
Drosophila: RNAcentral (Flybase), 3 610
Saccharomyces cerevisiae: RNAcentral(Rfam), 1 355
E.coli : RNAcentral (Rfam), 1 058