1. Crick FH: The Complementary Structure of DNA. Proc Natl Acad Sci U S A 1954, 40(8):756-758.
2. Watson JD, Crick FH: Molecular structure of nucleic acids; a structure for deoxyribose nucleic acid. Nature 1953, 171(4356):737-738.
3. Crick FH: On protein synthesis. Symp Soc Exp Biol 1958, 12:138-163.
4. Crick FH, Barnett L, Brenner S, Watts-Tobin RJ: General nature of the genetic code for proteins. Nature 1961, 192:1227-1232.
5. Crick FH, Brenner S, Klug A, Pieczenik G: A speculation on the origin of protein synthesis. Origins of life 1976, 7(4):389-397.
6. Hanson AA, Rogan EG, Cavalieri EL: Synthesis of adducts formed by iodine oxidation of aromatic hydrocarbons in the presence of deoxyribonucleosides and nucleobases. Chem Res Toxicol 1998,
11(10):1201-1208.
7. Rutledge LR, Durst HF, Wetmore SD: Computational comparison of the stacking interactions between the aromatic amino acids and the natural or (cationic) methylated nucleobases. Phys Chem Chem Phys 2008,
10(19):2801-2812.
8. Cysewski P, Szefler B: Environment influences on the aromatic character of nucleobases and amino acids. J Mol Model 2010, 16(11):1709-1720. 9. Chalikian TV, Volker J, Plum GE, Breslauer KJ: A more unified picture
for the thermodynamics of nucleic acid duplex melting: a characterization by calorimetric and volumetric techniques. Proc Natl Acad Sci U S A 1999, 96(14):7853-7858.
10. Patton JT, Spencer E: Genome replication and packaging of segmented double-stranded RNA viruses. Virology 2000, 277(2):217-225.
11. Kong A, Gudbjartsson DF, Sainz J, Jonsdottir GM, Gudjonsson SA, Richardsson B, Sigurdardottir S, Barnard J, Hallbeck B, Masson G et al: A high-resolution recombination map of the human genome. Nat Genet
2002, 31(3):241-247.
12. Paux E, Sourdille P, Salse J, Saintenac C, Choulet F, Leroy P, Korol A, Michalak M, Kianian S, Spielmeyer W et al: A physical map of the 1- gigabase bread wheat chromosome 3B. Science 2008, 322(5898):101-104. 13. Burge CB, Karlin S: Finding the genes in genomic DNA. Curr Opin Struct
Biol 1998, 8(3):346-354.
14. Lake JA: Origin of the eukaryotic nucleus: eukaryotes and eocytes are genotypically related. Canadian journal of microbiology 1989, 35(1):109-118. 15. Ribeiro S, Golding GB: The mosaic nature of the eukaryotic nucleus. Mol
Biol Evol 1998, 15(7):779-788.
16. Dolan MF, Melnitsky H, Margulis L, Kolnicki R: Motility proteins and the origin of the nucleus. The Anatomical record 2002, 268(3):290-301.
17. Huber MD, Gerace L: The size-wise nucleus: nuclear volume control in eukaryotes. The Journal of cell biology 2007, 179(4):583-584.
18. Archambault J, Friesen JD: Genetics of eukaryotic RNA polymerases I, II, and III. Microbiological reviews 1993, 57(3):703-724.
19. Roeder RG: Nuclear RNA polymerases: role of general initiation factors and cofactors in eukaryotic transcription. Methods Enzymol 1996, 273:165- 171.
20. Cramer P, Armache KJ, Baumli S, Benkert S, Brueckner F, Buchen C, Damsma GE, Dengl S, Geiger SR, Jasiak AJ et al: Structure of eukaryotic RNA polymerases. Annual review of biophysics 2008, 37:337-352.
21. Black DL: Mechanisms of alternative pre-messenger RNA splicing.
Annu Rev Biochem 2003, 72:291-336.
22. Jurica MS, Moore MJ: Pre-mRNA splicing: Awash in a sea of proteins.
Molecular Cell 2003, 12(1):5-14.
23. Crick F: Split genes and RNA splicing. Science 1979, 204(4390):264-271. 24. Murray V, Holliday R: Mechanism for RNA splicing of gene transcripts.
FEBS Lett 1979, 106(1):5-7.
25. Hastings ML, Krainer AR: Pre-mRNA splicing in the new millennium.
Curr Opin Cell Biol 2001, 13(3):302-309.
26. Oliver JL, Bernaola-Galvan P, Carpena P, Roman-Roldan R: Isochore chromosome maps of eukaryotic genomes. Gene 2001, 276(1-2):47-56. 27. Delcher AL, Harmon D, Kasif S, White O, Salzberg SL: Improved
microbial gene identification with GLIMMER. Nucleic Acids Res 1999,
27(23):4636-4641.
28. Majoros WH, Pertea M, Salzberg SL: TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics 2004,
20(16):2878-2879.
29. Allen JE, Majoros WH, Pertea M, Salzberg SL: JIGSAW, GeneZilla, and GlimmerHMM: puzzling out the features of human genes in the ENCODE regions. Genome Biol 2006, 7 Suppl 1:S9 1-13.
30. Lukashin AV, Borodovsky M: GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res 1998, 26(4):1107-1115.
31. Besemer J, Borodovsky M: Heuristic approach to deriving models for gene finding. Nucleic Acids Res 1999, 27(19):3911-3920.
32. Salamov AA, Solovyev VV: Ab initio gene finding in Drosophila genomic DNA. Genome Res 2000, 10(4):516-522.
33. Solovyev V, Salamov A: The Gene-Finder computer tools for analysis of human and model organisms genome sequences. Proc Int Conf Intell Syst Mol Biol 1997, 5:294-302.
34. Schmidt T, Frishman D: Assignment of isochores for all completely sequenced vertebrate genomes using a consensus. Genome Biology 2008,
9(6).
35. Zhang W, Wu W, Lin W, Zhou P, Dai L, Zhang Y, Huang J, Zhang D:
Deciphering heterogeneity in pig genome assembly Sscrofa9 by isochore and isochore-like region analyses. PLoS One 2010, 5(10):e13303. 36. Oliver JL, Carpena P, Hackenberg M, Bernaola-Galvan P: IsoFinder:
computational prediction of isochores in genome sequences. Nucleic Acids Res 2004, 32(Web Server issue):W287-292.
37. Rabiner LR: A Tutorial on Hidden Markov-Models and Selected Applications in Speech Recognition. P Ieee 1989, 77(2):257-286.
38. Newberg LA: Error statistics of hidden Markov model and hidden Boltzmann model results. BMC Bioinformatics 2009, 10:212.
39. Salzberg SL, Pertea M, Delcher AL, Gardner MJ, Tettelin H: Interpolated Markov models for eukaryotic gene finding. Genomics 1999, 59(1):24-31. 40. Krogh A, Brown M, Mian IS, Sjolander K, Haussler D: Hidden Markov
models in computational biology. Applications to protein modeling. J Mol Biol 1994, 235(5):1501-1531.
genes. Nature Reviews Genetics 2002, 3(9):698-709.
42. Aurrecoechea C, Brestelli J, Brunk BP, Fischer S, Gajria B, Gao X, Gingle A, Grant G, Harb OS, Heiges M et al: EuPathDB: a portal to eukaryotic pathogen databases. Nucleic Acids Res 2010, 38(Database issue):D415-419. 43. Rombel IT, Sykes KF, Rayner S, Johnston SA: ORF-FINDER: a vector for
high-throughput gene identification. Gene 2002, 282(1-2):33-41.
44. Stifanic M, Batel R: Genscan for Arabidopsis is a valuable tool for predicting sponge coding sequences. Biologia 2007, 62(2):124-127.
45. Burge C, Karlin S: Prediction of complete gene structures in human genomic DNA. J Mol Biol 1997, 268(1):78-94.
46. Birney E, Durbin R: Dynamite: A flexible code generating language for dynamic programming methods used in sequence comparison. Ismb-97 - Fifth International Conference on Intelligent Systems for Molecular Biology, Proceedings 1997:56-64.
47. Sadi MS, Sami AZM, Ahmed IU, Ruhunnabi ABM, Das N:
Bioinformatics: Implementation of a proposed upgraded Smith- Waterman Algorithm for local alignment. Cibcb: 2009 Ieee Symposium on Computational Intelligence in Bioinformatics and Computational Biology
2009:87-91.
48. Guigo R, Agarwal P, Abril JF, Burset M, Fickett JW: An assessment of gene prediction accuracy in large DNA sequences. Genome Res 2000,
10(10):1631-1642.
49. Liu Q, Mackey AJ, Roos DS, Pereira FC: Evigan: a hidden variable model for integrating gene evidence for eukaryotic gene prediction.
Bioinformatics 2008, 24(5):597-605.
50. Stanke M, Morgenstern B: AUGUSTUS: a web server for gene prediction in eukaryotes that allows user-defined constraints. Nucleic Acids Research
2005, 33:W465-W467.
51. Stanke M, Keller O, Gunduz I, Hayes A, Waack S, Morgenstern B:
AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Research 2006, 34:W435-W439.
52. Keller O, Kollmar M, Stanke M, Waack S: A novel hybrid gene prediction method employing protein multiple sequence alignments. Bioinformatics
2011, 27(6):757-763.
53. Wang Z, Gerstein M, Snyder M: RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 2009, 10(1):57-63.
54. Pauling L, Corey RB, Branson HR: The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. Proc Natl Acad Sci U S A 1951, 37(4):205-211.
55. Hansen JL, Schmeing TM, Moore PB, Steitz TA: Structural insights into peptide bond formation. P Natl Acad Sci USA 2002, 99(18):11670-11675. 56. Payne JW: Peptides and micro-organisms. Advances in microbial physiology
1976, 13:55-113.
57. von Heijne G: The signal peptide. J Membr Biol 1990, 115(3):195-201.
58. von Heijne G: The structure of signal peptides from bacterial lipoproteins. Protein Eng 1989, 2(7):531-534.
59. Paetzel M, Karla A, Strynadka NC, Dalbey RE: Signal peptidases. Chem Rev 2002, 102(12):4549-4580.
60. von Heijne G: A new method for predicting signal sequence cleavage sites. Nucleic Acids Res 1986, 14(11):4683-4690.