- Art Gallery -

Human genome

The human genome is the genome of Homo sapiens, which is stored on 23 chromosome pairs. Twenty-two of these are autosomal chromosome pairs, while the remaining pair is sex-determining. The haploid human genome occupies a total of just over 3 billion DNA base pairs. The Human Genome Project (HGP) produced a reference sequence of the euchromatic human genome, which is used worldwide in biomedical sciences.

The haploid human genome contains ca. 23,000 protein-coding genes, far fewer than had been expected before its sequencing.[1][2] In fact, only about 1.5% of the genome codes for proteins, while the rest consists of non-coding RNA genes, regulatory sequences, introns, and (controversially named) "junk" DNA.[3]



There are estimated to be between 20,000 and 25,000 human protein-coding genes. The estimate of the number of human genes has been repeatedly revised down from initial predictions of 100,000 or more as genome sequence quality and gene finding methods have improved.

Surprisingly, the number of human genes seems to be less than a factor of two greater than that of many much simpler organisms, such as the roundworm and the fruit fly. However, human cells make extensive use of alternative splicing to produce several different proteins from a single gene, and the human proteome is thought to be much larger than those of the aforementioned organisms.[citation needed] Besides, most human genes have multiple exons, and human introns are frequently much longer than the flanking exons.[citation needed]

Human genes are distributed unevenly across the chromosomes. Each chromosome contains various gene-rich and gene-poor regions, which seem to be correlated with chromosome bands and GC-content. The significance of these nonrandom patterns of gene density is not well understood.[citation needed] In addition to protein coding genes, the human genome contains thousands of RNA genes, including tRNA, ribosomal RNA, microRNA, and other non-coding RNA genes.[citation needed]

Regulatory sequences

The human genome has many different regulatory sequences which are crucial to controlling gene expression. These are typically short sequences that appear near or within genes. A systematic understanding of these regulatory sequences and how they together act as a gene regulatory network is only beginning to emerge from computational, high-throughput expression and comparative genomics studies. Some types of non-coding DNA are genetic "switches" that do not encode proteins, but do regulate when and where genes are expressed.[4]

Identification of regulatory sequences relies in part on evolutionary conservation. The evolutionary branch between the primates and mouse, for example, occurred 70–90 million years ago.[5] So computer comparisons of gene sequences that identify conserved non-coding sequences will be an indication of their importance in duties such as gene regulation.[6]

Another comparative genomic approach to locating regulatory sequences in humans is the gene sequencing of the puffer fish. These vertebrates have essentially the same genes and regulatory gene sequences as humans, but with only one-eighth the "junk" DNA. The compact DNA sequence of the puffer fish makes it much easier to locate the regulatory genes.[7]

Other DNA

Protein-coding sequences (specifically, coding exons) comprise less than 1.5% of the human genome.[3] Aside from genes and known regulatory sequences, the human genome contains vast regions of DNA the function of which, if any, remains unknown. These regions in fact comprise the vast majority, by some estimates 97%, of the human genome size. Much of this is composed of:

Repeat elements

  • Tandem repeats
    • Satellite DNA
    • Minisatellite
    • Microsatellite
  • Interspersed repeats
    • SINEs
    • LINEs


  • Retrotransposons
    • LTR
      • Ty1-copia
      • Ty3-gypsy
    • Non-LTR
      • SINEs
      • LINEs
  • DNA Transposons

Junk DNA

However, there is also a large amount of sequence that does not fall under any known classification. Much of this sequence may be an evolutionary artifact that serves no present-day purpose, and these regions are sometimes collectively referred to as "junk" DNA. There are, however, a variety of emerging indications that many sequences within are likely to function in ways that are not fully understood. Recent experiments using microarrays have revealed that a substantial fraction of non-genic DNA is in fact transcribed into RNA,[8] which leads to the possibility that the resulting transcripts may have some unknown function. Also, the evolutionary conservation across the mammalian genomes of much more sequence than can be explained by protein-coding regions indicates that many, and perhaps most, functional elements in the genome remain unknown.[9] The investigation of the vast quantity of sequence information in the human genome whose function remains unknown is currently a major avenue of scientific inquiry.[10]

Information content

The 3 billion base pairs of the haploid human genome correspond to about 750 megabytes of data, since every base pair can be coded by 2 bits. The entropy rate of the genome differs significantly between coding and non-coding sequences. It is close to the maximum of 2 bits per base pair for the coding sequences (about 45 million base pairs), but less for the non-coding parts. It ranges between 1.5 and 1.9 bits per base pair for the individual chromosome, except for the Y chromosome, which has an entropy rate below 0.9 bits per base pair.[11]

Information content of the haploid human genome by chromosome:

total (XY) total (XX) 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 X Y
million base pairs 3,080 3,022 247 243 199 191 181 171 159 146 140 135 134 132 114 106 100 89 79 76 63 62 47 50 155 58
megabytes (raw data) 770 756 61.8 60.7 49.9 47.8 45.2 42.7 39.7 36.6 35.1 33.9 33.6 33.1 28.5 26.6 25.1 22.2 19.7 19.0 16.0 15.6 11.7 12.4 38.7 14.4
megabytes in zipped
Human Genome Project file[12]
827 819 65.1 68.2 57.4 52.3 51.3 48.8 45.3 38.6 33.9 39.1 39.8 38.8 28.8 26.5 22.9 22.5 22.7 22.2 16.4 18.9 10.4 10.4 38.6 8.0
entropy rate in bits per base pair
(Liu, Venkatesh and Maley 2008)
1.70 1.71 1.82 1.80 1.82 1.82 1.83 1.82 1.81 1.83 1.59 1.83 1.84 1.59 1.56 1.53 1.66 1.82 1.87 1.58 1.86 1.82 1.62 1.83 1.80 0.84


DNA sequencing determines the order of the nucleotide bases in a genome.


The Human Genome Project and a parallel project by Celera Genomics each produced and published a haploid human genome sequence, both of which were a composite of the DNA sequence of several individuals.[13]

See also: Personal genomics

A personal genome sequence is a complete sequencing of the chemical base pairs that make up the DNA of a single person. Because medical treatments have different effects on different people because of genetic variations such as single-nucleotide polymorphisms (SNPs), the analysis of personal genomes may lead to personalized medical treatment based on individual genotypes.

The completion of the fifth such map was announced in December 2008. The genome mapped was that of a Korean researcher Seong-Jin Kim. Genome maps had previously been completed for Craig Venter of the U.S. in 2007, James Watson of the U.S. in April 2008, and Yang Huanming of China in November 2008 and Dan Stoicescu in January 2008.[14][15][16]

Personal genomes had not been sequenced in the Human Genome Project to protect the identity of volunteers who provided DNA samples. That sequence was derived from the DNA of several volunteers from a diverse population.[17] Another distinction is that the HGP sequence is haploid, however, the sequence maps for Venter and Watson for example are diploid, representing both sets of chromosomes.

Kim’s genome had 1.58 million SNPs that had never been reported before and indicates that six out of 10,000 DNA bases are unique to Koreans. Kim's sequence map can be used to assist in building a standard Korean genome, which can then be used to compare the genomes of other Korean individuals for personalized medical treatments.


Whereas a genome sequence lists the order of every DNA base in a genome, a genome map identifies the landmarks. A genome map is less detailed than a genome sequence and aids in navigating around the genome.[18][19]


An example of a variation map is the HapMap being developed by the International HapMap Project. The HapMap is a haplotype map of the human genome, "which will describe the common patterns of human DNA sequence variation."[20] It catalogs the patterns of small-scale variations in the genome that involve single DNA letters, or bases.

Researchers published the first sequence-based map of large-scale structural variation across the human genome in the journal Nature in May 2008.[21][22] Large-scale structural variations are differences in the genome among people that range from a few thousand to a few million DNA bases; some are gains or losses of stretches of genome sequence and others appear as re-arrangements of stretches of sequence. These variations include differences in the number of copies individuals have of a particular gene, deletions, translocations and inversions.

Main articles: Human genetic variation and Human genetic clustering

Most studies of human genetic variation have focused on single-nucleotide polymorphisms (SNPs), which are substitutions in individual bases along a chromosome. Most analyses estimate that SNPs occur on average somewhere between every 1 in 100 and 1 in 300 base pairs in the euchromatic human genome, although they do not occur at a uniform density. Thus follows the popular statement that "we are all, regardless of race, genetically 99.9% the same",[23] although this would be somewhat qualified by most geneticists. For example, a much larger fraction of the genome is now thought to be involved in copy number variation.[24] A large-scale collaborative effort to catalog SNP variations in the human genome is being undertaken by the International HapMap Project.

The genomic loci and length of certain types of small repetitive sequences are highly variable from person to person, which is the basis of DNA fingerprinting and DNA paternity testing technologies. The heterochromatic portions of the human genome, which total several hundred million base pairs, are also thought to be quite variable within the human population (they are so repetitive and so long that they cannot be accurately sequenced with current technology). These regions contain few genes, and it is unclear whether any significant phenotypic effect results from typical variation in repeats or heterochromatin.

Most gross genomic mutations in gamete germ cells probably result in inviable embryos; however, a number of human diseases are related to large-scale genomic abnormalities. Down syndrome, Turner Syndrome, and a number of other diseases result from nondisjunction of entire chromosomes. Cancer cells frequently have aneuploidy of chromosomes and chromosome arms, although a cause and effect relationship between aneuploidy and cancer has not been established.

Genetic disorders
For more details on this topic, see Genetic disorder.
This article needs attention from an expert on the subject. See the talk page for details. WikiProject Medicine or the Medicine Portal may be able to help recruit an expert. (April 2008)

Most aspects of human biology involve both genetic (inherited) and non-genetic (environmental) factors. Some inherited variation influences aspects of our biology that are not medical in nature (height, eye color, ability to taste or smell certain compounds, etc). Moreover, some genetic disorders only cause disease in combination with the appropriate environmental factors (such as diet). With these caveats, genetic disorders may be described as clinically defined diseases caused by genomic DNA sequence variation. In the most straightforward cases, the disorder can be associated with variation in a single gene. For example, cystic fibrosis is caused by mutations in the CFTR gene, and is the most common recessive disorder in caucasian populations with over 1,300 different mutations known. Disease-causing mutations in specific genes are usually severe in terms of gene function, and are fortunately rare, thus genetic disorders are similarly individually rare. However, since there are many genes that can vary to cause genetic disorders, in aggregate they comprise a significant component of known medical conditions, especially in pediatric medicine. Molecularly characterized genetic disorders are those for which the underlying causal gene has been identified, currently there are approximately 2,200 such disorders annotated in the OMIM database.[25]

Studies of genetic disorders are often performed by means of family-based studies. In some instances population based approaches are employed, particularly in the case of so-called founder populations such as those in Finland, French-Canada, Utah, Sardinia, etc. Diagnosis and treatment of genetic disorders are usually performed by a geneticist-physician trained in clinical/medical genetics. The results of the Human Genome Project are likely to provide increased availability of genetic testing for gene-related disorders, and eventually improved treatment. Parents can be screened for hereditary conditions and counselled on the consequences, the probability it will be inherited, and how to avoid or ameliorate it in their offspring.

As noted above, there are many different kinds of DNA sequence variation, ranging from complete extra or missing chromosomes down to single nucleotide changes. It is generally presumed that much naturally occurring genetic variation in human populations is phenotypically neutral, i.e. has little or no detectable effect on the physiology of the individual (although there may be fractional differences in fitness defined over evolutionary time frames). Genetic disorders can be caused by any or all known types of sequence variation. To molecularly characterize a new genetic disorder, it is necessary to establish a causal link between a particular genomic sequence variant and the clinical disease under investigation. Such studies constitute the realm of human molecular genetics.

With the advent of the Human Genome and International HapMap Project, it has become feasible to explore subtle genetic influences on many common disease conditions such as diabetes, asthma, migraine, schizophrenia, etc. Although some causal links have been made between genomic sequence variants in particular genes and some of these diseases, often with much publicity in the general media, these are usually not considered to be genetic disorders per se as their causes are complex, involving many different genetic and environmental factors. Thus there may be disagreement in particular cases whether a specific medical condition should be termed a genetic disorder.

See also: Human evolution and Chimpanzee Genome Project

Comparative genomics studies of mammalian genomes suggest that approximately 5% of the human genome has been conserved by evolution since the divergence of extant lineages approximately 200 million years ago, containing the vast majority of genes.[10][9] Intriguingly, since genes and known regulatory sequences probably comprise less than 2% of the genome, this suggests that there may be more unknown functional sequence than known functional sequence. A smaller, yet large, fraction of human genes seem to be shared among most known vertebrates. The published chimpanzee genome differs from that of the human genome by 1.23% in direct sequence comparisons.[26] Around 20% of this figure is accounted for by variation within each species, leaving only ~1.06% consistent sequence divergence between humans and chimps at shared genes.[27]. This nucleotide by nucleotide difference is dwarfed, however, by the portion of each genome that is not shared, including around 6% of functional genes that are unique to either humans or chimps.[28] In other words, the considerable observable differences between humans and chimps may be due as much or more to genome level variation in the number, function and expression of genes rather than DNA sequence changes in shared genes. On average, a typical human protein-coding gene differs from its chimpanzee ortholog by only two amino acid substitutions; nearly one third of human genes have exactly the same protein translation as their chimpanzee orthologs. A major difference between the two genomes is human chromosome 2, which is equivalent to a fusion product of chimpanzee chromosomes 12 and 13[29] (later renamed to chromosomes 2A and 2B, respectively).

Humans have undergone an extraordinary loss of olfactory receptor genes during our recent evolution, which explains our relatively crude sense of smell compared to most other mammals. Evolutionary evidence suggests that the emergence of color vision in humans and several other primate species has diminished the need for the sense of smell.[30]

Mitochondrial genome

The human mitochondrial genome, while usually not included when referring to the "human genome", is of tremendous interest to geneticists, since it undoubtedly plays a role in mitochondrial disease. It also sheds light on human evolution; for example, analysis of variation in the human mitochondrial genome has led to the postulation of a recent common ancestor for all humans on the maternal line of descent. (see Mitochondrial Eve)

Due to the lack of a system for checking for copying errors, Mitochondrial DNA (mtDNA) has a more rapid rate of variation than nuclear DNA. This 20-fold increase in the mutation rate allows mtDNA to be used for more accurate tracing of maternal ancestry. Studies of mtDNA in populations have allowed ancient migration paths to be traced, such as the migration of Native Americans from Siberia or Polynesians from southeastern Asia. It has also been used to show that there is no trace of Neanderthal DNA in the European gene mixture inherited through purely maternal lineage.[31]

See also: Epigenetics

Epigenetics are a variety of features of the human genome that transcend its primary DNA sequence, such as chromatin packaging, histone modifications and DNA methylation, and which are important in regulating gene expression, genome replication and other cellular processes. Basically they are marks on DNA that are influenced within an individual's own life. Over-eating for years will change a person's epigenetics in such a way that children, grandchildren, and the next few generations will be predisposed to a shorter and unhealthier life.[citation needed] Drugs are being researched that can change one's epigenetics, and which when fully developed will be able to "shut off" certain diseases and disorders. Overall epigenetics strengthen and weaken certain genes but are not an actual part of the DNA.[32][33]
See also

* Craig Venter#Individual human genome sequenced
* Eugenics
* Eukaryotic chromosome fine structure
* Genetic distance
* Genomic organization
* Human genetic engineering
* Human Genome Sequencing Center
* Noncoding DNA
* The Genographic Project
* Y-chromosomal Adam


1. ^ Evolutionary Trajectories of Primate Genes Involved in HIV Pathogenesis, 2 September 2009
2. ^ International Human Genome Sequencing Consortium (2004). "Finishing the euchromatic sequence of the human genome.". Nature 431 (7011): 931–45. doi:10.1038/nature03001. PMID 15496913. [1]
3. ^ a b International Human Genome Sequencing Consortium (2001). "Initial sequencing and analysis of the human genome.". Nature 409 (6822): 860–921. doi:10.1038/35057062. PMID 11237011. [2]
4. ^ Carroll, Sean B. et al. (May 2008). "Regulating Evolution", Scientific American, pp. 60–67.
5. ^ Nei M, Xu P, Glazko G (2001). "Estimation of divergence times from multiprotein sequences for a few mammalian species and several distantly related organisms.". Proc Natl Acad Sci USA 98 (5): 2497–502. doi:10.1073/pnas.051611498. PMID 11226267. PMC 30166. http://www.pnas.org/cgi/content/full/051611498.
6. ^ Loots G, Locksley R, Blankespoor C, Wang Z, Miller W, Rubin E, Frazer K (2000). "Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons.". Science 288 (5463): 136–40. doi:10.1126/science.288.5463.136. PMID 10753117. Summary
7. ^ Meunier, Monique. "Genoscope and Whitehead announce a high sequence coverage of the Tetraodon nigroviridis genome". Genoscope. http://www.cns.fr/externe/English/Actualites/Presse/261001_1.html. Retrieved 2006-09-12.
8. ^ "...a tiling array with 5-nucleotide resolution that mapped transcription activity along 10 human chromosomes revealed that an average of 10% of the genome (compared to the 1 to 2% represented by bona fide exons) corresponds to polyadenylated transcripts, of which more than half do not overlap with known gene locations.Claverie J (2005). "Fewer genes, more noncoding RNA.". Science 309 (5740): 1529–30. doi:10.1126/science.1116800. PMID 16141064.
9. ^ a b "...the proportion of small (50-100 bp) segments in the mammalian genome that is under (purifying) selection can be estimated to be about 5%. This proportion is much higher than can be explained by protein-coding sequences alone, implying that the genome contains many additional features (such as untranslated regions, regulatory elements, non-protein-coding genes, and chromosomal structural elements) under selection for biological function." Mouse Genome Sequencing Consortium (2002). "Initial sequencing and comparative analysis of the mouse genome.". Nature 420 (6915): 520–62. doi:10.1038/nature01262. PMID 12466850.
10. ^ a b The ENCODE Project Consortium (2007). ""Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project"". Nature 447 (7146): 799–816. doi:10.1038/nature05874. PMID 17571346.
11. ^ Zhandong Liu, Santosh S Venkatesh and Carlo C Maley, Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples, BMC Genomics 2008, 9:509, doi:10.1186/1471-2164-9-509, fig. 6, using the Lempel-Ziv estimators of entropy rate.
12. ^ at Project Gutenberg[3], zipped plaintext file, includes header information. These are generally larger than the theoretically possible values, and in fact usually larger than the number of bytes of raw binary data.
13. ^ "PLoS Biology: A New Human Genome Sequence Paves the Way for Individualized Genomics". Biology.plosjournals.org. doi:10.1371/journal.pbio.0050266. http://biology.plosjournals.org/perlserv/?request=get-document&doi=10.1371/journal.pbio.0050266&ct=1. Retrieved 2009-05-31.
14. ^ Updated Dec.5,2008 08:25 KST (2008-12-05). "Digital Chosunilbo (English Edition) : Daily News in English About Korea". English.chosun.com. http://english.chosun.com/w21data/html/news/200812/200812050003.html. Retrieved 2009-05-31.
15. ^ "Kbs World". Rki.kbs.co.kr. 2008-12-11. http://rki.kbs.co.kr/english/news/news_science_detail.htm?no=3623. Retrieved 2009-05-31.
16. ^ Harmon, Amy (2008-03-04). "Gene Map Becomes a Luxury Item". The New York Times. http://www.nytimes.com/2008/03/04/health/research/04geno.html. Retrieved 2010-04-02.
17. ^ "Human Genome Project Completion: Frequently Asked Questions". genome.gov. http://www.genome.gov/11006943. Retrieved 2009-05-31.
18. ^ "What's a Genome?". Genomenewsnetwork.org. 2003-01-15. http://www.genomenewsnetwork.org/resources/whats_a_genome/Chp3_1.shtml. Retrieved 2009-05-31.
19. ^ NCBI_user_services (2004-03-29). "Mapping Factsheet". Ncbi.nlm.nih.gov. http://www.ncbi.nlm.nih.gov/About/primer/mapping.html. Retrieved 2009-05-31.
20. ^ "About the Project". HapMap. http://www.hapmap.org/abouthapmap.html. Retrieved 2009-05-31.
21. ^ "2008 Release: Researchers Produce First Sequence Map of Large-Scale Structural Variation in the Human Genome". genome.gov. http://www.genome.gov/27026113. Retrieved 2009-05-31.
22. ^ Nature. "Mapping and sequencing of structural variation from eight human genomes : Abstract". Nature. doi:10.1038/nature06862. http://www.nature.com/nature/journal/v453/n7191/abs/nature06862.html. Retrieved 2009-05-31.
23. ^ from Bill Clinton's 2000 State of the Union address
24. ^ Nature. "Global variation in copy number in the human genome : Article : Nature". Nature<!. http://www.nature.com/nature/journal/v444/n7118/full/nature05329.html. Retrieved 2009-08-09.
25. ^ Online Mendelian Inheritance in Man (OMIM)
26. ^ "We calculate the genome-wide nucleotide divergence between human and chimpanzee to be 1.23%, confirming recent results from more limited studies. The Chimpanzee Sequencing and Analysis Consortium (2005). "Initial sequence of the chimpanzee genome and comparison with the human genome.". Nature 437 (7055): 69–87. doi:10.1038/nature04072. PMID 16136131.
27. ^ "we estimate that polymorphism accounts for 14–22% of the observed divergence rate and thus that the fixed divergence is ~1.06% or less. The Chimpanzee Sequencing and Analysis Consortium (2005). "Initial sequence of the chimpanzee genome and comparison with the human genome.". Nature 437 (7055): 69–87. doi:10.1038/nature04072. PMID 16136131.
28. ^ "Our results imply that humans and chimpanzees differ by at least 6% (1,418 of 22,000 genes) in their complement of genes, which stands in stark contrast to the oft-cited 1.5% difference between orthologous nucleotide sequences." Demuth J, De Bie T, Stajich JE, Cristianini N, Hahn MW (2006). "The Evolution of Mammalian Gene Families.". PLoS ONE 1 (1): E85. doi:10.1371/journal.pone.0000085. PMID 17183716.
29. ^ "Human chromosome 2 resulted from a fusion of two ancestral chromosomes that remained separate in the chimpanzee lineage" The Chimpanzee Sequencing and Analysis Consortium (2005). "Initial sequence of the chimpanzee genome and comparison with the human genome.". Nature 437 (7055): 69–87. doi:10.1038/nature04072. PMID 16136131.
"Large-scale sequencing of the chimpanzee genome is now imminent."Olson M, Varki A (2003). "Sequencing the chimpanzee genome: insights into human evolution and disease.". Nat Rev Genet 4 (1): 20–8. doi:10.1038/nrg981. PMID 12509750.
30. ^ "Our findings suggest that the deterioration of the olfactory repertoire occurred concomitant with the acquisition of full trichromatic color vision in primates." Gilad Y, Wiebe V, Przeworski M, Lancet D, Pääbo S (2004). "Loss of olfactory receptor genes coincides with the acquisition of full trichromatic vision in primates.". PLoS Biol 2 (1): E5. doi:10.1371/journal.pbio.0020005. PMID 14737185.
31. ^ Sykes, Bryan (2003-10-09). "Mitochondrial DNA and human history". The Human Genome. http://genome.wellcome.ac.uk/doc_WTD020876.html. Retrieved 2006-09-19.
32. ^ "Misteli". Cell. doi:10.1016/j.cell.2007.01.028. http://www.cell.com/content/article/abstract?uid=PIIS0092867407001262. Retrieved 2009-05-31.
33. ^ "Bernstein et al.". Cell. doi:10.1016/j.cell.2007.01.033. http://www.cell.com/content/article/abstract?uid=PIIS0092867407001286. Retrieved 2009-05-31.

* Carroll, Sean B. et al. (May 2008). "Regulating Evolution". Scientific American. New York: Scientific American Inc.. pp. 60–67.
* Lindblad-Toh K, et al. (2005). "Genome sequence, comparative analysis and haplotype structure of the domestic dog.". Nature 438 (7069): 803–19. doi:10.1038/nature04338. PMID 16341006. http://www.nature.com/nature/journal/v438/n7069/abs/nature04338.html.

External links

* The National Human Genome Research Institute
* Ensembl The Ensembl Genome Browser Project
* National Library of Medicine human genome viewer
* UCSC Genome Browser.
* Human Genome Project.
* Sabancı University School of Languages Podcasts What makes us different from chimpanzees? by Andrew Berry (MP3 file)
* The National Office of Public Health Genomics
* New findings challenge established views about human genome

Retrieved from "http://en.wikipedia.org/"
All text is available under the terms of the GNU Free Documentation License