Figures & data
FIGURE 1 A fragment of the annotation path for CDC28 in GO molecular function. Indentation on a line denotes refinement of a parent term on the previous line by the child term; in this example, every child has an “is_a” edge to its parent. The numbers in brackets indicate the number of gene products annotated to the GO term. Note the multiple inheritance from the parents on lines 5 and 9 to the term GO:0004672 on lines 6 and 10 and its descendants. CDC28 is annotated to the term GO:0004693 on lines 6 and 12.
![FIGURE 1 A fragment of the annotation path for CDC28 in GO molecular function. Indentation on a line denotes refinement of a parent term on the previous line by the child term; in this example, every child has an “is_a” edge to its parent. The numbers in brackets indicate the number of gene products annotated to the GO term. Note the multiple inheritance from the parents on lines 5 and 9 to the term GO:0004672 on lines 6 and 10 and its descendants. CDC28 is annotated to the term GO:0004693 on lines 6 and 12.](/cms/asset/2f8e7098-89a2-4f42-8f0a-e3fd314a9412/uaai_a_445223_o_f0001g.gif)
TABLE 1 Tests on GO Categories Where One is More General Than the Other are Not Independent—See Text for Details
FIGURE 2 Example of a DAG-structured ontology (such as GO, but with a single edge type) and associated objects (such as genes) annotated to its terms.
![FIGURE 2 Example of a DAG-structured ontology (such as GO, but with a single edge type) and associated objects (such as genes) annotated to its terms.](/cms/asset/49deac6a-61f8-4c90-8127-f13bf098fa6f/uaai_a_445223_o_f0002g.gif)
TABLE 2 Coverage Matrix for the Example in Figure
FIGURE 5 Web page to combine gene ontology data with other datasets. Along top of the page are tabs to select a dataset type, such as microarray data or interactions. On the left-hand side, the user can select the mode of activity, such as uploading new data, browsing a dataset, or constructing an integrated set of multirelational data. On the right-hand side are shown the current components of the dataset being constructed. In this example, the GO biological process ontology is being integrated with oxidative stress data.
![FIGURE 5 Web page to combine gene ontology data with other datasets. Along top of the page are tabs to select a dataset type, such as microarray data or interactions. On the left-hand side, the user can select the mode of activity, such as uploading new data, browsing a dataset, or constructing an integrated set of multirelational data. On the right-hand side are shown the current components of the dataset being constructed. In this example, the GO biological process ontology is being integrated with oxidative stress data.](/cms/asset/109b67e7-5bcc-4823-88e9-a2afb94d089e/uaai_a_445223_o_f0005g.gif)
FIGURE 6 Accuracy of predicting protein expression given six microarray datasets (see text for details). The dotted horizontal line shows the baseline accuracy of 61% obtained by simply predicting the majority class for all genes.
![FIGURE 6 Accuracy of predicting protein expression given six microarray datasets (see text for details). The dotted horizontal line shows the baseline accuracy of 61% obtained by simply predicting the majority class for all genes.](/cms/asset/fbb1bb36-64e1-48df-87f3-5570d6af89fe/uaai_a_445223_o_f0006g.gif)
FIGURE 7 Accuracy of predicting protein expression given cellular component (C), molecular function (F), and biological process (P) subontologies of the gene ontology. Four feature selections and construction methods are compared for each subontology (see text for details). The dotted horizontal line shows the baseline accuracy of 61% obtained by simply predicting the majority class for all genes.
![FIGURE 7 Accuracy of predicting protein expression given cellular component (C), molecular function (F), and biological process (P) subontologies of the gene ontology. Four feature selections and construction methods are compared for each subontology (see text for details). The dotted horizontal line shows the baseline accuracy of 61% obtained by simply predicting the majority class for all genes.](/cms/asset/2ee61565-ef78-4b1a-abc6-8cf53676b89c/uaai_a_445223_o_f0007g.gif)
FIGURE 8 A decision tree for protein induction repression learned with gene ontology features. Ovals are attribute tests (“Peroxide t” means microarray data at time t), classifications are at leaves. See text for details.
![FIGURE 8 A decision tree for protein induction repression learned with gene ontology features. Ovals are attribute tests (“Peroxide t” means microarray data at time t), classifications are at leaves. See text for details.](/cms/asset/e07b850a-b92d-463e-a0f1-9bccdaf07049/uaai_a_445223_o_f0008g.gif)
FIGURE 9 Accuracy of predicting general vs. specific deletant sensitivity to multiple stresses given cellular component (C), molecular function (F), and biological process (P) subontologies of the gene ontology. Three feature selection and construction methods are compared for each subontology (see text for details). The dotted horizontal line shows the baseline accuracy of 60% obtained by simply predicting the majority class for all genes.
![FIGURE 9 Accuracy of predicting general vs. specific deletant sensitivity to multiple stresses given cellular component (C), molecular function (F), and biological process (P) subontologies of the gene ontology. Three feature selection and construction methods are compared for each subontology (see text for details). The dotted horizontal line shows the baseline accuracy of 60% obtained by simply predicting the majority class for all genes.](/cms/asset/06b9f1ab-6850-423b-aff7-2d5ec2eb4b79/uaai_a_445223_o_f0009g.gif)