P083 - GCB 2006 - German Conference on Bioinformatics 2006

https://dl.gi.de/handle/20.500.12116/24199


1 - 10 of 19

Conference paper
Ab initio prediction of molecular fragments from tandem mass spectrometry data
(German Conference on Bioinformatics, 2006) Heinonen, Markus; Rantanen, Ari; Mielikäinen, Taneli; Pitkänen, Esa; Kokkonen, Juha; Rousu, Juho
Mass spectrometry is one of the key enabling measurement technologies for systems biology, due to its ability to quantify molecules in small concentrations. Tandem mass spectrometers tackle the main shortcoming of mass spectrometry, the fact that molecules with an equal mass-to-charge ratio are not separated. In a tandem mass spectrometer, molecules can be fragmented and the intensities of these fragments measured as well. However, this creates a need for methods for identifying the generated fragments. In this paper, we introduce a novel combinatorial approach for predicting the structure of molecular fragments that first enumerates all possible fragment candidates and then ranks them according to the cost of cleaving a fragment from a molecule. Unlike many existing methods, our method does not rely on hand-coded fragmentation rule databases. Our method is able to predict the correct fragmentation of small-to-medium sized molecules with high accuracy.
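The enumerate-then-rank idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define the cost function, so the sketch assumes that the cost of a fragment is the sum of hypothetical per-bond costs over the bonds cut to release it, and that a candidate is any connected atom subset whose mass matches an observed peak. The names `rank_fragments`, `atom_mass`, `bond_cost`, and `tol` are illustrative assumptions, and the brute-force enumeration is only practical for very small molecules.

```python
from itertools import combinations

def connected(atoms, bonds):
    """Return True if the atoms form one connected piece using only bonds inside the set."""
    atoms = set(atoms)
    if not atoms:
        return False
    start = next(iter(atoms))
    seen, stack = {start}, [start]
    while stack:
        a = stack.pop()
        for (u, v) in bonds:
            if u == a and v in atoms and v not in seen:
                seen.add(v)
                stack.append(v)
            elif v == a and u in atoms and u not in seen:
                seen.add(u)
                stack.append(u)
    return seen == atoms

def rank_fragments(atom_mass, bond_cost, target_mass, tol=0.5):
    """Enumerate connected atom subsets matching target_mass, cheapest cut first.

    atom_mass: {atom_id: mass}; bond_cost: {(atom_id, atom_id): assumed cleavage cost}.
    """
    atoms = sorted(atom_mass)
    candidates = []
    for k in range(1, len(atoms)):  # proper, non-empty subsets only
        for subset in combinations(atoms, k):
            mass = sum(atom_mass[a] for a in subset)
            if abs(mass - target_mass) > tol:
                continue
            if not connected(subset, bond_cost):
                continue
            inside = set(subset)
            # A bond is cut when exactly one of its endpoints lies in the fragment.
            cost = sum(c for (u, v), c in bond_cost.items()
                       if (u in inside) != (v in inside))
            candidates.append((cost, subset))
    return sorted(candidates)

# Toy chain of heavy atoms 1-2-3-4 (hydrogens ignored, integer masses, made-up costs).
toy_mass = {1: 12, 2: 12, 3: 16, 4: 12}
toy_bonds = {(1, 2): 2.0, (2, 3): 1.0, (3, 4): 1.5}
ranked = rank_fragments(toy_mass, toy_bonds, target_mass=28)
```

In the toy chain, two connected fragments have mass 28; the one released by cutting a single cheap bond ranks ahead of the one that needs two cuts.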
Conference paper
Annotation-based distance measures for patient subgroup discovery in clinical microarray studies
(German Conference on Bioinformatics, 2006) Lottaz, Claudio; Toedling, Joern; Spang, Rainer
Background: Clustering algorithms are widely used in the analysis of microarray data. In clinical studies, they are often applied to find groups of co-regulated genes. Clustering, however, can also stratify patients by similarity of their gene expression profiles, thereby defining novel disease entities based on molecular characteristics. Several distance-based cluster algorithms have been suggested, but little attention has been given to the choice of the distance measure between patients. Even with the Euclidean metric, including and excluding genes from the analysis leads to different distances between the same objects, and consequently to different clustering results. Methodology: We describe a novel clustering algorithm in which gene selection is used to derive biologically meaningful clusterings of samples. Our method combines expression data and functional annotation data. According to gene annotations, candidate gene sets with specific functional characterizations are generated. Each set defines a different distance measure between patients, and consequently different clusterings. These clusterings are filtered using a novel resampling-based significance measure. Significant clusterings are reported together with the underlying gene sets and their functional definition. Conclusions: Our method reports clusterings defined by biologically focused sets of genes. In annotation-driven clusterings, we have recovered clinically relevant patient subgroups through biologically plausible sets of genes, as well as novel subgroupings. We conjecture that our method has the potential to reveal hitherto unknown, clinically relevant classes of patients in an unsupervised manner.
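The point that each gene set induces its own distance between the same two patients can be shown with a small sketch. It only illustrates the Euclidean metric restricted to a gene subset; the function name `gene_set_distance`, the patient and gene identifiers, and the expression values are made up for illustration, and the resampling-based significance filter from the abstract is not modelled.

```python
import math

def gene_set_distance(profile_a, profile_b, gene_set):
    """Euclidean distance between two expression profiles, using only genes in gene_set."""
    return math.sqrt(sum((profile_a[g] - profile_b[g]) ** 2 for g in gene_set))

# Hypothetical expression profiles for two patients over three genes.
patients = {
    "P1": {"g1": 0.0, "g2": 0.0, "g3": 5.0},
    "P2": {"g1": 3.0, "g2": 4.0, "g3": 5.0},
}

# The same pair of patients under two hypothetical annotation-derived gene sets.
d_set_a = gene_set_distance(patients["P1"], patients["P2"], {"g1", "g2"})
d_set_b = gene_set_distance(patients["P1"], patients["P2"], {"g3"})
```

Under the first gene set the two patients are far apart, while under the second they are indistinguishable, so a clustering built on one set can separate patients that the other set merges.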
Conference paper
Characterization of protein interactions
(German Conference on Bioinformatics, 2006) Küffner, Robert; Duchrow, Timo; Fundel, Katrin; Zimmer, Ralf
Available information on molecular interactions between proteins is currently incomplete with regard to detail and comprehensiveness. Although a number of repositories are already devoted to capturing interaction data, only a small subset of the currently known interactions can be obtained that way. Besides further experiments, knowledge on interactions can only be complemented by applying text extraction methods to the literature. Currently, information to further characterize individual interactions cannot be provided by interaction extraction approaches and is virtually nonexistent in repositories. We present an approach to not only confirm extracted interactions but also to characterize interactions with regard to four attributes such as activation vs. inhibition and protein-protein vs. protein-gene interactions. Here, training corpora with positional annotation of interacting proteins are required. As suitable corpora are rare, we propose an extensible curation protocol to conveniently characterize interactions by manual annotation of sentences so that machine learning approaches can be applied subsequently. We derived a training set by manually reading and annotating 269 sentences for 1090 candidate interactions; 439 of these are valid interactions, predicted via support vector machines at a precision of 83% and a recall of 87%. The prediction of interaction attributes from individual sentences on average yielded a precision of about 85% and a recall of 73%.
Conference paper
Classifying permanent and transient protein interactions
(German Conference on Bioinformatics, 2006) Kottha, Samatha; Schroeder, Michael
Currently much research is devoted to the characterization and classification of transient and permanent protein-protein interactions. From the literature, we take data sets consisting of 161 permanent (65 homodimers, 96 heterodimers) and 242 transient interactions. We collect over 300 interface attributes relating to size, physicochemical properties, interaction propensities, and secondary structure elements. Our major discovery is a surprisingly simple relationship not yet reported in the literature: interactions with the same molecular weight or very big interfaces are permanent and otherwise transient. We train a support vector machine and achieve the following results: Molecular weight difference alone achieves an 80% success rate. Together with the size of the buried surface, the success rate improves to 89%. Adding water at the interface and the number of hydrophobic contacts, we achieve a success rate of 97%.
Conference paper
Combining sequence information with T-coffee
(German Conference on Bioinformatics, 2006) Notredame, Cedric
Conference paper
Comparative analysis of cyclic sequences: Viroids and other small circular RNAs
(German Conference on Bioinformatics, 2006) Mosig, Axel; Hofacker, Ivo L.; Stadler, Peter F.
The analysis of small circular sequences requires specialized tools. While the differences between linear and circular sequences can be neglected in the case of long molecules such as bacterial genomes since in practice all analysis is performed in sequence windows, this is not true for viroids and related sequences which are usually only a few hundred basepairs long. In this contribution we present basic algorithms and corresponding software for circular RNAs. In particular, we discuss the problem of pairwise and multiple cyclic sequence alignments with affine gap costs, and an extension of a recent approach to circular RNA folding to the computation of consensus structures.
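Why circular sequences need their own alignment routine can be seen in a brute-force sketch: the same circular molecule can be written down starting at any position, so a linear alignment penalizes a mere change of starting point. The sketch below is not the algorithm from the paper. It assumes unit-cost edit distance instead of affine gap costs, handles only the pairwise case, and simply tries every rotation of the first sequence; the function names are illustrative.

```python
def edit_distance(a, b):
    """Unit-cost edit distance between two linear sequences (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # match or substitute
        prev = cur
    return prev[-1]

def cyclic_edit_distance(a, b):
    """Smallest edit distance between b and any rotation of a (brute force over rotations)."""
    if not a:
        return len(b)
    return min(edit_distance(a[k:] + a[:k], b) for k in range(len(a)))

linear = edit_distance("ACGU", "GUAC")          # treats the rotation as real differences
circular = cyclic_edit_distance("ACGU", "GUAC") # recognizes the same circular sequence
```

Trying all rotations multiplies the cost of one linear alignment by the sequence length, which is tolerable only because the sequences in question are a few hundred bases long.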
Conference paper
Comparison of human protein-protein interaction maps
(German Conference on Bioinformatics, 2006) Futschik, Matthias E.; Chaurasia, Gautam; Wanker, Erich; Herzel, Hanspeter
Large-scale mappings of protein-protein interactions have started to give us new views of the complex molecular mechanisms inside a cell. After initial projects to systematically map protein interactions in model organisms such as yeast, worm and fly, researchers have begun to focus on the mapping of the human interactome. To tackle this enormous challenge, different approaches have been proposed and pursued. While several large-scale human protein interaction maps have recently been published, their quality remains to be critically assessed. We present here a first comparative analysis of eight currently available large-scale maps with a total of over 10000 unique proteins and 57000 interactions included. They are based on literature search, orthology, or yeast two-hybrid assays. Comparison reveals only a small, but statistically significant overlap. More importantly, our analysis gives clear indications that all interaction maps suffer from selection and detection biases. These results have to be taken into account for future assembly of the human interactome.
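Measuring the overlap between two interaction maps can be sketched as a set comparison. The sketch assumes that interactions are undirected, so that (A, B) and (B, A) count as the same interaction, and it reports the number of shared interactions together with the Jaccard index. The protein names are placeholders, and the test of statistical significance mentioned in the abstract is not reproduced here.

```python
def normalize(interactions):
    """Represent each undirected interaction as a frozenset so that pair order is irrelevant."""
    return {frozenset(pair) for pair in interactions}

def overlap(map_a, map_b):
    """Return (number of shared interactions, Jaccard index) for two interaction maps."""
    a, b = normalize(map_a), normalize(map_b)
    shared = a & b
    union = a | b
    jaccard = len(shared) / len(union) if union else 0.0
    return len(shared), jaccard

# Two hypothetical maps with placeholder protein names.
map_literature = [("A", "B"), ("B", "C"), ("C", "D")]
map_y2h = [("B", "A"), ("D", "E")]
shared_count, jaccard = overlap(map_literature, map_y2h)
```

Here one of four distinct interactions is shared. Protein identifiers would have to be mapped to a common namespace before such a comparison is meaningful across real data sets.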
Conference paper
Docking protein domains using a contact map representation
(German Conference on Bioinformatics, 2006) Lise, Stefano; Jones, David
Conference paper
Encoding evolvability: The hierarchical language of polyketide synthase protein interactions
(German Conference on Bioinformatics, 2006) Thattai, Mukund
Conference paper
Functional evaluation of domain-domain interactions and human protein interaction networks
(German Conference on Bioinformatics, 2006) Schlicker, Andreas; Huthmacher, Carola; Ramírez, Fidel; Lengauer, Thomas; Albrecht, Mario
Large amounts of protein and domain interaction data are being produced by experimental high-throughput techniques and computational approaches. To gain insight into the value of the provided data, we used our new similarity measure based on the Gene Ontology to evaluate the molecular functions and biological processes of interacting proteins or domains. The applied measure particularly addresses the frequent annotation of proteins or domains with multiple Gene Ontology terms. Using our similarity measure, we compare predicted domain-domain and human protein-protein interactions with experimentally derived interactions. The results show that our similarity measure is of significant benefit in quality assessment and confidence ranking of domain and protein networks. We also derive useful confidence score thresholds for dividing domain interaction predictions into subsets of low and high confidence.
