Skip to main content

Biocuration 2017 Conference Highlights

Several members of the Monarch Initiative convened upon sunny Palo Alto, CA for the International Society for Biocuration (ISB) annual meeting. The meeting, held at Stanford University from March 26-29, 2017, brought together experts in the field of biocuration and included presentations, posters, and workshops on the topics of curating biological data, database creation and maintenance, community annotation, and education. Between brief, but glorious, moments of sun soaking (reminder: several of us are from rainy Portland), we enjoyed meeting our fellow biocurators and presenting talks and posters. Here’s a brief overview of the presentations from four Monarch team members, including links to the slides and posters in case you missed them!


MelissaH_circle.png
Melissa Haendel, Monarch co-PI, presented a talk titled “How Open is Open?” discussing the open science principles of FAIR TLC. Here, FAIR TLC stands for making data: Findable, Accessible, Interoperable, Reusable, Traceable, Licensed, and Connected.  Melissa discussed how we can use the FAIR TLC principles to evaluate open biological databases and repositories and go beyond traditional evaluation metrics, such as publication numbers. You can view her talk slides here: bit.ly/biocurMH.


Lilly_circle.png
New Monarch team member, Lilly Winfree, presented in the precision medicine session on how Monarch uses various ontologies to semantically integrate genotype and phenotype data from multiple species with a goal of disease diagnosis for patients. Lilly explained how Monarch semantically integrates this data using the ontologies GENO (for genotypes) and SEPIO (Scientific Evidence and Provenance Information Ontology), which have been spearheaded by Monarch ontologist Matt Brush. Lilly also showed a use case of the Exomiser tool, which is currently being utilized by Genomics England to identify pathogenic variants. You can view Lilly’s slides here.

ChrisM_circle.png

Chris Mungall, Monarch co-PI, was awarded the ISB Exceptional Contributions to Biocuration Award. Congrats, Chris! In his acceptance talk, Chris described his career path, including his brief stint as a chicken farmer! Or maybe not quite a chicken farmer...you’ll have to ask Chris for the details. We also learned that Chris and fellow award-winner Marc Feuermann share a love of science fiction, and have both authored sci-fi stories.

Nicole_circle.png

Nicole Vasilevsky, Monarch project manager, presented two posters, which both won a best poster award at the conference! Nicole also won an award for a community annotating contest, hosted by GigaScience, using the iCLiKVAL and Hypothesis.is tools. Great work, Nicole!

Nicole’s poster titled “Training future biocurators through data science trainings and open educational resources” was co-authored by several OHSU faculty: Ted Laderas (DMICE), Jackie Wirz, Bjorn Pederson (DMICE), David Dorr (DMICE), Bill Hersh (DMICE), Shannon McWeeney (DMICE) and Melissa Haendel. The poster, available here, described development of in-person data science trainings offered as short courses to OHSU students, and the development of Open Educational Resources (OERs) that are available online (dmice.ohsu.edu/bd2k). Conference attendees were particularly interested in the BD2K tutorials on topics related to biocuration (such as BDK05 on Data Standards and BDK12 on Data annotation and Curation), as there is a lack of formal training in biocuration. Several biocuration efforts discussed at the conference involved crowd-sourced efforts, so these tutorials will be useful for training contributors to these community databases. I encourage you to check out these interesting tutorials!

The second award-winning poster titled “A need for better data sharing policies: a review of data sharing policies in biomedical journals” described a project led by Robin Champieux and co-authored by Nicole, Jessica Minnier (from the OHSU-PSU School of Public Health) and Melissa Haendel, and is available here. This poster described an analysis of biomedical journal data sharing policies. It is widely agreed that data sharing is important for ensuring transparency of research results and scientific reproducibility (and data sharing will certainly facilitate biocuration efforts to extract information from the literature into databases). This analysis showed that approximately 40% of journals (in our sample) either required or strongly encouraged data sharing upon publication. The data from this analysis is shared here (which includes a list of journals that require or encourage data sharing) and we hope that researchers will publish in those journals that require data sharing. A preprint is available here, and the manuscript has been accepted for publication in PeerJ and will be available soon.

Photos from the conference can be viewed
here.


Biocuration F1000 channel presentations:
https://f1000research.com/channels/biocuration

Written by Nicole Vasilevsky and Lilly Winfree
TweetMapBiocuration.png
TweetMapBiocuration2.png

Popular posts from this blog

Finally, a medical terminology that patients, doctors, and machines can all understand.

By Nicole Vasilevsky, Mark Engelstad, Erin Foster, Julie McMurry, Chris Mungall, Peter Robinson, Sebastian Köhler, Melissa Haendel
For many patients with rare and undiagnosed diseases, getting an accurate diagnosis, or even finding the appropriate experts is a long and winding road. To accelerate and facilitate this process, we developed a medical vocabulary (“HPO”) which is comprised of 12,000 terms that doctors can use to codify the precise and distinct observations about patients and their conditions. The HPO is structured in a way that enables machines to intelligently compare a patient’s profile with what scientists worldwide have already uncovered about diseases and their genetic causes.
Until now, most of the HPO labels and synonyms were composed of clinical terms unfamiliar to patients. For example, a patient may know they are ‘color-blind’, but may not be familiar with the clinical term ‘Dyschromatopsia’. This is why we developed a layer of 5,000 corresponding terms that can b…

Why cross-species phenomics informatics is critical to the PMI

Genomics, electronic health records, participant-provided data, sensors, and mobile health technologies can all contribute to personalized medicine. However, we currently cannot achieve statistical correlations amongst these almost unlimited number of parameters that will be collected by the PMI and the depth of mechanistic understanding that will be required for treatment stratification and the development of novel, targeted therapies. The promise of personalized medicine requires deep knowledge of the relationships between genotype, phenotype, and environmental variables - but we simply don’t have enough data. For example, in the ExAC database there are 3,230 genes with near-complete depletion of predicted protein-truncating variants, where 72% of these genes having no currently established human disease phenotype. If we look across organisms, we see that of these 2311 genes with unknown causal phenotypes/diseases, 88% have an associated phenotype in an ortholog, with 56% having or…

Save the Date: Symposium on Linking Disease Model Phenotypes to Human Conditions

Monarch is co-hosting a NIH Symposium titled “Linking Disease Model Phenotypes to Human Conditions” on September 10-11, 2015 at the Fishers Lane Auditorium, NIH, Rockville, MD. 
The purpose of the meeting is to convene a colloquium on the current status of Phenomics and its role in closing the gap that exists between biomedical research and clinical medical practice. The wealth of whole organism, cellular, and molecular data generated in the research laboratory must be translated into clinically relevant knowledge that enables the physician to make the best possible treatment decisions. Phenomics is gaining momentum due to the availability of the complete genomes for many organisms as well as higher throughput methods to genetically modify model organism genomes and observe and record phenotypes. Disease models comprise some of the most important tools of biomedical research. The efficacy of the use of disease models is based upon the principles of evolutionary conservation between sp…