
Freely-Speaking: Biology and Big Data

Computational sciences at the interface with the hard sciences are gaining in importance, as shown by this year's Nobel Prize in Chemistry. I also recently read two commentaries, in Nature and Nature Biotechnology, that I found interesting.



The first article [1] discusses the need for systems that can handle "Big Data" in biology. Despite that need, no widely adopted system has emerged yet. The author attributes this to three problems:
  • Biological data exists in a variety of changing formats.
  • Commercial systems may impose steps that do not match how data is recorded or how scientists work, or add extra steps for converting data from one format to another, arguably to keep scientists from using rival systems.
  • Many home-grown solutions were not written robustly and are poorly annotated, which makes them difficult to modify.
The second article [2] responds to the growing gap between the generation of biological data and its capture, storage, annotation, and retrieval across different systems. It is a commentary on interviews that Nature Biotechnology conducted with the authors of successful computational biology software.

The article concludes that there appears to be a communication gap between dry- and wet-bench biology: wet-bench biologists often underappreciate computation and the craft of software engineering, while computational scientists are often oblivious to the need to make their tools accessible and comprehensible to the wider biology audience.

Both articles suggest that a change in mentality on both sides, and closer collaboration between them, will be necessary to arrive at successful software solutions for big-data problems in biology. They describe very well the experiences I have had over the last two years, and the solutions they suggest are mostly in line with what I think the solutions ought to be. I am happy to be working towards solutions in this space together with my team.

Literature Cited:

[1] Boyle J. "Biology must develop its own big-data systems". Nature 499, 7 (04 July 2013). Last visited: 2013-10-13.
[2] "In need of an upgrade". Nature Biotechnology 31, 837 (2013). Last visited: 2013-10-13.
