Stein Gives Bioinformatics Ten Years to Live
by Daniel H. Steinberg02/05/2003
Lincoln Stein's keynote at the O'Reilly Bioinformatics Technology Conference was provocatively titled "Bioinformatics: Gone in 2012." Despite the title, Stein is optimistic about the future for people doing bioinformatics. But he explained that "the field of bioinformatics will be gone by 2012. The field will be doing the same thing but it won't be considered a field." His address looked at what bioinformatics is and what its future is likely to be in the context of other scientific disciplines. He also looked at career prospects for people doing bioinformatics and provided advice for those looking to enter the field.
What Is Bioinformatics: Take One
Stein, of the Cold Spring Harbor Laboratory, began his keynote by examining what is meant by bioinformatics. In the past such a talk would begin with a definition displayed from an authoritative dictionary. The modern approach is to appeal to an FAQ from an authoritative web site. Take a look at the FAQ at bioinformatics.org and you'll find several definitions. Stein summarized Fedj Tekaia of the Institut Pasteur--that bioinformatics is DNA and protein analysis. Stein also summarized Richard Durbin of the Sanger Institute--that bioinformatics is managing data sets.
Stein's first pass at a definition of bioinformatics is that it is "Biologists using computers or the other way around." He followed by observing that whatever it is, it's growing. He showed the results of performing searches in published papers of the last 20 years for keywords in titles of abstracts. As a baseline, over the last two decades the use of "cell" has roughly doubled while the use of "genome" has gone up by a factor of 10 from 1,000 papers per year to 10,000 papers per year. There were essentially no occurrences of "bioinformatics" before 1992. Since then it has grown to three orders of magnitude up to its current rate of 1,000 papers per year.
|
O'Reilly Bioinformatics Technology Conference Coverage |
Is Bioinformatics Really a Field?
The field has meetings, journals, and books. The problem, according to Stein, is that it is a tool and not a scientific discipline. Tools get absorbed into the greater disciplines. There are examples of disciplines defined by a problem domain contrasted with services defined by tools.
Robust scientific disciplines are often defined by a problem domain. For example, a development biologist studies the development of multicellular organisms using what ever tools are at hand. They aren't defined by the tools they use. A pharmacologist studies the interactions of chemicals with physiological properties. Similarly, physicists aren't defined by their tools; they study the nature of matter and energy.
On the other hand, services are defined by tools and they are often time-limited. For example, a microscopist knows how to use microscopes. Now that a microscope is a ubiquitous tool you won't find many specialists in this area. While a pharmacologist has a problem domain, a pharmacist knows how to compound medicines and fill out regulatory paperwork. There are fields that cross over. Stein offered molecular biology as an example of a scientific discipline that has transitioned to a service.
What Is Bioinformatics: Take Two
One of Stein's tests for a discipline is the "Department Of" test. Take your favorite field or service and prepend it with your favorite institution's name, followed by "Department of". For example, he is quite happy with the phrase "the Harvard Department of Genetics." On the other hand, a "Department of Microscopy" seems to him to fit better at an Institute of Technology. He said that for him, a Department of Bioinformatics has the same feel and he doesn't predict the establishment of bioinformatics departments.
Stein returned to the question, what is bioinformatics? In light of his thoughts on services defined by tools and disciplines defined by problem, his answer was simple. Bioinformatics is just one way of studying biology. Whether you think of bioinformatics as High Throughput Biology, Integrative Biology, or Large Data Set Biology, fundamentally Stein argues that bioinformatics is biology.
|
Related Reading
|
Later an audience member asked Stein, "Is it strange when biologists never touch goosh: body parts, liquids.... Steins answer was, "No, they're studying life. Biologists like Ernest Mayer can sit in his office and look at other people's data and develop theories of selection. When people ask me, I say I'm a biologist."
Stein answered another question on whether biologists should be required to take introductory programming classes by saying, "Yes, the computer has become a central tool for biology like the microscope or the centrifuge. Being able to produce your own software for data analysis should be part of the undergraduate and graduate curriculum. In answer to a related question about computational biology, Stein answered that "it is algorithm development. It is a specialized discipline. I think it's a branch of CS."
How Do You Make It in Bioinformatics?
Two years ago there was a huge bubble in bioinformatics with students with BA's in biology who knew a little Perl or Java, or CS people with some biology getting offers for $50,000 to $60,000 per year for entry-level positions from pharmaceuticals and bio tech. More recently, the market has settled down. The 2002 New Scientist salary survey reports the median income for academic positions in bioinformatics is $75,000. This is comparable to the numbers for clinical biologists and slightly better than the numbers for cell biologists.
Stein has some simple advice for how you make it in bioinformatics:
- Learn biology. Investigate the problem domain for bioinformatics.
- Pick a problem that interests you. Don't just follow where you think the hot topic is or what seems to be an easy problem. Consider what you are willing to spend the next decade or two or the rest of your life working on.
- Know your tools. Don't treat your tools as black boxes. Understand how they work and what their limitations are. With a microscope you don't need to know optics, but you should know something about light paths, magnification, and resolution. Don't be afraid to use non-computer tools. Don't find the problems that fit your tools.
- Don't be ghettoized. If you expect to be a scientist and to be doing research then don't come in just to perform services to apply your tools to other people's problems. If you want to write software and provide a service, that's great but do so deliberately.
- Do it because you love it.
After taking time to look back at his own career, Stein advised the audience that "there is an event where you find your true avocation. It's easy to find yourself going down the wrong path. There's no shame in turning back." But, Stein said, when it comes to bioinformatics, "It's the biology stupid".
Daniel H. Steinberg is a podcaster, author, editor, trainer, and developer at Dim Sum Thinking. He co-authored the book Zero Configuration Networking: The Definitive Guide.
Return to the Bioinformatics Technology Conference Coverage Page..
You must be logged in to the O'Reilly Network to post a talkback.
Showing messages 1 through 8 of 8.
-
50-60k is an average starting salary
2003-04-30 08:53:35 anonymous2 [Reply | View]
for CS people anyway.
The real question is: 10 years from now who which title will be in more demand in academia and industry - computer science phd or bioinformatics phd?
-
Bzzzt! Wrong...
2003-03-13 22:09:28 anonymous2 [Reply | View]
A few critiques...
1. It's "Ernst Mayr" _not_ "Ernest Mayer." If you're going to talk about one of the major figures of the Darwinian synthesis at least get the frickin' names right....
2. Mayr spent many years studying birds in the Pacific (mostly Papua New Guinea if I remember correctly). The assertion that he sat "in his office and look[sic] at other people's data and develop theories of selection" is totally bogus. This is symptomatic of not knowing the literature and the history of the field, and people who make such statements come off looking foolish. Given that he's now almost 100 years old, I think we can forgive Mayr for "sitting in his office" as of late....
3. Otherwise I generally agree with Stein's comments about bioinformatics; the tools and approaches will simply be absorbed into the field of biology. Like any technologically driven advances the novelty will wear off. Twenty years on nobody feels the need to explain how PCR works. There's been lots of stuff published in bioinformatics/genomics recently that is completely and absolutely derivative, and should really end up in 2nd or 3rd tier journals. Simply because it has the shine of "genomics" about it, it gets published in fairly high profile pubs. Those days are numbered. Get in on it while you can... ;)
-
Bzzzt! Wrong...
2003-03-14 04:16:15 Daniel H. Steinberg [Reply | View]
Dear Anonymous poster,
I apologize for the first point. That is my mistake and not the speaker's. I am not a biologist and, although I ran the article by several scientists before publishing, the mistake is mine. The hostility in your anonymous is understandable.
The second point is the speaker's and I'm not sure that you and he disagree. Whether or not Mayr spent many years in the field, the speakers point was that much of his contribution was due to his analysis of data collected by others. He was making a positive point about Mayr and pointing out that biology is not limited to the laboratory or field collection of data. Mayr doesn't have to be out in the field during his 90's to continue to contribute to his field.
Daniel
-
Bzzz! Wrong....
2003-03-13 21:46:57 anonymous2 [Reply | View]
OK, in part a critique of the article and in part a critique of some of the quotes attributed to Stein...
1. It's "Ernst Mayr" not "Ernest Mayer." If you're going to talk about one of the "big names" in the Darwinian synthesis at least get the frickin' names right.
2. Mayr spent many years working in the Pacific (Papua New Guinea being one of his major field sites) studying birds. In no way is it justifiable to say that he sat "in his office and look at other people's data and develop theories of selection." This is utter bullshit, and reveals the sort of fallacies that get propagated when people don't know the field. Please read the literature before you spout off....
3. Otherwise I agree with Stein's point about bioinformatics simply being absorbed into "biology". This is true of many of the "hot" tools of today, like microarrays. 20 years on, nobody feels the need to explain how PCR works... -
so nice you posted twice?
2003-03-14 04:20:22 Daniel H. Steinberg [Reply | View]
Dear Anonymous poster
I assume that this is a second posting by the same person. It is again unsigned and more offensive than the first. Your last two sentences in the second point are inappropriate for polite discourse.
Daniel
-
longevity of bioinformatics
2004-09-24 05:03:17 wjbug [Reply | View]
Hello All,
First, let me pass on thanks to O'Reilly for sponsoring this conference and specifically to Mr. Steinberg for passing on his view of Dr. Lincoln Steins presentation.
I think there's a fairly easy way to decide what tone is appropriate for posting comments to a venue your scientific colleagues are likely to peruse.
I) Express yourself in a manner commensurate with an editorial comment in a scientific journal - e.g., would you want to be quoted in a primary scientific publication such as 'Nature' and 'Cell' or a scientific news journal such as 'Science News'.
II) Would you use the same tone were you to have attended the conference and made a similar comment directly to the speaker.
We must assume the statement you are commenting on was put out there by Dr. Stein himself, rather than being an editorial comment made by the author of this article, as it appears in quotes:
"No, they're studying life. Biologists like Ernest Mayer can sit in his office and look at other people's data and develop theories of selection. When people ask me, I say I'm a biologist."
Had the author gone to the web in order to find the correct spelling of Dr. Mayr's name, I expect you'd have been much less likely to adopt such a tone. There are many references to this revered investigator available on-line, including the library of the eminent Museum of Comparative Zoology at Harvard which is named in honor of Ernst Mayr (http://library.mcz.harvard.edu/).
Having made such a preface, I hope I'm now able to express my own criticism cogently.
I agree with the general content of this anonymous posters comment. I assume my criticism ought to be addressed to Dr. Stein, as the comment is given as a quote.
First, it is a bit incendiary to express this observation in such a manner. People familiar with Dr. Mayr's work are likely to have strong feelings about whether they agree or not with his deeply considered theories. Whether they emphatically agree or equally emphatically disagree with Dr. Mayr's theories/observations, the notion a seminal contributer to the field would "...sit in his office and look at other people's data..." is likely not to strike anyone in his field as truly reflecting the nature of Dr. Mayr's contribution even over his post-septuagenarian years.
There is a significant difference between performing novel experimental field and/or benchtop research and working to cast the body of this research in a coherent theoretical framework. Both means of contributing to a scientific field are absolutely essential to the progress of scientific knowledge. I believe this is essentially the point Dr. Stein was seeking to express in his extemporaneous reply to a question after his talk.
My bigger concern, however, is with what I assume based on this report is Dr. Stein's general presentment bioinformatics will be absorbed into the various biological fields to which it is relevant. This appears to be based on the argument "bioinformatics" is a tool analogous to a microscopy or PCR. My difficulty with this argument is these latter "tools" are based on specific physical principles which - broadly defined are:
- microscopy: diffraction, transmission, reflection and/or emission of EMF radiation of various frequencies of to produce description of a material based on its physico-chemical properties
- PCR: temperature-cycling-induced annealing of polynucleotides leading to the amplification of specific individual nucleotide sequences either for the purpose of detecting or producing said sequences.
Though both of these techniques have extremely varied means of application, those general physico-chemical principles are in effect in all these situations.
Bioinformatics can be broadly defined as:
- bioinformatics: computer-aided organization, manipulation and/or analysis of biological scientific data
Not only does this definition extend well-beyond the specific application of computational tools to bio-molecular sequences & structures - the application subject to a decade-long, massive proliferation starting in the late 1980s - it is also so general a category as to be nearly unlimited in its applicable problem domain. Whereas technical innovations - and thus novel experimental observations - using microscopy or PCR need to provide a new means to address the aforementioned physico-chemical principles, the properties being exploited in bioinformatics data manipulation appear to be constantly expanding. Witness the proliferation in the past several years of principles culled from the confluence of computational linguistics, artificial intelligence and database research leading to the wide-spread application of knowledge-frameworks, ontologies and controlled-vocabularies as a means to identify and perform correlative analysis on the semantic qualities of biological data.
I believe this leads to two results:
- the spread of bioinformatic applications to an ever-increasing collection of computational disciplines;
- the extended presence of the various "-ics" - genomics, transcriptomics, proteomics, pharmacogenomics, physiomics, behaviomics (http://ihome.cuhk.edu.hk/~b400559/2003j_functa.html), ecolonomics, evolutiomics (sic) - over the coming century. I believe the existence of these technical disciplines will not only continue through the century, they are likely to uniquely inform & rejuvenate various other biological disciplines such as systems biology, multigenic analysis and complex ecological modeling.
Just my $0.02.
Thanks again for covering the conference.
Cheers,
Bill Bug
Laboratory of Bioimaging & Anatomical Informatics
Dept. of Neurobiology & Anatomy
Drexel University College of Medicine
www.neuroterrain.org
William.Bug@drexel.edu
- microscopy: diffraction, transmission, reflection and/or emission of EMF radiation of various frequencies of to produce description of a material based on its physico-chemical properties







I have managed to survive by staying ahead of the curve. Unfortunatly, the curve always dies in frustration and hate. Bioinformatics will be reduced to just another service, probably done offshore by transnationals who will charge a s mall 'on demand' fee for the requests made by service consumers. Except for the top people, the rest of the Bioinformatics community might as well get ready to train their replacement sooner rather than later.
Yes, I do plan to learn Bioinformatics at the graduate level starting this winter, but I have no illusions about a sustainable future for that field in this country (USA.) We will simply have the bottom cut out from under us. Do it for love and plan to kiss it goodbye really soon.
Regards,
Steve Lenk