1
|
- Stephen R. Heller
- Consultant
- Silver Spring, MD 20902
- steve@hellers.com
|
2
|
|
3
|
|
4
|
|
5
|
|
6
|
- From the 1970’s to 2006 there
has been an evolution of scientific information from paper to electronic
form, coupled with a revolution in computer and network communication
capabilities (i.e., the Internet) which is transforming the way
information is collected, processed,
disseminated, and used.
|
7
|
- Web 1.0 - We have evolved from everything on paper, which needed to be
centrally organized and distributed from a central source to …
- Web 2.0 - Currently uncontrolled chaos and a revolution with data and
information being dumped into systems around the world. à Web 2.0 is an attitude, not a technology
|
8
|
|
9
|
- Printed Abstracts from CAS, UK
- Few databases/compilations
- All on paper – Simple text; few diagrams
- A handful of computers worldwide
- Chemical Information was supported by a thriving chemical industry
|
10
|
- The chemist would read the CAS
or ISI Current Contents sections appropriate to their research needs.
Then he/she would either send a postcard for a reprint or go to the
library to read the full journal article of interest. In the
library this meant a request for an interlibrary loan to obtain the
article.
|
11
|
- Everything is electronic
- Databases are common in chemistry and biology
- Everyone has a PC and Internet access
- Data and databases are commonplace and VERY large
- Databases have gone from primarily text to value-added indexing, coding,
structures, and linking (e.g. PubChem)
- The chemical industry has been overtaken by
biology/biochemistry/biomedicine causing problems for publishers
- Bioinformatics data is the antithesis of the chemical data franchise
- Current Awareness has evolved into Continuous Awareness
|
12
|
- The chemist logs onto
CAS/SciFinder®, ISI Web of Science®, Integrity®, ScienceDirect®,
Scirus.com®, Chemindustry.com®, PubChem, or Chemweb.com® to search for
something of interest. Then he/she clicks in the hyperlink, using
LitLink or ChemPort and, assuming you have a paid for access to the
journal article, the article appears immediately on your computer screen
for you to read or print out and take to the bathroom to read. Now
document delivery is easy and fast. More importantly, one learns
from the experiences of others - being able to do computer searches of
the literature helps a lot and allows one to read more articles of
interest.
|
13
|
|
14
|
- Steve Heller’s Google News
Alert Service
- (A
non-scientific study )
- Started May 2004 – 6 news
articles
- September 2008 – 97 news articles
|
15
|
|
16
|
- Yahoo!- free
- Google - free
- MySpace – free
- Amazon
- Craiglist – free classified ads
- Wikipedia - free
- # 22 – NY Times - free
- # 44 – BBC - free
- # 5 – FaceBook – free
university/college social network
- # 371 – NLM-PubMed & PubChem - free
- # 6,310 - ACS
- # 48,927 – CAS
- # 222,273 – ISI/Web of Knowledge
- # 240,141 - ChemSpider
-
Alexa.com – October 2008
|
17
|
|
18
|
|
19
|
|
20
|
The availability of
for-free services such as those offered in the
patent field by the EPO and now Google (plus others) is a real threat
to financial viability of many traditional, high-cost information
providers. A small core of faithful users -- who feel they need
advanced features -- may stay with Thomson, CAS, Questel, Dialog,
etc.
But this small core may well be too small to support high-cost
services.
Harry Collier, private communication, January 2007
…And this small core of users are aging and retiring with the new generation which has been brought
up on Google, FaceBook, MySpace, and similar technology and services.
|
21
|
|
22
|
- From 2/2007 electronic RSC
journals will have metadata added to each article – CML, InChI/InChIKey,
and OBO – Open Biomedical Ontologies. This way one can search using
chemical structure and these index terms.
- Sooner, rather than later,
secondary publishers (e.g., CAS) will find their role is no longer
needed.
|
23
|
- A project whose time has come.
Without the Internet InChI would be just another in a series of
technically excellent, soon forgotten, projects for representing
chemical structures. The Internet, an international scientific body
(IUPAC), and international cooperation (US, UK, Czech Republic) has
led to the speedy development,
implementation, and use of InChI/InChIKey.
|
24
|
- While InChI is a public domain, open source system for creating a unique
computer-readable identifier (“name”)
it is NOT a registry system.
InChI’s and InChIKey’s are created only by those who choose to
adopt and use the algorithm.
Registry systems which index the literature are complimentary to
any InChI/InChIKey databases that anyone creates.
- What has made InChI/InChIKey
so successful and being widely
adopted by the community is that its’
success and adoption has been uncoerced. The US Government, large
database producers (NIH, ChemSpider, Beilstein, , etc.), Publishers
(Wiley, Nature, RSC, Prous,
etc.), Big Pharma (Pfizer, Novartis, etc.) and chemistry software
companies (Symyx, Microsoft,
CambridgeSoft, ChemAxon, ACD Labs, etc.).
-
Why is this happening?
|
25
|
|
26
|
- InChI/InChIKey is being adopted because organizations need
interoperability. Organizations need to be able to connect all their
internal data and information associated wit h their chemicals and link
both internally and externally. Times are now difficult for Pharma and
others who need to be able to correlate and link both internal and
external information.
- There is a insane amount of information available today , both public
and commercial/private. People
and organizations don't change when the see the light; they
change when they feel the heat.
InChI is open, free, and universal as opposed to unique
identifiers which are proprietary.
Keeping a unique identifier proprietary and (in some cases)
available at a high cost is stupidity on stilts. InChI/InChIKey has become the world
standard for being the unique chemical representation for a defined chemical structure because it
meets the needs of today’s chemical community.
|
27
|
|
28
|
|
29
|
|
30
|
|
31
|
|
32
|
Acknowledgements
Steve Bachrach, Mila Becker, Pieter Bolman, Evan Bolton, Steve Bryant,
Harry Collier, Alice Cooper, Rene Deplanque, Guenter Grethe, Stevan
Hanard, Sami Kassab, David Lipman, Gary Mallard, Randy Marcinko, Alan
McNaught, Bill Milne, Carmen Nitsche, David Prous, Josep Prous, Chris Reed,
Rich Roberts, Peter Murray-Rust, Henry Rzepa, Steve Stein, Peter Shepherd,
Bill Town, Wendy Warr, Ann Wolpert
|