|
1
|
- Stephen R. Heller
- Silver Spring, MD 20902
- steve@hellers.com
|
|
2
|
|
|
3
|
|
|
4
|
|
|
5
|
|
|
6
|
|
|
7
|
|
|
8
|
- From the 1960’s to 2007 there
has been an evolution of scientific information from paper to electronic
form, coupled with a revolution in computer and network communication
capabilities (i.e., the Internet) which is transforming and changing the
way information is collected, processed,
disseminated, and used.
|
|
9
|
- Web 1.0 - We have evolved from everything on paper and, via technology
to electronic form, which needed to be centrally organized and
distributed from a central source to
…
- Web 2.0 - Currently uncontrolled chaos and a revolution with data and
information being dumped into systems around the world. Web 2.0 is more an attitude more than
a technology. Web 2.0 is “leading from below”.
|
|
10
|
|
|
11
|
- Printed Abstracts from CAS, UK, Germany
- Few databases/compilations
- All on paper – Simple text; few diagrams
- A handful of computers worldwide
- Chemical Information was supported by a thriving, high profit
margin chemical and pharmaceutical industry
|
|
12
|
- The chemist would read the CAS
sections appropriate to their research needs. Then he/she would go
to the library to read the full journal article of interest. Often
this meant a request for an interlibrary loan to obtain the article.
|
|
13
|
- Everything is electronic
- Databases are common in chemistry and biology
- Everyone has a PC and Internet access
- Data and databases are commonplace and large
- Databases have gone from primarily text to value-added indexing, coding,
structures, and linking (e.g. PubChem)
- The chemical industry has been overtaken by
biology/biochemistry/biomedicine causing problems for the ACS/CAS
- Bioinformatics data is the antithesis of the chemical data franchise –
virtually all free vs. fee-based
- Current Awareness has evolved into Continuous Awareness
|
|
14
|
- The chemist logs onto
CAS/SciFinder®, ISI Web of Science®, Integrity®, ScienceDirect®,
Scirus.com®, Chemindustry.com®, PubChem, or Chemweb.com® to search for
something of interest. Then he/she clicks in the hyperlink, using
LitLink or ChemPort and, assuming you have a paid for access to the
journal article, the article appears immediately on your computer screen
for you to read or print out and take to the bathroom to read. Now
document delivery is easy and fast. More importantly, one learns
from the experiences of others - being able to do computer searches of
the literature helps a lot and allows one to read more articles of
interest.
|
|
15
|
|
|
16
|
|
|
17
|
|
|
18
|
|
|
19
|
|
|
20
|
|
|
21
|
- Yahoo!- free (#1 of top 10)
- Google - free
- MySpace – free social network
- MSN - free
- EBay
- Amazon
- Craiglist – free classified ads
- CNN news - free
- Wikipedia - free
- # 19 – NY Times - free
- # 27 – BBC - free
- # 66 – FaceBook – free university/college social network
- # 290 – NLM/NIH - free
- # 7,756 - ACS
- # 41,695 – CAS
- # 180,328 – ISI/Web of Science
|
|
22
|
- MySpace – 100 million users/profiles;
- 2,210,000
users/day
- Ebay – 100 million users --
5,044,00 users/day
- FaceBook – 8 million users/profiles of university students
- Yahoo! - 16,031,000 users/day
- Google – 15,130,000 users/day
- Wikipedia – 4,260,000 users/day
- NLM/NIH – PubMed/PubChem – 500,000 users/day
- CAS – 1000 organizations - ? users/day
- ComScore.com – June 2006 analysis
|
|
23
|
|
|
24
|
|
|
25
|
- Steve Heller’s Google
News
- Open Access” Alert
Service
- (A
non-scientific study )
- Started May 2004 – 6 news
articles/month
- As of February 2007 – 48 news articles/month
|
|
26
|
|
|
27
|
“The availability of
for-free services such as those offered in the
patent field by the EPO and now Google (plus others) is a real threat
to financial viability of many traditional, high-cost information
providers. A small core of faithful users -- who feel they need
advanced features -- may stay with Thomson, CAS, Questel, Dialog,
etc.
But this small core may well be too small to support high-cost
services. “
Harry Collier, private communication, January 2007
…And this small core of users are aging and retiring with the new generation which has been brought
up on Google, FaceBook, MySpace, and similar technology and services.
|
|
28
|
|
|
29
|
- London
Stock Exchange Information
- 3/19/1997 567.45 pence
- 3/19/2007 589.50 pence
- 100 UK pound investment in 1997 is now worth 104 pounds.
- http://www.reedelsevier.com/index.cfm?articleid=125
|
|
30
|
- Toronto Stock
Exchange Information
- 3/19/1997 28.15 dollars
- 3/19/2007 49.01 dollars
- http://www.investcom.com/cgi-bin/redir.cgi?url=http://finance.yahoo.com/q/hp?s=TOC.TO&frame=frame/yahoo.html
|
|
31
|
|
|
32
|
|
|
33
|
- ChemSpider is a new Open Access project of chemical structures linked to
data and property prediction programs (at present limited to ACD/Labs
software), all available free on the Internet.
- Going live this week at the ACS meeting is contains some 10 ½ million
structures.
|
|
34
|
|
|
35
|
- “companies that create no content of their own, and make money solely on
the back of other people’s content are raking in billions”
- Tom Rubin, Associate General Counsel, Microsoft , March 2007
- (taken out of context, but it is a great quote)
|
|
36
|
|
|
37
|
- Beilstein Journal of Organic Chemistry – less than 50 articles since
August 2005
- ChemistryCentral.com – just starting
- Arkivoc - The Organic Chemistry Journal
|
|
38
|
- Arkivoc BJOC
- (Arkat-usa.org)
- 2000 - 90
- 2001 - 174
- 2002 - 209
- 2003 - 327
- 2004 – 230
- 2005 - 305 18
- 2006 - 248 26
- 2007 - 98 3 (up to 2/07)
- Totals – 1681 47
|
|
39
|
|
|
40
|
- From 2/2007 electronic RSC
journals will have metadata added to each article – CML, InChI, and OBO
– Open Biomedical Ontologies. This way one can search using chemical
structure and these index terms.
- Sooner, rather than later,
secondary publishers (e.g., CAS) will find their role is no longer
needed. See David Flaxbart:
- http://www.istl.org/07-winter/viewpoints.html
|
|
41
|
- A project whose time
has come. Without the Internet
InChI would be just another in a series of technically excellent, soon
forgotten, projects for representing chemical structures. The Internet,
an international scientific body (IUPAC), and international cooperation
(US, UK, Czech Republic) has led
to the speedy development, implementation, and use of InChI.
- While InChI is a public
domain, open source system for creating a unique computer-readable
identifier (“name”), and (soon) an InChI number, it is NOT a registry
system. InChI’s are created only
by those who choose to adopt and use the algorithms. Registry systems which index the
literature are complimentary to any InChI databases and InChI numbers
that anyone creates.
|
|
42
|
- Digital ‘Naming’ of Chemicals:
- Chemical structure is the true ‘identifier’
- But, structure representations are not unique or convenient for
computers.
- So, convert structure to a unique ‘name’ by fixed algorithms
- The IUPAC International Chemical Identifier (InChI)
|
|
43
|
- 1. Chemicals
- – Fast isomerization
(tautomerization)
- – Ill-defined connectivity
- 2. Chemists
- – Differing conventions
- Depends on discipline, education and convenience
- Imprecision/uncertainty
|
|
44
|
- Formula
- Connectivity
- Stereochemistry/Chirality
- Isotope
- Charge
- Fixed/Mobile Hydrogens
- And so on
|
|
45
|
|
|
46
|
|
|
47
|
|
|
48
|
|
|
49
|
- Journals:
- RSC
- Prous Science – Drugs of the
Future
- Organizations:
- EPO
|
|
50
|
|
|
51
|
|
|
52
|
|
|
53
|
|
|
54
|
Acknowledgements
Steve Bachrach, Mila Becker, Jost Bohlen, Evan Bolton, Pieter Bolman, Evan
Bolton, Bob Bovenschulte, Steve Bryant, Harry Collier, Alice Cooper,
Rene Deplanque, Ron Dunn, Guenter Grethe, Stevan Hanard, Sami Kassab, David
Lipman, Gary Mallard, Randy Marcinko, Alan McNaught, Bill Milne, Carmen
Nitsche, Josep Prous, Chris Reed, Rich Roberts, Peter Murray-Rust, Henry
Rzepa, Steve Stein, Peter Shepherd, Bill Town, Andrea Twiss-Brooks, Wendy
Warr, Ann Wolpert
|