|
1
|
- Stephen R. Heller
- Consultant/Guest Researcher
- NIST/PCPD
- Gaithersburg, MD 20899
- steve@hellers.com
|
|
2
|
|
|
3
|
|
|
4
|
|
|
5
|
|
|
6
|
|
|
7
|
- From the 1950’s to 2006 there
has been an evolution of scientific information from paper to electronic
form, coupled with a revolution in computer and network communication
capabilities (i.e., the Internet) which is transforming the way
information is collected, processed,
disseminated, and used.
|
|
8
|
- Web 1.0 - We have evolved from everything on paper, which needed to be
centrally organized and distributed from a central source to …
- Web 2.0 - Currently uncontrolled chaos and a revolution with data and
information being dumped into systems around the world.
|
|
9
|
- Technical
- Economic
- Political/Cultural
|
|
10
|
|
|
11
|
- Printed Abstracts from CAS, UK
- Few databases/compilations
- All on paper
- A handful of computers worldwide
- Chemical Information was supported by a thriving chemical industry
|
|
12
|
|
|
13
|
- 1. Hardware:
- Smaller
Faster
Cheaper
- Networked
2. Software:
Bigger
Slower
More expensive (fee based)
- Open Source (free)
- 3. Data/Information
- More expensive (fee based)
- Free (Open Access)
|
|
14
|
|
|
15
|
|
|
16
|
- Everything is electronic
- Databases are common in chemistry and biology
- Everyone has a PC and WWW access
- Data and databases are commonplace and large
- Databases have gone from primarily text to value-added indexing, coding,
structures, and linking (e.g. PubChem)
- The chemical industry has been overtaken by
biology/biochemistry/biomedicine causing problems for the ACS/CAS
- Bioinformatics data is the antithesis of the chemical data franchise
- Current Awareness has evolved into Continuous Awareness
|
|
17
|
- The chemist logs onto
CAS/SciFinder®, ISI Web of Science®, Integrity®, ScienceDirect®,
Scirus.com®, Chemindustry.com®, PubChem, or Chemweb.com® to search for
something of interest. Then he/she clicks in the hyperlink, using
LitLink or ChemPort and, assuming you have a paid for access to the
journal article, the article appears immediately on your computer screen
for you to read or print out and take to the bathroom to read. Now
document delivery is easy and fast. More importantly, one learns
from the experiences of others - being able to do computer searches of
the literature helps a lot and allows one to read more articles of
interest.
|
|
18
|
|
|
19
|
|
|
20
|
|
|
21
|
|
|
22
|
|
|
23
|
|
|
24
|
|
|
25
|
- Yahoo!- free
- Google - free
- MySpace – free social network
- MSN - free
- EBay
- Amazon
- Craiglist – free classified ads
- CNN news - free
- Wikipedia - free
- # 19 – NY Times - free
- # 27 – BBC - free
- # 66 – FaceBook – free university/college social network
- # 290 – NLM/NIH - free
- # 7,756 - ACS
- # 41,695 – CAS
- # 180,328 – ISI/Web of Science
|
|
26
|
- MySpace – 100 million users/profiles;
- 2,210,000
users/day
- Ebay – 100 million users --
5,044,00 users/day
- FaceBook – 8 million users/profiles of university students
- NLM/NIH – PubMed/PubChem – 500,000 users/day
- CAS – 1000 organizations - ? users/day
- Yahoo! - 16,031,000 users/day
- Google – 15,130,000 users/day
- Wikipedia – 4,260,000 users/day
- ComScore.com – June 2006 analysis
|
|
27
|
|
|
28
|
|
|
29
|
- Peter Suber list (started in 2001):
-
http://www.arl.org/sparc/soa/index.html
- Steven Harnad List (started in 1998):
- http://amsci-forum.amsci.org/archives/American-Scientist-Open-Access-Forum.html
|
|
30
|
- Beilstein Journal of Organic Chemistry
- Chemistrycentral.com
|
|
31
|
|
|
32
|
|
|
33
|
|
|
34
|
- A project whose time
has come. Without the Internet
InChI would be just another in a series of technically excellent, soon
forgotten, projects for representing chemical structures. The Internet,
an international scientific body (IUPAC), and international cooperation
(US, UK, Czech Republic) has led
to the speedy development, implementation, and use of InChI.
- While InChI is a public
domain, open source system for creating a unique computer-readable
identifier (“name”) it is NOT a
registry system. InChI’s are
created only by those who choose to adopt and use the algorithm. Registry systems which index the
literature are complimentary to any InChI databases that anyone creates.
|
|
35
|
- Digital ‘Naming’ of Chemicals:
- Chemical structure is the true ‘identifier’
- But, structure representations are not unique or convenient for
computers.
- So, convert structure to a unique ‘name’ by fixed algorithms
- The IUPAC International Chemical Identifier (InChI)
|
|
36
|
- 1. Chemicals
- – Fast isomerization
(tautomerization)
- – Ill-defined connectivity
- 2. Chemists
- – Differing conventions
- Depends on discipline, education and convenience
- Imprecision/uncertainty
|
|
37
|
- Formula
- Connectivity
- Stereochemistry/Chirality
- Isotope
- Charge
- Fixed/Mobile Hydrogens
- And so on
|
|
38
|
|
|
39
|
|
|
40
|
|
|
41
|
|
|
42
|
|
|
43
|
|
|
44
|
|
|
45
|
|
|
46
|
|
|
47
|
|
|
48
|
Acknowledgements
Steve Bachrach, Mila Becker, Pieter Bolman, Bob Bovenschulte, Steve Bryant,
Harry Collier, Alice Cooper, Rene Deplanque, Guenter Grethe, Stevan
Hanard, Sami Kassab, Gary Mallard, Randy Marcinko, Alan McNaught, Bill Milne,
Carmen Nitsche, Josep Prous, Chris Reed, Rich Roberts, Peter Murray-Rust,
Henry Rzepa, Steve Stein, Peter Shepherd, Bill Town, Andrea Twiss-Brooks,
Wendy Warr, Ann Wolpert
|