DRAFT of 20 August, 1999
INTERNATIONAL UNION OF PURE
AND APPLIED CHEMISTRY
Working Party on the Internet and IUPAC Publications*
GUIDELINES FOR THE USE OF THE INTERNET BY IUPAC BODIES Prepared for publication by
Antony N. Davies 1,
Stephen R. Heller 2,
and
John W. Jost 3 1ISAS, Institut für Spektrochemie,
Postfach 10 13 52, 44013 Dortmund, Germany *Membership of the Working Party during the preparation
of these guidelines (1997-1999) was as follows:
2NIST/SRD, Mail Stop 820/113, 100 Bureau Drive, Gaithersburg,
MD 20899-2310 USA
3IUPAC Secretariat, PO Box 13757, Research Triangle Park, NC
27709-3757, USA
J.W. Jost (Chairman), A.N. Davies, S.R. Heller, H.V. Kehiaian, A.D.
McNaught.
CONTENTS
*1 ABSTRACT
*2 INTRODUCTION
*3 WWW.IUPAC.ORG
*4 Internet Homepage Design - Some Do’s and Don’ts
*INTERNATIONAL UNION OF PURE
AND APPLIED CHEMISTRY
Working Party on the Internet and IUPAC Publications*
GUIDELINES FOR THE USE OF THE INTERNET BY IUPAC BODIES
Prepared for publication by
Antony N. Davies1, Stephen R. Heller2, and John W. Jost3
1 davies@isas-dortmund.de, ISAS, Institut
für Spektrochemie, Postfach 10 13 52, 44013 Dortmund, Germany
2steve@hellers.com, NIST/SRD, Mail Stop 820/113, 100 Bureau
Drive, Gaithersburg, MD 20899-2310 USA
3secretariat@iupac.org, IUPAC Secretariat, PO Box 13757,
Research Triangle Park, NC 27709-3757, USA
*Membership of the Working Party during the preparation
of these guidelines (1997-8) was as follows:
J.W. Jost (Chairman), A.N. Davies, S.R. Heller, H.V. Kehiaian, A.D.
McNaught.
The rapid development of the Internet as a major communication tool between scientists has led to the need for a co-ordinated IUPAC presence. Many diverse groups have already initiated distribution of IUPAC related material via their Web sites. These guidelines will provide the structure on which the official IUPAC Internet site maintained through the Secretariat will be based and rules governing the interaction between this central site and various sites operated by other IUPAC bodies. Rules will also be published here for the operation of sites maintained by other bodies which contain IUPAC related information. The need for special care when making provisional recommendations widely available on the Internet will be emphasized.
The meeting of the Committee on Printed and Electronic Publications, 24-25 August 1997 in Geneva recognised the need for a set of guidelines for IUPAC bodies wishing to extend their activities onto the Internet.
It was also clear that certain strategic decisions needed to be taken regarding the use and maintenance of the main IUPAC web site. The Internet provides an excellent tool for increasing the profile of the Union.
The main product of the Union is the publications and recommendations of the different commissions. It was felt that in order to maintain the high quality of information emanating from the Union urgent steps needed to be taken to ensure that the rapid introduction of the Internet into scientific life did not result in a lowering of the quality of information linked to the IUPAC name. By their nature the membership of Commissions changes regularly and a mechanism needed to be found to ensure all work invested into developing electronic information had a lifetime well beyond the limited tenure of particular individuals in the commissions.
A further concern was the lack of guidance for commission members on how to implement use of the World Wide Web to improve efficiency within the commissions.
The IUPAC Secretariat at Research Triangle Park, North Carolina, USA, maintains the 'official' IUPAC site on the World Wide Web (WWW). This is located on a very high speed server system linked to the Internet by a high speed high bandwidth professional network with 24 hour a day availability.
http://www.iupac.org/publications/ci/
Other IUPAC publications such as the various colored books will be made available in electronic form via the IUPAC web site at various times following the release of the printed form. The exact timing of the release of the electronic version and the method of distribution will vary depending on the source of the publication, the content, and individual publishing agreements.
A CPEP working party is finalising a publication defining MIME types for chemistry and these should be implemented wherever possible on IUPAC Internet Web Sites. A brief overview of the current status and a table of the data file extensions covered is given below but please refer to the web site at the following URL: http://www.ch.ic.ac.uk/chemime/ for the latest information.
Multipurpose Internet Mail Extensions
In order to identify different types of data the Internet Engineering Task Force (IETF) has approved protocols called Multipurpose Internet Mail Extensions (MIME). MIME types are used to identify content of files or parts of files on the internet. MIME types consist of a primary and sub-type. The primary type defines the general type of data in the file, while the sub-type defines the exact file format. Examples of primary file types include text, image, audio and video. A couple of examples of MIME types are video/quicktime for QuickTime movies and audio/x-wav for wave audio files.A proposal to include common scientific file formats in chemistry as a primary MIME type Chemical has been pioneered by Henry Rzepa (rzepa@ic.ac.uk) of Imperial College, London, UK and his colleagues and adopted as a topic for a IUPAC working party. A draft table of some of the chemical MIME types is given in Table 1 below.
Primary/sub-type |
Suggested qualifier(s) |
chemical/x-cxf |
cxf |
chemical/x-mif |
mif |
chemical/x-pdb |
pdb |
chemical/x-cif |
cif |
chemical/x-mdl-molfile |
mol |
chemical/x-mdl-sdf |
sdf |
chemical/x-mdl-rdf |
rdf |
chemical/x-mdl-rxn |
rxn |
chemical/x-embl-dl-nucleotide |
emb, embl |
chemical/x-genbank |
gen |
chemical/x-ncbi-asn1 |
asn |
chemical/x-gcg8-sequence |
gcg |
chemical/x-daylight-smiles |
smi |
chemical/x-rosdal |
ros |
chemical/x-macromodel-input |
mmd, mmod |
chemical/x-mopac-input |
mop |
chemical/x-gaussian-input |
gau |
chemical/x-jcamp-dx |
dx, jdx |
chemical/x-kinemage |
kin |
What this basically means is that an internet aware software should recognise a file called mydata.gau as a gaussian input file. And a file called myspectrum.dx should be recognized as a JCAMP-DX spectrum and treated as such. The ‘x-‘ refers to the MIME type as being experimental This will be dropped when adopted by the IETF.
It has been suggested that a short description of the Chemical Markup Language (CML) would also be appropriate here although it’s implementation on IUPAC internte sites cannot currently be recommended due to a lack of available software support. This is a developing field which may have relevance in the future for reporting chemistry on the internet and IUPAC bodies should be aware of it’s existence, especially in case of an yoverlap with current or future projects.
CML is an application of the extensible markup language (XML) being developed to contain chemical information within electronic documents. Currently work is concentrated on chemical structure information but IUPAC bodies should watch the web site at: http://www.xml-cml.org/ for developments. There is also a large amount of background information to this project available here.
Many large mainstream development projects on the Internet involve the use of XML owing to the need for and value in electronic commerce. It is therefore only a matter of time before XML, and it's chemical child, CML, will find widespread use see:http://www.xml.org/
DTD Document Type Definition is the formal definition of the elements, structures, and rules for marking up a given type of SGML document. You can store a DTD at the beginning of the document or externally in a separate file.
HTML Hypertext Mark-up Language
PDF Adobe Portable Document Format
SGML Standard Generalised Mark-up Language is an international standard (ISO 8879) published in 1986.
WWW World Wide Web
MIME Multipurpose Internet Mail Extensions
IETF Internet Engineering Task Force
CML Chemical Markup Language
XML Extensible Markup Language