DRAFT of 20 August, 1999


INTERNATIONAL UNION OF PURE
AND APPLIED CHEMISTRY
iupac logo

COMMITTEE ON PRINTED AND
ELECTRONIC PUBLICATIONS

Working Party on the Internet and IUPAC Publications*

GUIDELINES FOR THE USE OF THE INTERNET BY IUPAC BODIES

Prepared for publication by

Antony N. Davies 1, Stephen R. Heller 2, and John W. Jost 3

1ISAS, Institut für Spektrochemie, Postfach 10 13 52, 44013 Dortmund, Germany
2NIST/SRD, Mail Stop 820/113, 100 Bureau Drive, Gaithersburg, MD 20899-2310 USA
3IUPAC Secretariat, PO Box 13757, Research Triangle Park, NC 27709-3757, USA

*Membership of the Working Party during the preparation of these guidelines (1997-1999) was as follows:
J.W. Jost (Chairman), A.N. Davies, S.R. Heller, H.V. Kehiaian, A.D. McNaught.

Contents

CONTENTS *

1 ABSTRACT *

2 INTRODUCTION *

3 WWW.IUPAC.ORG *

Location *
Content
*
Electronic Publications
*
Pure and Applied Chemistry *
Chemistry International
*
Other Publications
*

4 Internet Homepage Design - Some Do’s and Don’ts *

Basic Principles *
Some Do’s and Don’ts *
5 Recommendation to use Chemical Multipurpose Internet Mail Extensions (MIME) on IUPAC Internet Web Sites. *

Multipurpose Internet Mail Extensions *
Chemical MIME
*

6 Chemical Markup Language *

7 Glossary *

INTERNATIONAL UNION OF PURE
AND APPLIED CHEMISTRY

COMMITTEE ON PRINTED AND
ELECTRONIC PUBLICATIONS

Working Party on the Internet and IUPAC Publications*

GUIDELINES FOR THE USE OF THE INTERNET BY IUPAC BODIES

Prepared for publication by

Antony N. Davies1, Stephen R. Heller2, and John W. Jost3

1 davies@isas-dortmund.de, ISAS, Institut für Spektrochemie, Postfach 10 13 52, 44013 Dortmund, Germany
2steve@hellers.com, NIST/SRD, Mail Stop 820/113, 100 Bureau Drive, Gaithersburg, MD 20899-2310 USA
3secretariat@iupac.org, IUPAC Secretariat, PO Box 13757, Research Triangle Park, NC 27709-3757, USA

*Membership of the Working Party during the preparation of these guidelines (1997-8) was as follows:
J.W. Jost (Chairman), A.N. Davies, S.R. Heller, H.V. Kehiaian, A.D. McNaught.

 

  1. ABSTRACT
  2. The rapid development of the Internet as a major communication tool between scientists has led to the need for a co-ordinated IUPAC presence. Many diverse groups have already initiated distribution of IUPAC related material via their Web sites. These guidelines will provide the structure on which the official IUPAC Internet site maintained through the Secretariat will be based and rules governing the interaction between this central site and various sites operated by other IUPAC bodies. Rules will also be published here for the operation of sites maintained by other bodies which contain IUPAC related information. The need for special care when making provisional recommendations widely available on the Internet will be emphasized.

  3. INTRODUCTION
  4. The meeting of the Committee on Printed and Electronic Publications, 24-25 August 1997 in Geneva recognised the need for a set of guidelines for IUPAC bodies wishing to extend their activities onto the Internet.

    It was also clear that certain strategic decisions needed to be taken regarding the use and maintenance of the main IUPAC web site. The Internet provides an excellent tool for increasing the profile of the Union.

    The main product of the Union is the publications and recommendations of the different commissions. It was felt that in order to maintain the high quality of information emanating from the Union urgent steps needed to be taken to ensure that the rapid introduction of the Internet into scientific life did not result in a lowering of the quality of information linked to the IUPAC name. By their nature the membership of Commissions changes regularly and a mechanism needed to be found to ensure all work invested into developing electronic information had a lifetime well beyond the limited tenure of particular individuals in the commissions.

    A further concern was the lack of guidance for commission members on how to implement use of the World Wide Web to improve efficiency within the commissions.

  5. WWW.IUPAC.ORG

Location

The IUPAC Secretariat at Research Triangle Park, North Carolina, USA, maintains the 'official' IUPAC site on the World Wide Web (WWW). This is located on a very high speed server system linked to the Internet by a high speed high bandwidth professional network with 24 hour a day availability.

Content

  • All ‘public’ information made available by IUPAC bodies should be mounted on this web site.
  • Individual groups may wish to mirror their contribution on their group’s own web sites.

  • Recommendations, reports and journal articles from IUPAC bodies will be available through the official web site. These documents will initially be available in Portable Document Format (PDF) to ensure no loss of format on distribution. (For more information on the Adobe Portable Document Format see http://www.adobe.com/prodindex/acrobat/adobepdf.html.) Compatibility with the printed version of a document cannot currently be ensured by using the simple Hypertext Mark-up Language encoding (HTML) implemented in the current generation of Internet browsers. It is also not possible to maintain formatting if the original word processing files are distributed as these will reformat a document depending on the local computer hardware (including printers) and software (operating system, word processor version, installed fonts, etc.)

  • Other information of a more transient nature such as commission or working party drafts will be made available in the most appropriate format depending on content. It is important to build in a formal hurdle to receiving preliminary or provisional information. Currently a request for the information needs to be made and distribution is controlled and this practise should continue in electronic form.
  • Numerical data, which is produced by some commissions in large quantities, will also be made available through the web site but the most appropriate method will be decided in consultation with the various IUPAC bodies and may well vary depending on data type.
  • An agreement with the IUPAC publishers will allow archiving of all IUPAC publications in SGML format at the Secretariat to enable the Union to react to future developments in Internet browser technology. (For more information on publishing in Standard Generalised Mark-up Language 'SGML', see http://www.oasis-open.org/cover/sgml-xml.html.) SGML is an ISO standard for content mark-up in electronic documentation and HTML is a very limited subset of this standard.

Electronic Publications

Pure and Applied Chemistry

In the immediate future Pure and Applied Chemistry will be available in electronic form via the IUPAC home page. Technical details of this mode of access remain to worked out.

Chemistry International

The content of Chemistry International is usually of a more administrative or informative nature for people working within the Union as well as the world chemical community. As such it lends itself more to presentation on the Internet and is now available online at:

http://www.iupac.org/publications/ci/

Other Publications

Other IUPAC publications such as the various colored books will be made available in electronic form via the IUPAC web site at various times following the release of the printed form. The exact timing of the release of the electronic version and the method of distribution will vary depending on the source of the publication, the content, and individual publishing agreements.

  1. Internet Homepage Design - Some Do’s and Don’ts

Basic Principles

It is important not to lose sight of the reason for the presence of a particular IUPAC body on the Internet. The delivery of information is either to a restricted group of people or to a wider audience than would usually be the case. Whichever the case is, the presence of the required information should be clear and retrieval easy.

Some Do’s and Don’ts

  1. Recommendation to use Chemical Multipurpose Internet Mail Extensions (MIME) on IUPAC Internet Web Sites.
  2. A CPEP working party is finalising a publication defining MIME types for chemistry and these should be implemented wherever possible on IUPAC Internet Web Sites. A brief overview of the current status and a table of the data file extensions covered is given below but please refer to the web site at the following URL: http://www.ch.ic.ac.uk/chemime/ for the latest information.

    Multipurpose Internet Mail Extensions

    In order to identify different types of data the Internet Engineering Task Force (IETF) has approved protocols called Multipurpose Internet Mail Extensions (MIME). MIME types are used to identify content of files or parts of files on the internet. MIME types consist of a primary and sub-type. The primary type defines the general type of data in the file, while the sub-type defines the exact file format. Examples of primary file types include text, image, audio and video. A couple of examples of MIME types are video/quicktime for QuickTime movies and audio/x-wav for wave audio files.

    Chemical MIME

    A proposal to include common scientific file formats in chemistry as a primary MIME type Chemical has been pioneered by Henry Rzepa (rzepa@ic.ac.uk) of Imperial College, London, UK and his colleagues and adopted as a topic for a IUPAC working party. A draft table of some of the chemical MIME types is given in Table 1 below.

     

    Table 1. Table of MIME Types

      Primary/sub-type

    Suggested qualifier(s)

    chemical/x-cxf 

    cxf

    chemical/x-mif

    mif

    chemical/x-pdb

    pdb

    chemical/x-cif 

    cif

    chemical/x-mdl-molfile 

    mol

    chemical/x-mdl-sdf 

    sdf

    chemical/x-mdl-rdf 

    rdf

    chemical/x-mdl-rxn

    rxn

    chemical/x-embl-dl-nucleotide

    emb, embl

    chemical/x-genbank 

    gen

    chemical/x-ncbi-asn1

    asn

    chemical/x-gcg8-sequence

    gcg

    chemical/x-daylight-smiles

    smi

    chemical/x-rosdal

    ros

    chemical/x-macromodel-input

    mmd, mmod

    chemical/x-mopac-input

    mop

    chemical/x-gaussian-input

    gau

    chemical/x-jcamp-dx

    dx, jdx

    chemical/x-kinemage

    kin

    What this basically means is that an internet aware software should recognise a file called mydata.gau as a gaussian input file. And a file called myspectrum.dx should be recognized as a JCAMP-DX spectrum and treated as such. The ‘x-‘ refers to the MIME type as being experimental This will be dropped when adopted by the IETF.

  3. Chemical Markup Language
  4. It has been suggested that a short description of the Chemical Markup Language (CML) would also be appropriate here although it’s implementation on IUPAC internte sites cannot currently be recommended due to a lack of available software support. This is a developing field which may have relevance in the future for reporting chemistry on the internet and IUPAC bodies should be aware of it’s existence, especially in case of an yoverlap with current or future projects.

    CML is an application of the extensible markup language (XML) being developed to contain chemical information within electronic documents. Currently work is concentrated on chemical structure information but IUPAC bodies should watch the web site at: http://www.xml-cml.org/ for developments. There is also a large amount of background information to this project available here.

    Many large mainstream development projects on the Internet involve the use of XML owing to the need for and value in electronic commerce. It is therefore only a matter of time before XML, and it's chemical child, CML, will find widespread use see:http://www.xml.org/

  5. Glossary

CPEP Committee on Printed and Electronic Publications

DTD Document Type Definition is the formal definition of the elements, structures, and rules for marking up a given type of SGML document. You can store a DTD at the beginning of the document or externally in a separate file.

HTML Hypertext Mark-up Language

PDF Adobe Portable Document Format

SGML Standard Generalised Mark-up Language is an international standard (ISO 8879) published in 1986.

WWW World Wide Web

MIME Multipurpose Internet Mail Extensions

IETF Internet Engineering Task Force

CML Chemical Markup Language

XML Extensible Markup Language