Paper The following article is Free article

Advances in reference materials and measurement techniques for greenhouse gas atmospheric observations

, , , , , , , , , , , , , , , , , , and

Published 16 May 2019 © 2019 BIPM & IOP Publishing Ltd
, , Focus on Advances in Metrology in Chemistry and Biology Citation Paul J Brewer et al 2019 Metrologia 56 034006 DOI 10.1088/1681-7575/ab1506

0026-1394/56/3/034006

Abstract

We present the global research landscape which aims to deliver a measurement infrastructure to underpin atmospheric observations of key greenhouse gases governing changes in the Earth's climate. These measurements present a significant challenge to the metrological community, analytical laboratories and major producers of reference materials. The review focuses on the progress made in the Gas Analysis Working Group of the Consultative Committee for Amount of Substance: Metrology in Chemistry and Biology (CCQM-GAWG) in establishing the primary realisation of the amount-of-substance fraction for carbon dioxide, methane and nitrous oxide in an air matrix. It also focuses on the importance of providing traceable measurements of isotopic composition of these components for commutability of reference materials and for isotope ratio measurements for greenhouse gas source attribution. The review examines the developments in the Global Atmosphere Watch (GAW) Programme of the World Meteorological Organization (WMO) for providing the framework for the development and implementation of integrated greenhouse gas observations, which is vital for understanding the global carbon cycle and the role greenhouse gases play in climate change. The developments in analytical techniques are also discussed which have shaped the direction of the metrology required to meet the evolving and future needs of stakeholders.

Export citation and abstract BibTeX RIS

1. Introduction

Industrial and domestic activities have been recognised by the Kyoto Protocol [1] and more recently by the Conference of Parties (COP21) [2] as being major contributing sources to one of the greatest risks to society, climate change [3]. To prevent stark changes to the Earth's climate, emissions of key greenhouse gases such as carbon dioxide, methane and nitrous oxide must be reduced. Abundances of carbon dioxide and methane in the atmosphere are at the highest they have been in the past 3 million years and this is mainly attributable to human activities [4]. In 2015, the World Meteorological Organization (WMO) reported that the global average amount-of-substance (later abbreviated to amount) fraction of carbon dioxide exceeded the symbolic 400 µmol mol−1 threshold and it is unlikely to return below it in our lifetimes [58]. Current levels of methane are nearly triple the pre-industrial value [7, 9, 10].

Estimating greenhouse gas emissions based on atmospheric observations is challenging because it requires accurate, high-precision measurements of amount fraction. Measurements made at different locations and by different laboratories must be highly consistent, with minimal bias introduced through sampling and calibration. The Global Atmosphere Watch (GAW) Programme within the WMO has provided a framework for the development and implementation of integrated greenhouse gas observations since 1989 [11]. In the GAW programme, consistency is targeted through traceability to unique scales that are maintained over long time periods. Further, GAW has adopted goals for network compatibility, which stem from maximum bias tolerance among networks and data providers. These requirements make this area specially challenging, as measurement standards are needed with very small uncertainty and good long-term stability to underpin trends in climate monitoring measurement results.

Since its formation, the Gas Analysis Working Group of the Consultative Committee for Amount of Substance: Metrology in Chemistry and Biology (CCQM-GAWG) has made prodigious strides towards ensuring international comparability, meeting the needs of stakeholders and advancing measurement science. An area of notable achievement is the development of gas reference materials and analytical techniques for underpinning atmospheric composition measurements of greenhouse gases to address evolving measurement requirements [12]. In 2010 the WMO became a signatory of the International Committee for Weights and Measures' Mutual Recognition Arrangement (CIPM-MRA) [13]. The partnership formed between the WMO's designated institutes and the CCQM-GAWG has benefitted both the metrology and atmospheric monitoring communities. Key comparisons organised by the CCQM-GAWG provide a useful link between the primary standards used in the WMO-GAW network and the SI and an on-going framework for assessing drift in these measurements systems over time. The metrology community has benefitted substantially from the experience and research input from expert laboratories designated by the WMO which has led to significant advances in measurement science. This partnership is striving to provide a metrology infrastructure to support scientific research related to greenhouse gas sources and sinks and address real world applications.

Measurements of isotopic composition are also needed to help discriminate anthropogenic emissions from natural contributions and to provide important information about the processes involved in the sources, sinks and chemical transformations of these components. The anthropogenic component of these is very difficult to assess (let alone be split into the various emission sources) because of the very significant, temporally and spatially varying natural sources and sinks [14]. Abundance ratios of isotopocules are regarded as useful parameters to infer the origin and production-consumption mechanisms of these substances and to estimate their global budget, but the signals are small and thus an accurate and long-term stable calibration scheme is required. The International Atomic Energy Agency (IAEA), signatory of the CIPM-MRA since 2010, alongside several other organisations and National Metrology Institutes (NMIs), provides the highest-level reference materials for hydrogen, carbon, nitrogen, oxygen and sulphur isotopes. It is recognised as a custodian for stable isotope scales of elements hydrogen, carbon, nitrogen, oxygen and sulphur [15]. Various reference materials produced by other providers are traceable to reference materials from the IAEA. Reference materials for isotope ratio are currently the only approved traceability exception within the CIPM MRA.

Stable isotope ratio mass spectrometry (IRMS) is still the most precise tool for measuring stable isotope ratios but the introduction of various spectroscopic techniques (collectively known as isotope ratio infrared spectroscopy, or IRIS), has revolutionised the measurement of key greenhouse gas components in air by enabling real time in situ field measurements. These measurements help identify and quantify sources and sinks at local, regional, and global scales, and contribute to the understanding of their relative impacts on atmospheric concentrations. IRIS instruments usually report values as amount fractions of individual isotopocules, and for the most accurate measurements of isotope ratio and amount fraction, require calibration gases that have their isotope ratios assigned to at least (preferably better than) the analytical precision of the technique. This need has triggered development of gas reference materials that provide traceability for all stable isotope ratio measurements to the primary scales.

This review focuses on the developments in reference material production and measurement techniques for underpinning atmospheric amount fraction measurements of three key greenhouse gases (carbon dioxide, methane and nitrous oxide) as well as the isotopic composition of these components for source attribution. Current practices and developments for the primary realisation within the CCQM-GAWG, and the WMO-GAW programme are reviewed as well as how the state of the art in measurement techniques for accurate greenhouse gas measurement has evolved in recent years.

2. Developments in the CCQM-GAWG

Members of the CCQM-GAWG have expertise in the analysis of gas composition, the preparation of gas reference materials [16], and maintaining national measurement standards. The most accurate gas reference materials value assigned for the amount fraction of their components are usually realised by gravimetric preparation, where the quantity of interest is the amount fraction of the component in the gas mixture. The lowest measurement uncertainties are required for the atmospheric observations of greenhouse gases, with reference materials for carbon dioxide, methane and nitrous oxide in air being of major interest. Section 2.1 shows the progress made towards improved comparability of the reference materials maintained by the NMIs. Section 2.2 summarises the state of the art in gas mixture preparation, assignment of amount fractions and evaluation of the associated measurement uncertainty. In the quest for low uncertainties for measurements of atmospheric greenhouse gas amount fractions, the isotopic composition of gas reference materials is a subject of growing concern. Section 2.3 highlights how the effects of the isotopic composition were observed in comparisons using spectroscopic methods, and reviews value assignment procedures to ensure compatibility and commutability (the ability of a gas reference material to act as an air sample in a specific air monitoring instrument) of gas reference materials.

2.1. Key comparisons and state of the art in comparability

Reference materials prepared by NMIs using gravimetric methods described in section 2.2 undergo a careful series of verification experiments to ensure the traceability of the amount fraction to the SI with the lowest possible uncertainty [17]. Ultimately, they are compared during key comparisons organised under the CIPM-MRA [13]. A key comparison establishes the equivalence of the NMIs' reference materials at the highest possible metrological level and confirms as far as is possible the complete understanding of the methods used by participants. A specificity of CIPM key comparisons is the choice of one or more key comparisons reference value (KCRV) and uncertainty, considered to be the best estimate(s) of the SI-traceable value and uncertainty that can be demonstrated for the amount fraction measured.

A number of such comparisons have already been undertaken by members of the CCQM-GAWG to underpin gas reference materials of the three major greenhouse gases in air. A pilot study was first conducted in 2003 on carbon dioxide and methane in synthetic air (CCQM-P41 [18, 19]). It used two different models of comparison. The first with an ensemble of reference materials prepared by the pilot laboratory and sent to other participants (Part 1) and the second with each participant sending their prepared reference materials to a central facility (Part 2). The second scheme allows the reference value for cylinders to be calculated from the largest consistent set of reference materials and has the advantage of reducing uncertainties of the reference value, as demonstrated for example with carbon dioxide in air comparisons (now 0.05 µmol mol−1 (CCQM-K120.a 2018 [20]) reduced from 0.2 µmol mol−1 (CCQM-K52, 2006 [21])) some ten years previously.

For these comparisons to fulfil the above principles, there are a number of experimental and measurement conditions which need to be agreed between participants and laid down in the comparison protocol. These include a defined time scale, agreement on the amount fraction in the reference materials, appropriate analytical techniques acting as comparators, agreed mathematical treatment of the results, and sometimes defined requirements on the composition of the gas mixtures prepared by participants.

2.1.1. Improved analytical techniques used as comparators.

Comparisons which follow the model where all participants send their reference materials to a central facility are most beneficial when the instrumentation used for the analysis is specific to the component, unbiased towards potential differences in the reference materials other than the amount fraction and can be operated in a way to minimise the variance of measurements during the time required to conduct the comparison. The aim is to perform analytical measurements that produce an analytical uncertainty that is negligible compared to the uncertainties on reference materials declared by participants.

Component specificity, a requirement of measurements in whole air, is normally achieved by using the same instrumentation found in air monitoring laboratories, as described in section 4. GC-FID was used during the methane and carbon dioxide key comparisons [20, 22] and GC-ECD during the nitrous oxide key comparison [23]. As GC instruments are known to present drifts on a time scale of about an hour, the pilot laboratory normally uses drift correction with a control cylinder and optimises its measurement sequence to obtain minimum repeatability uncertainty. Relative standard uncertainties of the order of 0.02% were achieved on methane measurements during CCQM-K82, and between 0.003% and 0.01% on carbon dioxide during CCQM-K120.

Another beneficial practice introduced during CCQM-K82 was to measure all submitted reference materials using two analytical techniques differing as much as possible in their principle; this increases the robustness of the comparison towards incidents, and most importantly, towards potential method-specific biases. In this way, techniques relying on absorption of infrared light by the component were also selected: CRDS during CCQM-K82, Fourier-transform infra-red (FTIR) during CCQM-K120, and TDLAS to be used in the upcoming comparison on nitrous oxide (CCQM-K68.2019). These techniques are normally characterised with better short-term precisions than GC (section 4.3), which may be mitigated by additional variances due to the time required to compare as many as sixteen reference materials together [20]. They are, however, subject to a specific bias arising from differences in the composition of the matrix (a 'component dependent pressure broadening effect' that drove tight requirements on prepared reference materials during the comparisons). In addition, spectroscopic techniques are normally specific to isotopologues, resulting in biases if the reference materials in a comparison presents different isotopic compositions. This was addressed during CCQM-K120, with a third IRIS instrument introduced to measure carbon dioxide isotope ratios in all reference materials to correct the carbon dioxide amount fractions measured by FTIR.

2.1.2. Increased constraints on the composition of reference materials.

The property of commutability towards measurement techniques introduced in section 2.3 constitutes a strong requirement during key comparisons as well. Comparisons conducted since 2010 introduced tighter limits on the amount fraction of the main components of synthetic air (nitrogen, oxygen and argon) imposed to participants when preparing their reference materials, to ensure a negligible uncertainty due to the pressure broadening effect. During the validation work performed by the BIPM for CCQM-K82, it could be demonstrated that reference materials of methane prepared by the National Institute of Standards and Technology (NIST) in scrubbed dry air were equivalent to those prepared in synthetic air within the limits imposed by the use of CRDS as the analytical technique [24].

For comparisons of carbon dioxide in air reference materials by spectroscopic techniques such as FTIR used during CCQM-K120, the dependency of the technique to particular carbon dioxide isotopologues was considered. However participants were not asked to select a particular composition when preparing their reference materials. Instead, the pilot laboratory measured the isotopic composition of carbon dioxide in each reference material with an IRIS instrument and successfully corrected amount fraction measurements by FTIR [20].

2.1.3. Calculation of key comparison reference values.

The preferred model for comparisons organised by the CCQM-GAWG requires the preparation of gas reference materials by each participant at a defined nominal amount fraction. The measurement of all reference materials is carried out by the coordinating laboratory and considered as a set defining a calibration curve for the analytical instrument used as the comparator. The calibration curve is computed from a regression line of the measured values using uncertainties on both axes, following the procedures outlined in ISO 6143:2001 [25]. Gas reference materials from participants which contribute to the key comparison reference value (KCRV) are then selected on the basis of obtaining the largest consistent data set, as demonstrated with the goodness-of-fit parameter. Technical reasons are normally applied for removing outliers, for example a lack of stability reported in the most recent carbon dioxide comparison CCQM-K120 [20]. A KCRV is then determined for each gas reference material by applying the inverse regression equation to the value measured by the pilot laboratory. This allows degrees of equivalence Di to be defined for each participant, as the difference between the submitted value (typically the gravimetric value) and the KCRV, with its associated uncertainty at a 95% level of confidence.

2.1.4. Improved compatibility over the past ten years.

Over the last ten years, improvements in preparation of gravimetric reference materials described in section 2.2 and in the comparison methods resulted in better agreement amongst NMIs. This is most obvious for the methane and carbon dioxide comparisons. Figure 1 compares the results of CCQM-K82 with results from a comparison performed ten years earlier (CCQM-P41). The level of agreement amongst reference materials improved by a factor of ten. The standard deviation of results in 2003, around the reference value was 30 nmol mol−1, and 11 nmol mol−1 for a more limited set of reference materials, which improved to 1.8 nmol mol−1 in 2013.

Figure 1. Refer to the following caption and surrounding text.

Figure 1. Degree of equivalence between methane in air amount fractions (ambient levels) in the comparisons CCQM-P41 (2003) and CCQM-K82 (2013). The error bar represents the expanded uncertainty at a 95% level of confidence. For each comparison, the standard deviation σ of the degrees of equivalence is indicated.

Standard image High-resolution image

The most recent carbon dioxide comparison CCQM-K120 also demonstrated improvements in the uncertainty of the reference materials and the comparison method, as displayed in figure 2 with its results compared to the previous exercise CCQM-K52. The improvement is most striking when only participants selected as forming the largest consistent set of reference materials are considered, as indicated in the figure. A similar improvement is expected in the future nitrous oxide comparison to be commenced in 2019 by the BIPM (CCQM-K68.2019).

Figure 2. Refer to the following caption and surrounding text.

Figure 2. Degree of equivalence between carbon dioxide in air amount fractions (nominal value 380 µmol mol−1) during the key comparisons CCQM-K52 (2006) and CCQM-K120 (2016). The box indicates participants selected as forming a consistent set of standards. The error bars represent the expanded uncertainty at a 95% level of confidence.

Standard image High-resolution image

2.2. State of the art in gravimetric reference materials

One of the core capabilities of NMIs in gas analysis is the preparation of gas reference materials for one or several components at a given amount fraction in a well-defined matrix. The composition of these reference materials, expressed in amount fraction (mol mol−1), is traceable to the SI through gravimetric techniques. The preparation involves the transfer of nominally pure starting materials into gas cylinders and is fully described in ISO 6142-1 [26]. The major contributors to the uncertainty of the reference material are the mass of components added, the assessment of purity of the starting materials, determination of the molar mass of the air based on its composition of amounts of oxygen, argon and nitrogen, and the trace levels of other components [2729]. For the latter, the most important factor in determining the trace levels of contaminants in the reference material is the determination of these in each of the input gases. In recent years, a major focus of the CCQM-GAWG has been to reduce the uncertainties of the assigned amount fractions in the primary realisation of composition for key greenhouse gases to align with the data quality objectives (DQOs) of the WMO-GAW programme deemed necessary to enable trends to be identified [30]. The DQOs are qualitative and quantitative statements that clarify the objectives of observations, define the appropriate type of data, and specify tolerable levels of uncertainty. DQOs are used as the basis for establishing the quality and quantity of data needed to support decisions. This has directed research towards achieving standard uncertainties as small as 0.025 µmol mol−1 at a nominal amount fraction of 400 µmol mol−1 for carbon dioxide in air [20], 0.5 nmol mol−1 at a nominal amount fraction of 1800 nmol mol−1 for methane in air [22] and 0.05 nmol mol−1 at a nominal amount fraction of 330 nmol mol−1 for nitrous oxide in air [23], as reported in respective key comparison reports. Expanded uncertainties, at 95% coverage, of (0.13 to 0.20) µmol mol−1 for carbon dioxide, (1.0 to 3.5) nmol mol−1 for methane and 0.2 nmol mol−1 for nitrous oxide for secondary reference materials (SRMs) have been achieved [8, 31]. Until recently these targets were unprecedented amongst the metrology community.

Gravimetric reference materials for carbon dioxide, methane and nitrous oxide can be prepared in 'synthetic' or 'whole' air. In the case of synthetic air, nominally pure samples of argon, oxygen and nitrogen are mixed in 'atmospheric' proportions. Each one of the gases should be analysed for impurities (particularly the components of interest: argon, oxygen, nitrogen, carbon dioxide, methane and nitrous oxide). Whole air is collected from the atmosphere, preferably at an un-polluted location, dried and compressed into cylinders. It is important that any critical impurities, particularly the target component(s) are removed or reduced to a low level. Any trace impurity of the target component in the matrix gas (synthetic or whole air) should be accounted for in the final gravimetric value assignment, including the uncertainty, of the reference material [26, 29].

When synthetic air is used to prepare gas mixtures, the amount of argon, oxygen and nitrogen should be prepared at ambient levels to accurately match the amount fractions of those components in the whole air [32], this is especially important for use with spectroscopic techniques such as non-dispersive infra-red (NDIR), FTIR, tunable diode laser absorption spectroscopy (TDLAS) and cavity ring-down spectroscopy (CRDS). Potential sources of bias have been reported, including matrix composition dependent pressure broadening effects, described for example by Chen et al [33], Rella et al [34], and Nara et al [35], who quantified the bias introduced by deviations in the main component fractions within synthetic air matrices compared to ambient air composition. This is also critical to determine an accurate molecular mass for the whole air, which in turn is used in the calculations to determine the final amount fractions of carbon dioxide, methane and nitrous oxide in the reference material [28].

Improvements in instrumentation for gravimetry and determinations of trace amount fractions of the target component in the matrix air have been reported to result in reductions in combined uncertainties. For example, for methane reference materials, standard uncertainties have been reduced from 0.2% to 0.033% relative [36, 37]. State-of-the-art instrumentation has allowed NMIs to measure these components at trace levels and with small uncertainties as shown in table 1.

Table 1. Estimated best detection limits (expressed as amount fractions) and measurement techniques for carbon dioxide, methane and nitrous oxide.

Component Measurement technique Best detection limit (nmol mol−1)
CO2 Gas chromatography (methaniser and flame ionisation detector) Spectroscopy 2.0  ±  1.0
1.0  ±  0.2
CH4 Gas chromatography (flame ionisation detector) Spectroscopy 1.0  ±  0.5
0.2  ±  0.1
N2O Gas chromatography (electron capture detector) Spectroscopy 0.1  ±  0.1
0.02  ±  0.02

The state-of-the-art in preparation of gas reference materials incorporates the use of automated weighing systems [38]. The only operation required of the user is to place the cylinders onto the facility at each weighing stage in the preparation process, which allows more consistent positioning of the cylinder on the balance each time. Automation allows for more measurements than would be achieved with a manual procedure and as a result, measurement precision is enhanced by producing larger data sets in the same time. These systems typically produce mass data with a standard uncertainty of 2 mg from a distribution of ten measurements. Therefore, if sufficient gas either from pure, a diluted mixture, or the balance air, is added to the cylinder then the relative uncertainty in the gravimetric measurement can become almost insignificant. A cylinder is typically evacuated to around 1  ×  10−5 Pa and the average mass of the evacuated cylinder (mevac) is measured and subtracted from the mass of the cylinder when the pure gas (mp-gas) has been added. From this the mass of pure gas added (m) is determined. For example, 20 g of pure gas with a standard uncertainty of 2 mg contributes 0.014% to the overall relative standard uncertainty (${{u}_{{\rm rel}}}$ ) of the mass transferred as calculated from the equation (1)

Equation (1)

Therefore, adding ten times more pure gas would result in ${{u}_{{\rm rel}}}$ of 0.0014% which is negligible compared to all other sources of error. Adding more pure gas would result in a lower uncertainty, however this would be at the expense of an increasing number of dilution steps required to achieve the ambient levels for the component(s) of interest which would lead to increased uncertainty. Typical gravimetric dilution factors are chosen to be between 10 and 50. Facilities with enclosed automatic weighing systems, also record temperature, humidity and atmospheric pressure of the environment around the system and balance, allowing a correction to be made for any changes in air buoyancy.

Historically, carbon dioxide, methane and nitrous oxide gas reference materials at atmospheric amount fractions have been considered to be stable within the stated uncertainties. However, with recent analytical improvements, the effects of adsorption of the target component to the walls of the cylinder and valve have been demonstrated, as studied further by Leuenberger et al [39]. Within NMIs, 'daughter/mother' testing (also known as the decant method) was introduced to estimate losses due to initial adsorption of the component on the surface [40]. For example, an aliquot from a mixture of nominally 400 µmol mol−1 of carbon dioxide in air (mother) is transferred to a clean and evacuated cylinder (daughter). The daughter and mother mixtures are then analysed and a ratio between the two is calculated from the instrument responses. Differences of around 0.2 µmol mol−1 from initial adsorption loss have been reported and shown to be independent of the amount fraction of the parent mixture. [41] In addition, increases in the amount fraction have been observed as the pressure in the cylinder is reduced due to desorption of the carbon dioxide from the walls of the cylinder [42]. Corrections based on the initial adsorption of carbon dioxide are made to the overall gravimetric amount fraction. These effects were considered in the international comparison CCQM-K120 on carbon dioxide [20]. These adsorption/desorption issues have not been observed with methane or nitrous oxide. While these adsorption/desorption effects are still not fully understood or quantified, what is known has helped to increase the accuracy of the reference materials.

2.3. New activities for isotope ratio

2.3.1. The growing importance of isotope ratio measurements for accurate gas reference materials.

Three major sources of uncertainty for gravimetric preparation of reference materials must be characterised and controlled: mass measurement, gas purity and the relative atomic or molecular masses of the gases being weighed.

The relative atomic masses of the elements are tabulated [43] with uncertainties that, except for the mononuclidic elements, are dominated by the distribution of isotopes seen in naturally occurring elements from different environments. Reducing the uncertainties requires the measurement of isotope ratios in the sample, or by using the information available about the source(s) of the material(s) at hand [28].

Historically, the component of uncertainty that has limited accuracy has been the determination of purity of the gases being weighed, with the potential variation in isotope ratios not considered to significantly affect the amount fraction values and uncertainties achievable. Similarly, gas reference materials have been compared with techniques that were sensitive to the total amount fraction of the component of interest (e.g. GC with flame ionisation detectors (FID) or GC with thermal conductivity detectors (TCD)) and not a particular isotopocule. However, as gas analysers that are based on techniques that are sensitive to variations in isotopocule fractions have become more prevalent, further attention has been paid to the isotopic composition of the component of interest in the reference materials. Spectoscopic techniques used for high-end applications are also sensitive to differences in the composition of the matrix gas, adding a further demand to the basket. This section reviews the impact of isotope ratio variation of gas reference materials on their commutability, and how measurements of isotope ratio are being incorporated into gas reference material production and value assignment procedures to ensure the compatibility.

2.3.2. Comparisons affected by variability in the isotopic composition of the component.

One of the first CCQM comparisons, in which gas reference materials were compared under repeatability conditions, CCQM-P23 [44], focused on verifying the uncertainties of gravimetrically prepared gas reference materials (carbon monoxide in nitrogen), by reducing the influence of analytical uncertainties. The comparison included gas mixtures submitted at several amount fractions to one laboratory for analysis under repeatability conditions. At these higher than ambient amount fractions, relative standard uncertainties of the order of 0.01% were reported and expected to be validated. However, the comparison results demonstrated much larger inconsistencies when the analytical method was NDIR, up to 1% in some instances. These discrepancies were largely removed if GC-TCD or GC-FID was used as the comparison method. Subsequent measurements of the absolute 13C/12C ratios of the carbon monoxide in these gas mixtures reported a subset with ratio values grouped around 0.011, close to natural abundance, but others with measured values of 0.009 and even 0.004. The sensitivity of NDIR to isotopic composition of carbon monoxide was subsequently studied and reported [45], as well as methods to screen pure carbon monoxide gas to ensure its isotopic composition was close to natural abundance if it was to be used for the manufacture of calibration reference materials. Furthermore, whilst a bias proportional to the difference in isotope ratio from natural abundance could be demonstrated for the NDIR instruments tested, the exact relationship was dependent on the particular analyser. The variation in isotope ratios found in the pure carbon monoxide gases was reported to be due to the source and treatment of the gases, with a number of the gases having been processed to remove 13C carbon monoxide for delivery to the 13C labelled chemical market, leaving the remaining carbon monoxide gas depleted of 13C isotopologues.

2.3.3. Accurate and commutable reference materials for carbon dioxide amount fraction measurements.

The atmospheric monitoring community has driven the requirement for gas reference materials with reduced measurement uncertainties for monitoring long-term trends of greenhouse gas levels. Global networks [30, 46] for carbon dioxide monitoring within the WMO-GAW programme aspire to keep network biases less than 0.1 µmol mol−1 (0.05 µmol mol−1 in the southern hemisphere).

The first comparisons of carbon dioxide in air reference materials at atmospheric background amount fractions organised by the CCQM-GAWG, CCQM-P41 [18], occurred in 2003 with a second comparison, CCQM-K52 [21] in 2006. The direct comparison of the gas reference materials (CCQM-P41 part 2) was carried out using GC-TCD. This method is insensitive to the isotopic composition of carbon dioxide in the gas reference materials, and the spread of results, which were within a range of  ±0.5 µmol mol−1 of the reference value (if calculated using methods developed in later comparisons of this type—see CCQM-K82 [22]), were due to other factors. The CCQM-K52 comparison of carbon dioxide in air, at nominally 365 µmol mol−1, was performed by all nineteen participants measuring reference materials that had been sent to them with reference values that had been assigned with standard uncertainties of 0.2 µmol mol−1. The majority of laboratories demonstrated agreement of their measurement capabilities within  ±1 µmol mol−1. Thirteen participants used GC-TCD or GC-FID analysers for comparison with their in-house reference materials and no isotope effects were observed. For the seven participants using NDIR analysers, no isotopic effects were quantified or reported and measurement uncertainties were large in comparison to the expected size of the effects, apart from two participants (who did report isotope effects or whose uncertainties were not large in comparison). One of the participants, working closely with the atmospheric monitoring community, investigated and published [47] the effect of carbon dioxide isotopic variations on NDIR measurements. A difference approaching 0.2 µmol mol−1 was reported for measurements of reference materials with near ambient isotopic composition compared to those with highly depleted isotope ratios made from carbon dioxide from a combustion source.

Commercial laser-based analytical systems, such as TDLAS and CRDS, for trace gases have been developed with sufficient resolution to be able to measure the absorption of single isotopologues of carbon dioxide, which can be used for the measurement of amount fractions. Good agreement between CRDS and NDIR measurements was demonstrated [33] when a number of potential sources of bias were controlled, including the isotopic composition. The correction for isotope effects was shown to lead to the removal of a 0.15 µmol mol−1 bias, resulting in the mean difference between the measurement techniques being reduced to (0.05  ±  0.09) µmol mol−1.

The theoretical bias in amount fraction measurements caused by different isotope ratios in calibrant and sample gases can be readily calculated from measured isotope ratios, expressed in delta notation and traceable to conventional standards. As a first approximation, when carbon dioxide is measured as the major isotopocule 12C16O2, a 1‰ difference in δ13C values between the calibration and sample gases leads to a difference of 4 nmol mol−1 in the amount fraction of a 400 µmol mol−1 mixture, and 2 nmol mol−1 for the same difference in δ18O values. Blending carbon dioxide gases from different sources provides a method for matching to natural abundance isotope ratios for 13C, with the mixing of industrially sourced carbon dioxide with highly enriched 13CO2 described [48] as well as mixing with pure carbon dioxide obtained from a gas well source [49] (δ13C at  −1‰ versus Vienna Peedee Belemnite (VPDB)), with δ18O values better matched to atmospheric values using the latter source.

The latest comparisons of carbon dioxide in air mixtures CCQM-K120 [20], have focused on demonstrating equivalence of reference materials at better than the 0.1 µmol mol−1 level and correcting optical instrument responses for measured isotope ratios of compared carbon dioxide gases. The comparison at the BIPM consisted of forty four gas reference materials, prepared by fourteen participating laboratories. Two measurement techniques were used to compare the reference materials, to ensure no measurement method dependent bias: GC-FID and FTIR spectroscopic analysis. The latter had to be corrected for the isotopic composition of the carbon dioxide gases, involving measurements of isotope ratios in each reference material. The method used by the BIPM for measuring isotope ratios is described in a recent publication [49] and was validated with carbon dioxide in air reference materials that had been value assigned for their isotopic composition by BGC-IsoLab Jena, with traceability of the standards used to the VPDB-CO2 scale realised by the Jena reference air set (JRAS)-06 scale. The isotope ratios of submitted gases were found to range from  −66.9‰ to  +0.2‰ versus VPDB for δ13C and  −34.5‰ to  −0.3‰ versus VPDB-CO2 for δ18O. Typical standard uncertainties for the isotope ratio measurements made by laser spectroscopy were 0.2‰ and 0.5‰ for δ13C and δ18O respectively. Whilst these uncertainties are considerably greater than can be achieved by mass spectrometry, achieving them still requires careful consideration of the calibration strategy and traceability chain, both in terms of strategy [50] and standards to be used [49]. A comparison being coordinated jointly by the BIPM and the IAEA [51] aims to determine the compatibility of isotope ratio measurements of carbon dioxide, both in nominally pure carbon dioxide and carbon dioxide in air reference materials using a variety of analytical techniques.

The CCQM-K120 comparisons were successful in demonstrating the degrees of equivalence of carbon dioxide in air reference materials amongst participating NMIs at nominal amount fractions of 380 µmol mol−1, 480 µmol mol−1 and 800 µmol mol−1, with a standard uncertainty of 0.05 µmol mol−1 (a reduction by a factor of at least 4 relative to CCQM-K52). The methods based on spectroscopy (FTIR, with corrections for isotope ratios) and GC-FID for comparing carbon dioxide in air reference materials demonstrated excellent agreement, with the standard deviation of the difference in reference values found by the two methods being 0.07 µmol mol−1 at 380 µmol mol−1 and 0.05 µmol mol−1 at 480 µmol mol−1. It also demonstrated that providing reference materials to communities measuring atmospheric carbon dioxide with spectroscopic methods produced with carbon dioxide originating from combustion sources can lead to isotope ratios in the carbon dioxide which will produce biases in amount fractions measurements of up to 0.3 µmol mol−1. These biases can be corrected by either measuring isotope ratios of the gas reference material and appropriate processing by the user, or by using carbon dioxide which has been closely matched to atmospheric isotope ratio values in the production of the reference material. Isotopic differences in the reference materials that define the WMO-CO2-X2007 (CO2-in-air) scale have also been measured [52] and are now accounted for in the propagation of the scale, with isotope ratios reported as characterised to  ±0.2‰ for both δ13C and δ18O values. Absolute ratios for 13C/12C and 18O/16O which are currently known with uncertainty at about 1‰ (see below) appear to be another source of uncertainty.

Based on these developments and the advent of optical spectroscopy for measurements of isotope ratio, some NMIs are now focusing on developing appropriate reference materials with isotope ratios traceable to reference materials produced by the IAEA.

3. Developments in the WMO-GAW programme

Within the WMO-GAW programme, the concepts of scale and scale propagation are fundamental to underpinning amounts fractions of carbon dioxide, methane, nitrous oxide and other gases. The concept of scale rests on the idea that measurement precision is usually smaller than the absolute uncertainty of the quantity measured [53]. To achieve a high level of network compatibility (minimal bias between sites), all measurements should be traceable to the same reference which is proven stable over time [30]. Within the WMO-GAW programme, that reference consists of one or more primary gas reference materials, value-assigned using primary methods, or value-assigned by consensus in the case of stable isotopes of carbon dioxide and methane (artefact-based VPDB and Vienna standard mean ocean water (VSMOW) scales) [30]. Primary standards are used to realise a scale over a particular amount fraction or isotopic delta range (see section 3.3). Scales for greenhouse gas amount fractions (excluding stable isotopes) are maintained and propagated by Central Calibration Laboratories (CCLs) [30, 54, 55]. For example, the National Oceanic and Atmospheric Administration (NOAA) acts as the CCL for carbon dioxide, methane and nitrous oxide.

The stable isotope scales have some principal differences. The scales and initial primary standards are based on historical artefacts used for scale-definitions [15]. Primary stable isotope reference materials are maintained by the IAEA and are intended for the highest scale-realisation in practice [15]; there is no primary method linking these scales to SI; preparing materials of desired isotope composition independently of artefact-based reference materials is not possible.

Scale propagation for amount fraction typically follows a hierarchical scheme. The primary standards are used to define an instrument response function, and this is then used to value-assign secondary and tertiary standards. In the case of carbon dioxide, methane and nitrous oxide, secondary standards consist of cylinders containing whole air, or modified whole air with amount fractions spanning the range of interest. Whole air is used rather than synthetic air mixtures to avoid gas matrix effects when propagating the scale to WMO laboratories. Primary standards are analysed periodically to verify secondary assignments, but this is done relatively infrequently (once or twice per year) in order to preserve the primary standards. Scales are distributed by the CCL to other WMO-GAW laboratories through tertiary or quaternary level standards rather than through primary or secondary standards. The CCL for carbon dioxide and methane has improved scale propagation in recent years. Until recently, gas reference materials were value assigned using NDIR for carbon dioxide and GC-FID for methane. Both of these analytical systems were updated with state-of-the-art spectroscopic analysers in 2016. Scale propagation improved by about a factor of three to 0.01 µmol mol−1 for carbon dioxide [49], and approximately a factor of five to 0.1 nmol mol−1 for methane. A similar upgrade is planned for nitrous oxide.

For stable isotopes of carbon dioxide in air, the VPDB scale is propagated by distribution of carbon dioxide in air standards, known as JRAS flasks or by user-supplied tanks calibrated by the Stable Isotope Laboratory of the Max-Planck-Institute for Biogeochemistry (BGC-IsoLab), the CCL for carbon dioxide stable isotopes [30]. The carbon dioxide in JRAS mixtures is derived from carbonates and traceable to the internationally accepted VPDB scale through a chain of analyses against primary reference material(s) (see section 3.3).

Also fundamental to the WMO-GAW programme for greenhouse gases is the concept that scales are well-defined scales. Scale realisations are typically identified by the year in which they are adopted (e.g. WMO-CO2-X2007) and are updated as needed, such as when primary standards are added or removed from the set [56], or when advances in analytical methods reveal previously unknown deficiencies. Sometimes a scale revision is necessary due to propagation issues but is still based on the same set of primary standards. The revisions are still uniquely identified by name. An example of this is the WMO nitrous oxide scale. WMO-N2O-X2006, was realised in 2006 [57] and at that time secondary standards were assigned relative to the primaries. Several years later it was discovered that one of the secondary standards was drifting at ~  −  0.05 nmol mol−1a−1. This impacted reference gas assignments made by the CCL from January 2006 to October 2011, and required re-assignment of nitrous oxide values of the secondary standards and retroactive propagation through the calibration hierarchy. This re-assignment was identified as a new scale, WMO-N2O-X2006A.

3.1. State of the art in underpinning measurements of amount fraction

Two primary methods, gravimetry and manometry, are used to provide primary reference materials for WMO-GAW measurements of carbon dioxide, methane and nitrous oxide. Here we provide brief histories of these methods as they apply to WMO-GAW along with some recent advances.

3.1.1. Gravimetry.

Primary standards for methane and nitrous oxide are prepared using analytical balances, mass comparators, and gas blending manifolds as described in ISO 6142-1 [26]. In general, a known mass of the minor component is added to an evacuated cylinder and diluted with a known mass of the major component. The mass of the minor component can be determined in various ways: (a) by direct transfer of an aliquot to an evacuated cylinder followed by weighing, or (b) by adding a small amount of gas to a small transfer vessel (5–50 cm3), weighing, and then transferring the aliquot to an evacuated cylinder. Transfer from a vessel can involve expansion from the transfer vessel into the target, or expansion plus flushing [56] in which the vessel is repeatedly pressurised and the contents expanded into the cylinder. For methane, all three methods were used to create a suite of primary standards (direct addition, expansion, expansion with flushing). For nitrous oxide, only the expansion with flushing method was used [57]. An advantage of direct addition is that it ensures complete transfer because the mass of the aliquot is determined after it has been added to the cylinder. An advantage of the transfer vessel method is that relatively large dilution factors can be achieved (up to 2  ×  105 in a single step). A disadvantage of the transfer vessel method is that it requires quantitative knowledge of the transfer efficiency. High transfer efficiencies can be achieved with care [58].

For both methane and nitrous oxide, primary standards were prepared gravimetrically in 5.9 l aluminium cylinders. For methane seventeen standards were prepared from 1991–1995 over the nominal range (300–2600) nmol mol−1 [56] with additional standards prepared in 2013 to verify the original work and extend the range to 5900 nmol mol−1 (table 2). Scrubbed whole air (i.e. whole air from which methane, carbon dioxide, and nitrous oxide is removed to produce residual levels of  <10 nmol mol−1, <100 nmol mol−1 and  <5 nmol mol−1 respectively) was used as the dilution gas. Uncertainties were limited to some extent by difficulties in determining the amount of methane in the dilution gas (often a few nmol mol−1) [24, 56].

Table 2. Primary standards used to define WMO scales for carbon dioxide, methane and nitrous oxide.

Component No. of primary standards Method Nominal amount fraction range Scale ID
CO2 15 Manometry (250–520) µmol mol−1 WMO-CO2-X2007
CH4 22 Gravimetry (300–5900) nmol mol−1 WMO-CH4-X2014A
N2O 13 Gravimetry (260–370) µmol mol−1 WMO-N2O-X20006A

For nitrous oxide fifteen standards over the nominal range 260–370 nmol mol−1 were made gravimetrically from four parent mixtures at higher amount fractions [57, 59]. Synthetic air (oxygen and nitrogen) was used as dilution gas because scrubbed whole air often contains trace amounts of nitrous oxide, which can be difficult to quantify. This constrains the use of the current WMO-GAW primary standards as they cannot be used to calibrate spectroscopic instruments directly because they lack argon, which could be important with respect to pressure broadening [60]. Scale transfer is currently achieved using gas chromatography with electron capture detection (GC-ECD). Work is underway to prepared nitrous oxide standards in scrubbed whole air.

The sets of primary standards for nitrous oxide and methane show good internal consistency. The standard deviation of residuals from a 2nd order polynomial fit to twenty two methane standards, based on GC-FID analysis, is 3.4 nmol mol−1. For nitrous oxide, the standard deviation of residuals is 0.33 nmol mol−1 [57] based on GC-ECD analysis. In addition to CCQM comparisons, WMO-GAW scales have also been compared with other independent scales, such as those from the Scripps Institution of Oceanography (SIO) and Tohoku University (TU) [61]. Through comparison of more than twenty years of atmospheric measurements at multiple sites, a divergence between the SIO scale 'SIO-98' and the WMO scale 'WMO-N2O-X2006A' has been observed. A recent update of the SIO scale (now SIO-16) has improved the comparison, but small differences remain [62]. This underscores the need for repeated comparisons.

3.1.2. Manometry.

The manometric determination of carbon dioxide in air involves loading a sample of dry whole air into a known volume ('large volume') where its temperature and pressure are determined. Carbon dioxide is then cryogenically extracted from that air sample, cryogenically purified to remove (residual) water vapour, and transferred into a much smaller volume ('small volume'). After reaching thermal equilibrium, the temperature and pressure of the purified carbon dioxide in the small volume are determined. The volumes of the large and small volumes need not be known exactly, but their ratio must be well-known. The amount fraction of carbon dioxide in dry air is derived from ratios of temperature, pressure, and volume, with appropriate corrections for non-ideality [63]. Since nitrous oxide cannot be separated from carbon dioxide cryogenically, a correction must be made for the amount of nitrous oxide and its non-ideality.

Manometry has been used to prepare standards for carbon dioxide measurements since C D Keeling began measuring carbon dioxide in background (remote) air at the South Pole in 1957, and at Mauna Loa, Hawaii in March 1958, for Scripps Institution of Oceanography (SIO) [64]. SIO data are traceable to carbon dioxide primary standards value-assigned using either a constant-volume mercury-column manometer (CMM) or an electronic constant-volume manometer (ECM) [6568].

The CMM used at SIO had a large volume of 5014 cm3 and a small volume of 3.80 cm3 [69]. Pressure was measured by comparing the height of mercury in a column containing the sample to the height of mercury in a vacuum column. Temperatures were determined using mercury thermometers. Cathetometers were used to read the mercury column heights and temperatures through windows in a temperature-controlled housing. The volume ratio was determined by directly weighing the main volumes filled with water or mercury, and by measuring pressure and temperature after expanding gas (carbon dioxide, argon or nitrogen) from the small volume into successively larger volumes [67].

The SIO carbon dioxide amount fraction scale, as determined from CMM measurements of a series of primary standards, was used by the WMO Background Air Pollution Monitoring Network, a predecessor to the GAW programme, as the world reference for carbon dioxide from 1975–1995. Primary standards were measured using the CMM approximately every two years. Results of the CMM measurements were documented in technical reports (e.g. [70, 71]). In 1995, NOAA assumed the role of CCL and established manometric capabilities. The NOAA manometer was modelled on the Scripps CMM, but with electronic devices for pressure and temperature measurement [63, 72]. NOAA prepared a suite of 15 primary standards, which were initially value-assigned based on both the SIO CMM and the NOAA ECM from 1995–2001. In 2001, NOAA derived an independent scale based solely on NOAA measurements [72]. Similarly to SIO, NOAA analyses primary standards using their ECM approximately every two years. The current WMO-GAW scale (WMO-CO2-X2007) is based on NOAA manometric results from 1996 through 2007. NOAA has continued to measure primary standards every two years and has presented updated manometric results at WMO-GAW meetings [73, 74].

Repeatability (estimated as the standard deviation) of CMM and ECM instruments is about 0.018% (0.07 µmol mol−1 at ~400 µmol mol−1) [69, 72] and has improved over time (figure 3).

Figure 3. Refer to the following caption and surrounding text.

Figure 3. Results from one NOAA primary standard at ~400 µmol mol−1, showing 10 calibration episodes. Consistency between episodes has improved over time. Here, the standard deviation of all measurement results is 0.1 µmol mol−1 but improves to 0.04 µmol mol−1 from 2004–2015.

Standard image High-resolution image

Historically, several issues complicated the long-term maintenance of carbon dioxide scales using CMM and ECM. Carbon dioxide 'drift' in cylinders is of primary concern, and can be at the limit of what can be detected over even a decade, given the typical expanded uncertainty associated with the technique (~0.05%). Changes in manometer-assigned values over time could be the result of cylinder drift, or changes in the manometer. This dilemma confounded SIO to the point that two separate scales were derived in 1999: one in which cylinders were assumed to be stable and any observed drift attributed to changes in the volume ratio (SIO scale X99A), and one in which the manometer was assumed to be stable and the cylinders thought to be drifting (SIO scale X99B) [69]. This issue was likely complicated by the fact that some of the early SIO primary standards were prepared in steel cylinders, which tend to show higher drift rates than aluminium cylinders [39]. Nevertheless, drift in the amount fraction is not easily resolved. Keeling et al [69] investigated multiple issues related to drift, and concluded that their small volume did not change between 1985 and 2016, but their thermometers drifted 0.11 °C between 1961 and 2006, and that some of the auxiliary volumes (used to determine the volume ratio) changed over time as they were annealed and 'ashed' each year to remove grease. A revised SIO scale is under development and will incorporate these latest findings.

Interactions of carbon dioxide with manometer surfaces also complicate interpretation of manometer results. SIO has documented small changes in their CMM results following cleaning to remove grease and residual mercury and NOAA is investigating the possibility that carbon dioxide adsorbs to or permeates O-rings. Despite these issues, both SIO and NOAA have maintained carbon dioxide scales that have remained remarkably stable, and in close agreement, for many years [69, 72].

BIPM has developed an all SilcoNert treated stainless steel ECM with metallic, instead of elastomer, O-rings in all valves to address the interaction of carbon dioxide with glass and other surfaces. Preliminary results show good agreement between this system and the all-glass NOAA system.

3.1.3. Quality assurance and quality control.

To address scientific needs for interpreting atmospheric observations of the major greenhouse gases, the GAW programme sets ambitious network compatibility goals, which are reviewed and, if necessary, revised during biennial meetings of the WMO-GAW community. Network compatibility goals are defined for the unpolluted troposphere and represent the maximum bias that can generally be tolerated in measurements of well- mixed air used in global models to infer fluxes. Extended network compatibility goals are provided as a guideline for many other studies in which the smallest bias tolerance is not required [30]. These network compatibility goals currently stand at  ±0.05 µmol mol−1 for carbon dioxide in the Southern Hemisphere, relaxed to  ±0.1 µmol mol−1 elsewhere, ±2 nmol mol−1 for methane, and  ±0.1 nmol mol−1 for nitrous oxide. The extended goals are set to  ±0.2 µmol mol−1 for carbon dioxide, ±5 nmol mol−1 for methane, and 0.3 nmol mol−1 for nitrous oxide.

The WMO-GAW programme has implemented a quality management framework (QMF) to ensure consistent scales and comparability, in the recognition that 'collecting adequate information on the chemical composition of the atmosphere and on the consequences of the anthropogenic impact on different spatial and temporal scales is valuable and possible only if all the relevant measurements are expressed in the same units and on the same scale and if data from different countries and from different sites are comparable' [75]. Further details on the WMO/GAW QMF can be found in the WMO GAW Implementation Plan: 2016–2023 [75]. Briefly, the QMF requires CCLs maintain and enable access to the primary network standards and calibration scales. CCLs are encouraged to compare the WMO-GAW scales to other independent scales whenever possible, e.g. through participation in key comparisons organised by BIPM [22, 23] and other exercises as appropriate. All WMO measurements are made traceable to an appropriate scale [30, 53], which leads to high internal consistency of data within and between networks. World Calibration Centres (WCCs) organise comparison campaigns and performance audits at GAW sites using transfer standards as an independent check of the traceability to the CCL. World Data Centres (WDCs) provide archiving facilities, perform plausibility and consistency checks on submitted data, and provide feedback to the data providers when necessary. QA/QC related issues are further coordinated by Scientific Advisory Groups (SAGs) and Quality Assurance/Scientific Activity Centres (QA/SACs).

Zellweger et al [76] published results of performance audits made by the WCC for methane (2005–2014) and carbon dioxide (2010–2015) that showed newer spectroscopic techniques such as CRDS have clear advantages in stability and linearity of the response compared to traditional methods. Figure 4 shows an update of the analysis made by Zellweger et al [76] including audit data until 2018 for methane. Briefly, the bias in the centre of the relevant amount fraction of the unpolluted troposphere versus the slope of the linear regression analysis of the standard comparison is shown, which results in allowed bias/slope combinations meeting the WMO-GAW network compatibility (green area) and extended network compatibility goals (yellow area) for the relevant amount fraction. Figure 4 also shows comparisons made between the WCC and standards prepared by NMIs. The NMI standards showed an offset of about 2–3 nmol mol−1 compared to the WMO-CH4-X2004A calibration scale, which is in good agreement with the results of the BIPM key comparison CCQM-K82 [22].

Figure 4. Refer to the following caption and surrounding text.

Figure 4. (Left) Analysis of WCC results. Methane bias versus the slope of the audit for individual travelling standard comparisons. Different symbols and colours indicate different measurement techniques of the station analysers. The error bars correspond to the uncertainty of the slope and the bias (1σ). The green and yellow areas correspond to the WMO/GAW compatibility and extended compatibility goals for the relevant amount fraction range. (Right) Analysis of the WMO Round Robin (RR6) results. Different colours refer to the calibration scales used by the participating laboratory.

Standard image High-resolution image

The results obtained during the WCC station performance audits are similar to those from recent WMO/IAEA Round Robin Comparison Experiment (RR6) organised and coordinated by the CCL for methane hosted by NOAA. RR6 took place in 2014/15 with 35 participating laboratories, and involved two methane standards, one containing (average 1756.2 nmol mol−1) and the other containing (average 1943.1 nmol mol−1) amount fraction [77]. With this data set, the analysis described above was repeated after exclusion of one laboratory, which was far outside the compatibility goal. This excluded laboratory was also audited by the WCC in 2012 and the results were excluded because the instrument was found to be unsuitable for methane analysis. The results of RR6 are also presented in figure 4 (right panel). The percentage of laboratories fulfilling the WMO network compatibility and extended network compatibility goal during WMO RR6 was similar to the results from the WCC station performance audits. The laboratories reporting data on scales other than WMO-CH4-X2004 and WMO-CH4-X2004A were Japanese labs that used their own gravimetric calibration scales.

Currently, QA/QC is still not systemised for measurements of stable isotopes; there is no WCC identified and there have been no performance audits performed. These shortcomings need to be addressed [30]. The work performed at the CCL aimed at the practical scale-realisation by JRAS mixtures should be independently verified by another laboratory [30]. This is crucial as the stable isotope scales are artefact-based and there is no primary method in use to verify absence of drifts and/or unrecognised biases. Round-robin inter-comparisons, the only QA tool currently in use, demonstrate lab-to-lab discrepancies in δ13C and δ18O on air-CO2 which largely exceed the WMO network compatibility goals [30, 55]. These discrepancies, mainly offsets, in RR6 appear to be largely driven by the fact that most laboratories are not yet on the CCL realisation of the VPDB scale and several improvements have been proposed at GGMT-2017 for RR7 [30].

Figure 5 shows an update of the analysis made by Zellweger et al [76] including audit data until 2018 for carbon dioxide. The same analysis as above was made and also applied to the RR6 results. Again, the addition of new performance audit data confirms the advantage of the CRDS/off-axis integrated cavity output spectroscopy (OA-ICOS) techniques compared to NDIR. Figure 5 also shows comparisons with NMIs and Japanese laboratories reporting on their own calibration scales. Compared to the methane study, the proportion of laboratories complying with the WMO network compatibility goal was smaller for the audits by the WCC, which indicates that the transfer of the calibration to the actual field measurement is more challenging for carbon dioxide. However, this significantly improves if only audits of CRDS instruments are considered.

Figure 5. Refer to the following caption and surrounding text.

Figure 5. (Left) Analysis of WCC results. Carbon dioxide bias versus the slope of the audit for individual travelling standard comparisons. Different symbols and colours indicate different measurement techniques of the station analysers. The error bars correspond to the uncertainty of the slope and the bias (1σ). The green and yellow areas correspond to the WMO/GAW compatibility and extended compatibility goals for the relevant amount fraction range. (Right) Analysis of the WMO Round Robin (RR6) results. Different colours refer to the calibration scales used by the participating laboratory.

Standard image High-resolution image

3.2. State of the art in underpinning isotope ratio measurement

The isotopic composition of a sample is generally reported in terms of isotope delta 'δ', which is the relative differences of the isotope ratio of a sample to a scale-defining standard, i.e. an artefact accepted by consensus. This is expressed in equation (2)

Equation (2)

where (i) denotes the higher and (j ) the lower atomic mass of Element E and R  =  N(iE)/N(jE) is the isotope ratio of the sample (S) or the reference standard [78]. Over the last 60 years a simplified version of equation (2) has asserted itself in the literature (at least regarding the light stable isotopes carbon, hydrogen, oxygen and nitrogen), where the lower atomic mass (j ) of an element is omitted [79]. δ values are typically small and are usually multiplied by 1000 and expressed as 'per mil' (‰).

3.2.1. Artefact based stable isotope scales (for δ13C, δ18O, δ2H, δ15N) and developments at the IAEA.

In the 1960s the IAEA established tritium and stable isotope measurements of water in precipitation that required the development of specific stable isotope reference materials. In particular, this programme required the development of both the scale-defining reference materials and high-level reference materials aimed at scale realisation for stable isotope measurements of hydrogen, carbon, nitrogen, oxygen and sulphur. As custodian for these stable isotope scales, the IAEA maintains the stable isotope scales, monitors and introduces replacement primary reference materials, and develops new reference materials to address critical applications [15].

Currently, all stable isotope scales for δ13C, δ18O, δ2H and δ15N are based on artefacts, i.e. the ratio of isotopic species in the artefact used to define the scale-zero and where possible a scale span. These historical artefacts (or their later replacements) have also been used for the highest-level scale-realisation. With the exception of the δ15NAIR scale, these are now exhausted for distribution. The scale defining materials include VSMOW, NBS19 and Air-N2 (see table 3), and in contrast to greenhouse gas amount fraction scales, these scales are not based on an SI-traceable method, i.e. a mixture of isotopic entities (13C and 12C) with a desired isotope ratio. Nevertheless, best current measurements of the absolute ratios of scale defining primary standards exist and are the basis of calculating atomic weights of natural materials [80]. The exhausted scale-zero defining materials SMOW and PDB were first replaced by VSMOW and NBS19 (NBS19 defines 13C and δ18O values close to the scale zero) [84] and later have been replaced by VSMOW2 and IAEA-603 [15, 81, 82] with the value assignment based on measurements against the VSMOW and NBS19 materials. The first replacements are reflected in adding V (Vienna) to the scale abbreviations. IAEA-N-1 is the primary reference material for the N2-Air scale [78].

Table 3. Accepted scale-defining materials, primary reference materials and second scale anchors.

Delta value Isotope ratio Scale Scale definining reference material Primary reference material Consensus value Second scale anchor Consensus value
δ13C 13C/12C VPDB NBS19 (marble)a Previously NBS19 (marble) 1.95‰ exactly Previously LSVEC (lithium carbonate)b −46.6‰ exactly
Currently IAEA-603 (marble) 2.46‰  ±  0.01‰ Nonec  
δ18O 18O/16O VPDB, VPDB-CO2 NBS19 (marble)a Previously NBS19 (marble) −2.20‰ exactly None
Currently IAEA-603 (marble) −2.37‰  ±  0.04‰ None
VSMOW VSMOW (water) Previously VSMOW (water) 0‰ exactly VSMOW (water) −55.5‰ exactly
Currently VSMOW2 (water) 0.00‰  ±  0.02‰ VSMOW2 (water) −55.5‰  ±  0.02‰
δ2H 2H/1H VSMOW VSMOW (water) Previously VSMOW (water) 0‰ exactly Previously VSMOW (water) −428‰ exactly
Currently VSMOW2 (water) 0.00‰  ±  0.3‰ currently VSMOW2 (water) −427.5‰  ±  0.3‰
δ15N 15N/14N N2-air N2-air IAEA-N-1 (ammonium sulfate) 0.43‰  ±  0.07‰ USGS32 (potassium nitrate) 180‰

aExhausted. bDiscontinued [88]. cUse of other available reference materials instead of LSVEC is recommended [88].

The replacements, VSMOW2 and IAEA-603 were introduced to guarantee the consistency of the scale realisation. VSMOW2 was prepared at the IAEA by careful mixing of waters and is identical to VSMOW, within the uncertainties [83]. In contrast, marble carbonates of the desired isotopic composition cannot be prepared, so the isotopic composition of IAEA-603 [82] differs from that of NBS19.

All stable isotope ratio measurement instruments have been shown to suffer from some systematic effects (e.g. memory, cross-contamination, linearity, interference, sample gas preparation steps etc) that can influence the correctness of measurement results and require harmonised calibrations (see section 4). Therefore, in addition to the primary reference material a second scale 'anchor' (with its value assigned based on measurements against primary reference material under optimal conditions) is needed to synchronise the calibration scale span and to be used for data normalisation.

Standard light antarctic precipitation (SLAP), the second scale anchor accepted on the VSMOW scale (δ2H  =  −428‰ and δ18O  =  −55.50‰, values assigned without uncertainty [84]) was developed and later replaced by SLAP2 (δ2H  =  −427.5‰  ±  0.3‰ and δ18O  =  −55.50‰  ±  0.02‰ as assigned by measurements against SLAP [81]). The replacement SLAP2 was prepared at the IAEA by careful mixing of waters and is almost identical to SLAP (with 0.5‰ difference in δ2H for SLAP2) [83].

LSVEC (lithium carbonate) was introduced as the second anchor on the VPDB δ13C scale, with a value of  −46.6‰ accepted without uncertainty [85]. Later, LSVEC values were found to be drifting [86, 87] so IUPAC has recommended that LSVEC be discontinued as a reference material for δ13C data normalisation; other valid reference materials are suggested to be used for this purpose until a replacement for LSVEC is found [88]. However, values for these reference materials were assigned using LSVEC as a scale anchor so when the LSVEC replacement material is released the values of all standards that were standardised with LSVEC may need to be revised. Currently there are efforts underway to make a replacement available to the isotopic community as soon as possible.

All available stable isotope reference materials are based on and are traceable to the historical artefacts. The hierarchy of reference materials (traceability chain) is described below and as the chain is descended, the uncertainty of the reference material increases.

  • Historical artefacts are used to define the scales and as a concept, each scale-defining material has its value assigned with zero uncertainty. In most cases, these artefacts are no longer accessible [78].
  • Primary reference materials which either are, or have the closest link to, the historical artefacts. These primary reference materials are used to realise their respective scales, e.g. VSMOW and NBS19 and their replacements VSMOW2 and IAEA-603 (table 3). Like other reference materials, they have an uncertainty (at least due to homogeneity) and this is well characterised for VSMOW2 and IAEA-603 [81, 82].
  • The scale-anchors. These materials are used to perform a two-point data normalisation for the scale. Example materials are SLAP (now replaced by SLAP2) for water and LSVEC (see below) for carbon.
  • Secondary reference materials. These materials have been characterised against primary reference materials with their values normalised against the scale-anchors. They have larger uncertainty than the primary reference materials.
  • Tertiary level reference materials. While there is no strict formal recognition of tertiary level reference materials there may be a valid case for recognising these in some instances. An example is carbon dioxide gas mixtures prepared and characterised against primary or secondary reference materials (carbonates), such as the JRAS air sets [89]. Other examples include matrix reference materials for GC combustion applications. In particular, this class of reference material is susceptible to stability issues and there is a need for it to be re-characterised periodically against more stable reference materials. Tertiary level materials have larger uncertainty than secondary reference materials
  • Working laboratory standards. These are generally produced for specific laboratory applications and have the largest uncertainty that includes uncertainty from the reference material used to assign values and a component introduced by the two-point data normalisation.

The uncertainty introduced by the two-point data normalisation is intrinsic to all secondary and tertiary reference materials as well as working laboratory standards. Often a pair of reference materials is used for calibration purpose (addressing two-point data normalisation requirement), with a third reference material to be used for quality control.

3.2.2. Developments at the IAEA.

Critical uncertainty limits for international reference materials are imposed by δ13C observations of atmospheric carbon dioxide. Inter laboratory comparability of such measurements should be within 0.01‰ to allow meaningful comparisons of data [30]. This, as well as the LSVEC-drift found [86, 87] and the strong need for an LSVEC-replacement, supports that IAEA focus on developing new reference materials for δ13C. Further improvements to the QA/QC, addressing the recently introduced ISO 17034 for reference material producers [90] and comprehensive uncertainty evaluation based on metrological principles have been recommended [15]. Using this new approach, the IAEA-603 preparation [82] included several steps as (i) a batch production of 5000 ampules flushed with argon and sealed off (material has been reserved for production of 4 such batches), (ii) 52 ampoules selected for the homogeneity study; (iii) extensive characterisation study performed at the IAEA. The assigned value δ13C  =  2.46‰  ±  0.01‰ (k  =  1) and δ18O  =  −2.37‰  ±  0.04‰ (k  =  1), for IAEA-603 have a dominating uncertainty component due to homogeneity (type A uncertainty) estimated over 5000 ampoules rather than characterisation measurements against the NBS19 primary reference (type B uncertainty) [82].

Three new carbonate reference materials, under development at the IAEA, have been sealed in glass ampoules in 2018 (around 11 000 in total) and material for further batches has been reserved. The development of carbon dioxide gas reference materials is planned to commence in 2019. Both types of reference material will cover a range of δ13C and δ18O values, have well-characterised uncertainty and be well-preserved to eliminate storage effects. These reference materials will be used for harmonised realisation of the VPDB δ13C scale and provide several scale-anchors on the scale. Thus, users will be able to select anchors in the most suitable form (carbonate or carbon dioxide) and with the most appropriate δ13C values. In particular, several scale anchors for δ13C will help (i) to monitor drifts in δ13C, if any and (ii) verify exactness of the 17O correction [91] applied in the mass-spectrometry software or in user' calculations. The value and uncertainty assignment for new reference materials shall be based on the traceability to the highest reference materials, careful validation of analytical methods in use and thoughtful assessment of all corrections involved as well as distinguishing type A and type B uncertainty components.

To realise scales independent from the historical artefacts, it is important that the absolute ratios (13C/12C, 18O/16O, 2H/1H, 15N/14N) of the scale-artefacts are determined [15]; current best measurements of the absolute ratios as recommended by IUPAC can be found in Meija et al and references therein [79]. Several attempts to determine the 13C/12C value of the VPDB scale zero [9295] have been made, however discrepancies among several 13C/12C determinations were observed and the absolute ratios are still not known with the uncertainty required for robust scale realisation which, for δ13C, must be 0.01‰ or lower. In addition, with the development of optical spectroscopic methods for stable isotope measurements, it will be necessary to develop reference materials or calibration approaches that allow both mass-spectrometry and optical spectroscopy results to be reported on the same reference scales. The realisation of isotopic scales may not be straight forward for users of optical analysers, i.e. scaling δ2H measurements of methane gas to VSMOW (primary reference is water), or δ13C measurements of atmospheric carbon dioxide to VPDB (primary reference is carbonate). Therefore, other practical solutions such as optical transfer standards [96] may be considered for the future. For isotopes of atmospheric carbon dioxide, the Jena reference air sets (JRAS), represent standards usable by both optical, and IRMS instruments (see section 3.3).

3.3. Isotopic analyses of atmospheric carbon dioxide, methane and nitrous oxide

Table 4 presents gas reference materials. A comprehensive list of gas standards for isotope analyses is provided by Brand et al [78].

Table 4. Delta values of gas reference materials [84, 97101]. The site preference (SP) is defined as the difference in 15N substitution between the central (δ15Nα) and terminal (δ15Nβ) nitrogen position of the nitrous oxide molecule (SP  =  δ15Nα  −  δ15Nβ).

CO2 ${{\delta }^{{\rm 13}}}{{{\rm C}}_{{\rm VPDB{{\text -}}C}{{{\rm O}}_{{\rm 2}}}}}$ ${{\delta }^{18}}{{{\rm O}}_{{\rm VPDB{{\text -}}C}{{{\rm O}}_{2}}}}$ Comment
NIST RM 8562 −3.72‰  ±  0.035‰ −18.49‰  ±0.22‰ CO2 heavy
NIST RM 8563 −41.59‰  ±  0.045‰ −33.52‰  ±  0.24‰ CO2 light
NIST RM 8564 −10.45‰  ±  0.035‰ −10.09‰  ±  0.20‰ CO2 biogenic
NARCIS I −8.55‰  ±  0.02‰ −0.7‰  ±0.06‰  
NARCIS II +1.923‰  ±  0.003‰ −2.6‰  ±0.06‰  
N2O ${{\delta }^{15}}{\rm N}_{{\rm AIR{{\text -}}}{{{\rm N}}_{2}}}^{\alpha }/{{\delta }^{15}}{\rm N}_{{\rm AIR{{\text -}}}{{{\rm N}}_{2}}}^{\beta }$ a ${{\delta }^{15}}{{{\rm N}}_{{\rm AIR{{\text -}}}{{{\rm N}}_{2}}}}/{{\delta }^{18}}{{{\rm O}}_{{\rm VSMOW{{\text -}}SLAP}}}$ a Comment
USGS51 +0.48‰  ±  0.09‰/+  2.15‰  ±  0.12‰ +1.32‰  ±  0.04‰ SP  =  −1.67‰
+41.23‰  ±  0.04‰
USGS52 +13.52‰  ±  0.04‰/−  12.64‰  ±  0.05‰ +0.44‰  ±  0.02‰ SP  =  +26.15‰
+40.64‰  ±  0.03‰

aDelta values are a preliminary assessment provided by Naohiro Yoshida and Sakae Toyoda/Tokyo Institute of Technology. Uncertainties are standard deviations of repeated measurements against the laboratory reference gas and do not include any calibration uncertainties of the laboratory reference gas with respect to AIR-N2 and VSMOW.

3.3.1. Carbon dioxide.

δ13C and δ18O isotopic values of atmospheric carbon dioxide are reported on the VPDB and VPDB-CO2 scales respectively. The quantitative conversion of the carbonate to carbon dioxide using the classical method, i.e. the reaction with 100% phosphoric acid at 25 °C [102], does not lead to a fractionation of the carbon isotopic signature. However, the oxygen isotopic signature of the carbonate is not preserved as only 2 out of 3 oxygen atoms are transferred to the carbon dioxide molecule with some fractionation involved [100]. This is why δ18O values of carbon dioxide gas are reported on the VPDB-CO2 scale, and not on the VPDB scale. Correspondingly, the oxygen isotopic composition of the produced carbon dioxide was found to be dependent on the reaction temperature, the isotopic composition of the acid, its concentration and the water content [103]. Since the results of the phosphoric acid reaction depend on numerous variables, the reaction is one of the limiting factors when considering inter-laboratory comparability of the isotopic composition of carbon dioxide gas. For example, a comparison between three laboratories has shown that carbon dioxide evolved from NBS 19 carbonate varies by no more than 0.003‰ in its δ13CVPDB value, but by over 0.1‰ in its ${{\delta }^{{\rm 18}}}{{{\rm O}}_{{\rm VPDB{{\text -}}C}{{{\rm O}}_{{\rm 2}}}}}$ value [100]. One way to circumvent this problem is to make standard carbon dioxide gases with certified δ13CVPDB and ${{\delta }^{{\rm 18}}}{{{\rm O}}_{{\rm VPDB{{\text -}}C}{{{\rm O}}_{{\rm 2}}}}}$ values available (NIST RM 8562, NIST RM 8563, NIST RM 8564, NARCIS I, NARCIS II, but table 4 demonstrates that more effort may still be needed).

Since GAW long term data comparability between laboratories requires that the same reference scale realisation be used, the WMO designated the Stable Isotope Lab of the Max Planck Institute for Biogeochemistry (BGC-IsoLab) as CCL for stable isotopic calibrations of measurements on atmospheric carbon dioxide. The BGC-IsoLab developed the JRAS-06 realisation of the VPDB-CO2 scale consisting of two carbon dioxide in air mixtures [89, 104]. The carbon dioxide is derived from two carbonates, MAR-J1 (${{\delta }^{{\rm 13}}}{{{\rm C_{\rm VPDB}}}}$ :  +1.958‰  ±  0.010‰; ${{\delta }^{{\rm 18}}}{{{\rm O}}_{{\rm VPDB{{\text -}}C}{{{\rm O}}_{{\rm 2}}}}}$ :  −2.578‰  ±  0.031‰) and OMC-J1 (${{\delta }^{{\rm 13}}}{{{\rm C}}_{{\rm VPDB}}}$ :  −4.373‰  ±  0.014‰; ${{\delta }^{{\rm 18}}}{{{\rm O}}_{{\rm VPDB{{\text -}}C}{{{\rm O}}_{{\rm 2}}}}}$ :  −8.928‰  ±  0.025‰) [104]. The BGC-IsoLab has developed an automated carbon dioxide preparation system, ARAMIS (acid reaction and mixing system) that is used for the phosphoric acid reaction [89, 103] and routinely produces carbon dioxide gas from carbonates, including primary international reference materials, with a precision of 0.01‰ and 0.03‰ for δ13CVPDB and ${{\delta }^{{\rm 18}}}{{{\rm O}}_{{\rm VPDB{{\text -}}C}{{{\rm O}}_{{\rm 2}}}}}$ , respectively. The produced carbon dioxide is mixed into carbon dioxide free air achieving an amount fraction close to ambient, and these mixtures are then analysed by cryogenically extracting the carbon dioxide [89] in order to assign δ13CVPDB and ${{\delta }^{{\rm 18}}}{{{\rm O}}_{{\rm VPDB{{\text -}}C}{{{\rm O}}_{{\rm 2}}}}}$ values. Since the mixing procedure and subsequent analyses introduces only small isotopic offsets between the pure carbon dioxide, and that mixed into carbon dioxide free air [100], isotopes of carbon dioxide in air are reported on the JRAS-06 scale realisation of the VPDB-CO2 scale.

3.3.2. Methane.

The 13C/12C and 2H/1H isotope ratios of methane are reported on the VPDB and the VSMOW scale, respectively. The first δ13C measurements on atmospheric methane were made at the National Institute for Water and Atmosphere Research (NIWA, NZ) [105]. A barium carbonate standard (NZCH, later distributed as IAEA-CO-9) and a carbon dioxide standard gas (NBS-16) were used to standardise measurement results. These reference materials, re-characterised against LSVEC in 2006 [85], offered by NIST and IAEA in the past, are now exhausted.

Most laboratories have developed their own lab-standards and associated protocols to link δ13C and δ2H methane isotope data to the VPDB-, and VSMOW-scales, respectively [106]. The use of different calibration approaches in laboratories, different laboratory standards, inconsistent use of the 17O correction and not correcting for instrumental memory and cross-contamination have contributed to inter-laboratory measurement offset of up to 0.5‰ for δ13C, and up to 13‰ for δ2H measurements of atmospheric methane isotopes [106]. These inter-laboratory offsets clearly indicate that a unified effort needs to be undertaken to improve calibrations [106].

The most recent work linking isotopes of methane to primary reference materials was undertaken by Sperlich et al [107]. They selected a suite of pure methane gases that are directly characterised in the δ13CVPDB and δ2HVSMOW-SLAP values via elemental analyses (EA) and high temperature conversion (HTC) IRMS techniques, respectively. Significantly, the δ13C values of the methane gases were calibrated using LSVEC as a scale anchor, which means that these values may need to be revised once an LSVEC replacement is available. To address commutability requirements and have reference materials as close in composition as possible to samples, calibrated methane gas was mixed into methane free air, and these mixtures used to calibrate the working standard for methane in air measurements at the BGC-IsoLab [108]. δ2H–CH4 scales linked to the δ2HVSMOW-SLAP scale and based on synthetic mixtures of pure isotopologues [109] are in use at the Max-Planck-Institute for Chemistry in Mainz, Germany, and the Institute for Marine and Atmospheric research in Utrecht, Netherlands [106]. However, we note that the work by Sperlich et al [107] is the first attempt to generate a δ2H–CH4 methane in air mixture that is directly traceable to primary reference materials.

3.3.3. Nitrous oxide.

15N/14N and 18O/16O isotope ratios of nitrous oxide are reported on the AIR-N2 and the VSMOW scale, respectively. Being an asymmetric linear molecule the central (α) and terminal (β) nitrogen atoms of nitrous oxide provide position-specific ${{\delta }^{{\rm 15}}}{{{\rm N}}_{{\rm AIR{{\text -}}}{{{\rm N}}_{{\rm 2}}}}}$ isotopic information [110]. The difference between ${{\delta }^{{\rm 15}}}{{{\rm N}}_{{\rm AIR{{\text -}}}{{{\rm N}}_{{\rm 2}}}}}$ values of the central (δ15Nα) and terminal (δ15Nβ) nitrogen position of the nitrous oxide molecule is called site preference and defined as SP  =  δ15Nα  −  δ15Nβ, while the average ${{\delta }^{{\rm 15}}}{{{\rm N}}_{{\rm AIR{{\text -}}}{{{\rm N}}_{{\rm 2}}}}}$   =  (δ15Nα  +  δ15Nβ)/2. The synthesis of nitrous oxide by the thermal decomposition of isotopically characterised ammonium nitrate has been suggested as an approach to link the position-dependent nitrogen isotopic composition of nitrous oxide to AIR-N2 [110]. During decomposition, the nitrogen atom of the ${\rm NO}_{3}^{-}$ ion is converted into the central nitrogen position of N2O, while the end nitrogen comes from the ${\rm NH}_{{\rm 4}}^{+}$ ion. However, the accuracy of this approach for the calibration of δ15Nα and δ15Nβ was found to be limited by non-quantitative ammonium nitrate decomposition in combination with substantially different isotope enrichment factors for the conversion of the ${\rm NO}_{3}^{-}$ or ${\rm NH}_{{\rm 4}}^{+}$ nitrogen atom into the α or β position of the nitrous oxide molecule [111]. The yield of the ammonium nitrate decomposition reaction to nitrous oxide was found to be dependent on reaction conditions, e.g. temperature, which impedes the implementation of this technique in many laboratories [112]. Consequently, the compatibility between eleven IRMS and laser spectroscopy laboratories was found to be limited to approximately 6‰ for the nitrous oxide site preference [113]. In order to overcome such experimental artefacts when comparing data between laboratories, two new nitrous oxide reference gases, USGS51 and USGS52 have been produced [98] and are available (table 4). The delta values provided for USGS51 and USGS52 are based on a preliminary assessment by Naohiro Yoshida and Sakae Toyoda/Tokyo Institute of Technology without uncertainty estimates and offer only a small range of δ15N and δ18O values (<1‰), they are not suitable for a two-point calibration. Therefore, novel nitrous oxide reference materials are being developed within the European Metrology Programme for Innovation and Research (EMPIR) 16ENV06 project 'Metrology for Stable Isotope Reference Standards (SIRS)' [114]. Reference materials will be available as pure gas, to be used directly for traditional mass-spectrometry and/or for mixture preparation by users and as nitrous oxide diluted in whole air for laser spectroscopic measurement techniques.

4. Advances in measurement techniques

Prior to the development of modern commercial spectroscopic analysers, atmospheric carbon dioxide, methane, and nitrous oxide were most frequently measured using gas chromatography where packed columns containing Porapak Q or Hayesep Q are used to separate these components from other air components. Detection is accomplished using a flame ionization detector (FID) (for methane and carbon dioxide after conversion to methane on a nickel catalyst in the presence of hydrogen) or an electron capture detector (ECD) (for nitrous oxide) [115118]. The linear response of an FID makes the GC method an attractive technique for quasi-continuous sampling. Repeatability standard deviations of approximately 0.1 µmol mol−1 and 1 nmol mol−1 can be achieved for carbon dioxide and methane, respectively [115]. For nitrous oxide, a repeatability standard deviation of 0.1 nmol mol−1 is possible using GC-ECD, but this is difficult to achieve (0.2–0.3 nmol mol−1 is more typical) and the response is typically non-linear requiring careful attention to calibration strategy [115, 119]. Detection of nitrous oxide by ECD is indirect (i.e. there is no net electron capture). It requires a dopant gas, such as methane, oxygen, or carbon dioxide, to facilitate electron capture [120]. Thus, nitrous oxide analysis by GC-ECD is typically carried out using 5% argon in methane (P-5), or carbon dioxide doped nitrogen [57] as a carrier gas. High-precision work using GC methods requires high-quality carrier gases, particularly with respect to nitrous oxide.

For carbon dioxide, NDIR methods were more common than GC methods and are still used. An advantage of the NDIR method is that the analysers are relatively inexpensive and robust. However, their response tends to be non-linear, and frequent (sub-daily) calibration over a range of amount fractions is typically required. Repeatabilities of a few nmol mol−1 can be achieved with care [72, 116, 121]. It is also important to consider matrix effects, including those related to carbon dioxide in air versus carbon dioxide in nitrogen [122, 123], and those related to the relative abundances of stable isotopes of carbon dioxide between samples and standards [47]. Analyser response to the minor isotopologues of carbon dioxide, such as 16O13C16O, can vary among analysers, even from the same manufacturer [124]. This can lead to small, and difficult to quantify, calibration offsets when samples and reference standards differ in the carbon dioxide isotopic composition. Tohjima et al [124] compared ambient air samples (δ13CVPDB  ≈  −8‰) to isotopically light references (δ13CVPDB  ≈  −32‰), and found offsets that ranged from  −0.04 to  −0.08 µmol mol−1 among different NDIR instruments.

4.1. Isotope ratio mass spectrometry

The basic measurement principle for high precision analyses of light (carbon, nitrogen, oxygen and hydrogen) stable isotopes was introduced by Nier in 1947 [125]. There are now numerous publications and text books with detailed histories and explanations on the measurement principles and applications of IRMS [126128]. IRMS instruments measure gases introduced into a 'Nier-Type' ion source and ionised by a tightly collimated stream of electrons [129]. The resultant ions leaving the ion source are accelerated into the magnetic analyser, where the ions are separated based on their mass to charge ratio. The discrete ion beams at integral masses are simultaneously detected with a number of Faraday cups that measure the number of ions arriving in the form of an induced electrical current [130]. The typical sensitivity of a modern IRMS, defined as the number of molecules needed to detect an ion, is between 1000–2000 molecules, depending on the component gas and instrumental parameters.

Modern IRMS systems may be operated in dual-inlet- or continuous flow-mode. Dual-inlet measurements are achieved using a system of gas filled fixed and/or variable volumes that are connected via a changeover valve to the ion source that allows direct comparative measurements of two gases [131]. This system enables high-precision measurements for carbon isotopes of carbon dioxide of 0.01‰ or better [132]. Continuous flow mode allows the IRMS to be coupled to numerous preparative analytical systems, such as elemental analysers [133, 134], or high temperature coupled to gas chromatograph separation columns [135, 136]. These systems enable the on-line analysis of different sample types by combustion, although at lower precision than dual inlet mode. The produced gas is introduced into the IRMS in a carrier gas stream (usually helium). The advent of this technique has made IRMS technique useful to a wide range of analytical fields including, but not limited to geochemistry and food authentication [137, 138].

The interest in clumped isotope analysis [139] has necessitated the development of higher resolution IRMS systems. Separating isotopologues with more than one rare isotope and isotopologues that have the same cardinal mass (e.g. 12C18O16O and 13C17O16O) requires very high mass resolution and ultra-clean baselines. Double focusing mass analysers reduce the energy dispersion of the ions, allowing the sector field to produce more focused ion beams which increases mass resolution [140, 141]. Interference from the high and low energy tails of adjacent masses is also suppressed.

4.2. Cavity ring-down spectroscopy and off axis integrated-cavity-output spectroscopy

CRDS was introduced by O'Keefe and Deacon three decades ago as an ultrasensitive laboratory technique for measuring gas-phase absorption spectra [142]. Since its inception, more than 4000 CRDS-based research articles have appeared in the peer-reviewed scientific literature testifying to the impact of the method. Although most CRDS applications involve gas samples, the technique also can be used to measure aerosols [143, 144] and liquids [145]. Useful reviews of CRDS and its applications can be found in [146148] and references therein.

In CRDS, a laser beam enters an optical resonator comprising at least two high-reflectivity mirrors and containing the sample. Turning off the laser light leads to an exponential decrease of the light intensity exiting the resonator with a decay rate depending on the mirror losses and sample absorbance. Unlike transmission-based techniques, this measurement is immune to intensity fluctuations in the light source as it is the decay (ring down) that is measured. Moreover, exceptionally long effective pathlengths (on the order of tens of km) in a compact mode volume (mm-scale diameter) and high-spectral resolution (sub MHz-level) can be realised. Early demonstrations of CRDS used pulsed lasers [149151], and by the early 2000s these sources were largely supplanted by tunable single-frequency continuous-wave diode lasers (CW-DLs) available throughout the visible, near-infrared and mid-infrared spectral regions [152155].

For an isolated absorption transition, the peak area equals the product of absorber number density and line intensity, independently of pressure-broadening mechanisms affecting the line shape. With single-frequency cavity excitation, spectral line shapes and peak areas can be accurately determined from observed decay times and optical frequency shifts [156, 157]. In this case, CRDS (similar to TDLAS in section 4.3) is an SI-traceable measurement of amount provided that the line intensity is known and the pressure-broadening mechanism is well understood. Ideally the line intensity constitutes an intrinsic standard, eliminating the need for calibration. In most applications, however, SI-traceability of CRDS spectra is achieved by calibration against gas standards, which are valid only for a particular set of conditions (e.g. spectrometer configuration, pressure, temperature and matrix gas composition) [158].

There are several methods for determining the line intensity. Data are usually analysed by fitting the peak area of a normalised line profile to the measured spectrum [159, 160]. Line intensity determinations from multiple spectroscopic observations can also be analysed and combined by global fits of semi-classical, effective dipole moment surfaces (DMSs) [161]. A third approach for determining line intensities is based on ab initio calculations of the DMS and measured potential energy surfaces (PESs) [162, 163] of the active components. These calculations compare favourably with measurements, with agreement at the 0.5% level for many examples [164166]. For all cases, the temperature dependence of the line intensity can be calculated based on the lower state energy, transition frequency and partition function [163]. When intensities are known a priori without reference to previous absorption measurements, high spectral resolution optical techniques like CRDS or TDLAS can be treated as a potential primary measurement for amount.

Spectra fitted using the Voigt profile (a probability distribution given by a convolution of a Cauchy–Lorentz distribution and a Gaussian distribution) yields relatively large systematic W-shaped fit residuals, resulting in fitted areas that usually underestimate the actual peak area [167]. Therefore, more advanced line profiles are required to minimise peak-area bias. Profile-dependent deviations in peak area ranging from 0.2% to 2% have been reported for CRDS measurements of carbon monoxide and water [164, 168]. However, when using line profiles with at least one more degree of freedom than the Voigt, relative differences between fitted areas are nominally 0.05% and difficult to distinguish. In the last decade line intensity measurements with relative uncertainties typically near 1% and approaching 0.1% have been reported [160, 165, 169, 170]. Accurate measurements have also been reported using other techniques including traditional FTIR (see section 4.4), and direct TDLAS (see section 4.3) and cavity-enhanced laser spectroscopy methods. Importantly, these measurements now provide the most stringent tests of ab initio calculations which are contributing to spectroscopic line lists [159, 165, 171].

High sensitivity transmission measurements can also be achieved using the OA-ICOS technique [172174]. OA-ICOS also uses high-reflectivity mirrors to produce an optical resonator with an effective pathlength much longer than the cell length, with reported pathlengths up to 27.5 km [175]. The off-axis cavity excitation results in a near-continuum of transverse spatial modes in the cavity; nearly eliminating the frequency selectivity of the optical cavity and making the spectrometer insensitive to cavity alignment. However, because it is difficult to model the spectrally resolved instrument function, converting measured transmitted spectra to amount requires instrument-specific calibration against reference gas mixtures and for equivalent matrix gases. Nevertheless, amount fraction detection limits of  ∼10−10 mol mol−1 or below are achievable for infrared transitions of small molecules including carbon monoxide, carbon dioxide and water. OA-ICOS analysers have been used to measure amount and isotopologue ratios of carbon dioxide and methane [176], oxygen [177] and water [178]. Other applications include rapid measurements of gas flux and eddy covariance [179] as well as real-time chemical process measurements [180] and open-path emissions monitoring [181].

4.3. Tunable diode laser absorption spectroscopy

Laser absorption spectroscopy (LAS) [182] is a powerful tool for gas analysis and nearly as old as the laser itself. LAS converts the laser's extreme spectral resolution and power density into high chemical selectivity and sensitivity. Tunable diode laser (TDL) based LAS (i.e. TDLAS) strongly spurred industrial [183] and environmental [184] diagnostics. Continuous technical TDL advantages in the meantime allow compact, portable, field-deployable TDLAS instruments, which routinely realise near real-time gas analysis with up to pmol mol−1 precision [185, 186]. This made TDLAS a complementary technique for quantification of greenhouse gases and because of their highly characteristic rotational-vibrational transitions, the respective singly and doubly substituted isotopic components (figure 6) [187]. Several examples include the analysis of 12C16O, 12C17O, 12C18O [188], 13CO2, 12CO2 [189], N2 16O, N2 18O [190] and 12CH4, 13CH4, 12CH3D [109]. Recently H2 16O, H2 18O has been analysed even inside ice clouds by TDLAS employing NIR DFB lasers [191, 192]. Quantum Cascade Laser (QCL) based isotope ratio measurements were demonstrated for carbon dioxide (δ13C, δ18O) [193, 194], N2O (δ15Nα, δ15Nβ, δ18O) [195, 196] methane (δ13C, δD) [197] and most recently for the analysis of the doubly substituted so called 'clumped' carbon dioxide, nitrous oxide (δ15N14N18O, δ14N15N18O, δ15N15N16O) [198] and methane (δ13CH3D) [199]. High precision analysis for isotopic components with very low abundance at ambient amount fractions is currently realised via a combination of laser spectroscopy with pre-concentration [197, 200].

Figure 6. Refer to the following caption and surrounding text.

Figure 6. Ro-vibrational transitions of greenhouse gases in the MIR spectral range (500–5000 cm−1), retrieved from the HITRAN database for water (blue), carbon monoxide (orange), methane (black), carbon dioxide (red) and nitrous oxide (green). The line strength of each component is weighted by its abundance in the atmosphere.

Standard image High-resolution image

The success of gas analytical TDLAS is driven by persistent improvements in wavelength coverage and commercialisation of novel laser architectures like QCL [201, 202], or inter-sub-band cascade lasers (ICLs) [203205]. TDLAS in general covers isolated absorption features by laser-current induced wavelength tuning, while laser embedded distributed feedback (DFB) structures ensure monochromatic emission [206]. Scanning frequencies beyond 10 kHz allow measurement times below 100 µs, particular interesting e.g. for trace gas flux sensors using eddy correlation techniques [207209] or even high-speed engine analysers [210, 211]. The 'extractive' TDLAS configuration uses absorption cells at stable pressure, temperature, path length and matrix gas composition. A photodetector captures wavelength dependent light losses described by the Beer–Lambert Law [212] and relates to path-averaged absorber number densities hence component amount fraction [213, 214].

A TDLAS instrument is usually built and optimised for 1–2 molecular or isotopic species per laser. However, various multiplexing techniques [215] working in the time-, wavelength- or modulation-frequency-domain enable multi-species instruments combining four and more lasers [216, 217]. Development of integrated dual-wavelength lasers [218, 219], electrically tunable QCL devices or laser arrays will allow developing spectrometers capable of measuring up to 10 components.

TDLAS variants such as direct absorption (dTDLAS) [220], wavelength modulation (WMS) [184, 221, 222] or integrative spectroscopy [223], balance electronic or spectroscopic complexity against noise suppression enabling 10−3 to 10−6 fractional absorptions, ultimately approaching shot noise [224]. Rapid electronic modulation multiplexing allowed development of hybrid versions which directly combine WMS and dTDLAS [225].

In general, TDLAS-sensitivity scales linearly with path length L, line strength S, and gas pressure p  (for p   <  few 10 mbar) and with the square root of measurement time t1/2 or electronic bandwidth (1/BW)1/2 if white noise-limited. Numerous optical multi-pass concepts are used to improve sensitivity, e.g. from White [226], or Herriott and Schulte [227], or improved versions with better volume to path ratio [228], lower weight [229], or size [230], for airborne applications or with special beam geometry [231, 232] e.g. for drones.

In common 'calibrated TDLAS' the system response (sensitivity, accuracy, linearity, stability) is characterised via comparison with a set of known gas mixtures, which entirely ensures traceability. TDLAS of isotopic species does not require corrections for mass interferants, e.g. 17O and N2O for δ13C-CO2 analysis, like IRMS, but spectral interferences from other atmospheric components may occur and have to be tested and corrected. In addition, the gas matrix of calibration and sample gas have to be closely matched and therefore WMO-GAW recommends the use of 'air standards' for calibration and drift-correction of optical instruments. Static carbon dioxide and nitrous oxide isotopic reference materials in whole air and at ambient amount fractions will be developed within the European metrology project SIRS [114]. In summary, TDLAS can achieve similar precision levels to state-of-the-art IRMS for many components with the advantage of high selectivity and real-time data availability.

The most attractive feature of direct TDLAS (dTDLAS) is its first principles potential [233, 234], i.e. employs a full absorption model to yield absolute species concentrations without instrument calibration, and hence acts as optical gas standard, OGS,—similar to the ozone SRP [235]. Such TDLAS-OGS require absolute values of all input parameters (e.g. p , T, L), accurate frequency linearization, dynamic laser tuning [169], line shape models [167, 236] and all relevant spectral parameters (e.g. line strength, collisional broadening parameters etc). Accurate (<1%), metrologically defined spectral data [237], have been measured recently for ammonia [238], water [214, 239], hydrogen chloride [240, 241], nitrous oxide [242, 243], carbon dioxide [244], and carbon monoxide [245], but are frequently a leading source of uncertainty in TDLAS OGS-realisation. Furthermore, in dTDLAS, high path length cells require accurate length determination [246], while short cells, ask for careful suppression of parasitic absorption contributions [247].

'Calibration-free' TDLAS measurements are very frequently 'claimed', but rarely metrologically scrutinised. Few TDLAS-OGS are solidly validated using primary reference gases, e.g. for carbon dioxide [248] or carbon monoxide in biogas [249]. One of them could be shown to fulfil WMO requirements in the OGS-mode, which should be interesting for very remote field applications. The most mature TDLAS-OGS-realisations are the two laser-hygrometers, SEALDH [233] and HAI [250], which were successfully validated in [251, 252] side by side comparisons with primary metrological humidity generators, but were also successfully used in numerous harsh airborne field-applications [253, 254]. For example SEALDH-II is a fully validated transfer standard which offers an extraordinary relative temperature stability of the humidity signal of 0.026% K−1, and an optimum precision of 56 nmol mol−1 (0.125 µmol mol−1 m−1 Hz−1/2). A relative stability of 0.5% (close to the primary water standard) was demonstrated [255] under metrological conditions. SEALDH-II nevertheless fits in a 4HU 19'' rack and only needs a power supply. This successful primary TDLAS-OGS validation [256] and extended field application nicely demonstrates that TDLAS offers great opportunities to generalise the realisation of primary OGS to a broad range of molecules.

4.4. Fourier transform infra-red spectroscopy

FTIR [257, 258] is a well-established analytical technique that offers an approach to trace gas analysis complementary to that offered by laser-based methods (sections 4.2 and 4.3). FTIR is a broadband technique covering a wide spectral range using a thermal light source. Analysis of the broadband FTIR spectrum allows the retrieval of concentrations of several trace gases simultaneously from a measured spectrum. An FTIR spectrometer typically consists of a broadband source (globar in the mid infrared (MIR) or halogen lamp in the near infrared (NIR)), an interferometer, sample cell (usually multi-pass for enhanced absorption) and IR detector. The instrument records an interferogram which is Fourier transformed into an infrared spectrum. A transmittance or absorbance spectrum is constructed by dividing the sample spectrum by a pre-measured spectrum of the empty sample cell. For quantitative analysis the spectrum is typically analysed over a broad spectral range encompassing many thousands of (overlapping) spectral lines in contrast to laser techniques which exploit the high resolution of the laser at low sample pressure to analyse single rotational-vibrational lines of molecular species in the spectrum. FTIR and laser techniques achieve comparable precisions—the high brightness of the laser source over a single rovibrational line on the one hand is balanced by the broad spectral range and analysis of many thousands of lines that allow the simultaneous quantification of component and impurities on the other.

Peak height or area-based quantitative analysis of single absorption lines is not appropriate to the lower spectral resolution of FTIR compared to laser spectroscopy due to spectral line overlapping at the lower spectral resolution and in samples that may be at atmospheric pressure. Quantitative spectrum analysis typically uses either chemometric techniques or non-linear least squares fitting over whole or partial spectral bands to retrieve trace gas amounts from spectra. In chemometric techniques such as classic least squares (CLS) [259, 260] or partial least squares (PLS) [259, 261], the best fit to a measured absorbance spectrum is constructed as a linear combination of pure single component spectra, or factors derived from a principal components analysis. Chemometric techniques are based on the Beer–Lambert Law and its implied linearity between absorbance and concentration, which breaks down at low resolution and introduces non-linearity into the calibration. In non-linear least squares analysis [258, 262] a transmittance spectrum is calculated from absorption line parameters (the forward model) and the parameters of the model, including the trace gas amounts, are iteratively adjusted to achieve best fit to the measured spectrum (the inverse model).

FTIR analysis has been used in many applications of industrial and pollution monitoring, but here we restrict the history to measurements of atmospheric greenhouse gases focused on high accuracy and traceability to SI standards. Esler et al [263, 264] provided the first such measurements using a commercial FTIR spectrometer, 9.6 m White cell and liquid nitrogen cooled MCT detector to measure carbon dioxide, methane, nitrous oxide, carbon monoxide and δ13C-in-CO2 simultaneously in air samples. The spectra were analysed by CLS using synthetic reference single component spectra calculated at the temperature and pressure of the measurement [259, 265]. This FTIR system was developed further to improve accuracy and precision using least squares spectrum analysis, and ease of use in the field, and is described in detail by Griffith et al [262].

In parallel to this development, Mohn et al [261, 266] used a laboratory FTIR spectrometer and White cell combined with PLS chemometric analysis focussed on δ13C-in-CO2 measurements in field applications, but also including measurements of methane, nitrous oxide and carbon monoxide.

At the BIPM, Flores, Viallon and co-workers assessed FTIR in metrological trace gas applications and applied this technique to underpin international comparisons among NMIs [20, 267271]. At the BIPM a laboratory FTIR system has been refined for δ13C measurements in carbon dioxide with a formal assessment of errors [49]. This work demonstrated a new approach to isotopic measurements described below in which a range of δ13C in standard gases is not required—the calibration is based on isotopologues rather than isotope ratios and it is sufficient that the calibration standards cover only the range of isotopologue concentrations or amount fractions.

In principle, the quantitative analysis methods described above can be traceable to the SI. Quantitative reference library spectra or line parameter databases should be traceable, i.e. calibrated and with an uncertainty estimation, but this is rarely the case. Work at Physikalisch-Technische Bundesanstalt (PTB) intends to address these deficits by establishing a European spectroscopy infrastructure to produce traceable measurements of spectral line data under well controlled conditions at a central spectroscopic facility (EUMETRISPEC) [272]. The limitation in estimating uncertainty of FTIR measurements comes from the forward model used to calculate spectra: both the true spectral and instrument line-shapes are not accurately known, and the forward model is an approximation of the true molecular and instrument line-shape functions. Approaches other than the analytical uncertainty estimation must be employed, such as the sensitivity study developed by Flores et al [273] for determining the uncertainty of laboratory-based measurements of nitrogen dioxide and nitric acid amount fractions. This method may be useful when no reference gas standard is available, but the uncertainties are close to 5% to 10%. To reach lower uncertainties, as with the laser-based techniques, FTIR measurements of air must be calibrated against reference standard gases and the normal approach is calibration against reference gases traceable to primary SI quantities through NMIs and/or the WMO-GAW calibration laboratories.

Spectroscopic analysers are inherently isotopologue specific in their response because molecular rotation-vibration spectra depend on the masses of the constituent atoms. The output of an optical analyser thus depends on the isotopic composition of the measurand. In practice, optical spectroscopic analysers can be used for explicit measurements of isotopic fractionation and as a proxy for total trace gas amounts. In the former case the individual isotopologues are measured as independent components and there is no inherent ambiguity. In the latter case a total trace gas amount measured by the optical analyser based on a specific isotopologue can only be accurately calibrated if isotopic composition is not significantly different between the sample and the reference standard. Differences in isotopic composition may in some cases lead to significant errors in deriving total trace gas amounts from an optical measurement. Recent works have quantified this potential for error and provided the strict calculations required to calculate trace total amounts from isotopologue measurements. The potential errors should be assessed when using any optical spectroscopic technique [52, 274, 275].

5. Conclusions and outlook

The developments in reference materials and measurement techniques for underpinning atmospheric amount fraction measurements of three key greenhouse gases (carbon dioxide, methane and nitrous oxide) as well as isotopic composition for source apportionment have been reviewed. Substantial progress has been made in recent years, particularly within the CCQM-GAWG and WMO-GAW towards providing a measurement infrastructure to underpin accurate and reliable measurements. However there remains an urgent requirement for further developments. Future work is focussed on the next generation of greenhouse gas reference materials, which will be value assigned for amount fraction and isotope ratio and matrix matched to atmospheric compositions, providing instrument manufacturers and atmospheric scientists with the standards required to monitor amount fractions and isotope ratios accurately in real time. There is also a significant focus towards a more robust metrology infrastructure for isotope ratio underpinned by new reference materials and advances in instrumentation.

Acknowledgments

This work was funded, in part, by the United Kingdom National Measurement System. Christoph Zellweger acknowledges funding from the World Calibration Centre WCC-Empa which is supported by MeteoSwiss and Empa.

Please wait… references are loading.