Climate Science Glossary

Term Lookup

Enter a term in the search box to find its definition.

Settings

Use the controls in the far right panel to increase or decrease the number of terms automatically displayed (or to completely turn that feature off).

Term Lookup

Term:

Settings

Beginner Intermediate Advanced No Definitions Definition Life:

All IPCC definitions taken from Climate Change 2007: The Physical Science Basis. Working Group I Contribution to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change, Annex I, Glossary, pp. 941-954. Cambridge University Press.

Home

Arguments

Software

Resources Comments

The Consensus Project

Translations

About Support

	Climate's changed before
	It's the sun
	It's not bad
	There is no consensus
	It's cooling
	Models are unreliable
	Temp record is unreliable
	Animals and plants can adapt
	It hasn't warmed since 1998
	Antarctica is gaining ice
	View All Arguments...

Latest Posts

The GLOBAL global warming signal

Posted on 4 July 2012 by Kevin C

Highlights

New global temperature series confirm the GISTEMP results using only the HadCRUT3, NCDC and/or UAH data.
Once El Nino is taken into account there is no evidence for a slowdown in warming over the period 1996-2010.
If the HadCRUT4/HadSST3 ocean temperature corrections are also included then the underlying global warming rate is ?0.2°C/decade.
There remain uncorrected cool biases in the temperature trends.

Introduction

Global warming involves warming of the whole globe (the clue is in the name), but it does not necessarily affect every part of the globe at the same rate.

Different parts of the globe can experience very different changes under greenhouse warming. As a result, if you want to measure global warming, you have to measure the whole globe, not just a part of it. But of the three main in situ temperature records (GISTEMP, NCDC and HadCRUT), only GISTEMP is near-global in coverage over recent decades, and only by means of allowing each weather station to cover a larger region of the map. The incomplete coverage of the Hadley and NCDC datasets may be seen in Figure 1, along with three global temperature reconstructions.

Figure 1: Coverage maps for various temperature series. Colors represent mean change in temperature between the periods 1996-2000 and 2006-2010, from +2C (dark red) to -2C (dark blue). Note that the cylindrical projection exaggerates the polar regions of the map.

In this article I will attempt to redress the balance by presenting 16 new versions of the temperature record (10 of which I consider to be realistic), each of which has global or near-global coverage. Thirteen of these are created by creating a composite of a non-global series (HadCRUT3/4, NCDC or BEST) with a global series (GISTEMP, UAH or the NCEP/NCAR reanalysis). The remaining three series are derived by extrapolating the HadCRUT3/4 and NCDC data.

As with my previous article, these results have not been subject to peer review and should be treated as tentative. However the diversity of methods and consistency of the results are strongly suggestive.

Summary of results

I will start by summarizing the results, and then describe the methods and data in detail.

Since the main impact of incomplete coverage is seen in recent temperature trends, we will start by looking at these. The trend over the 15 year period 1996-2010 is shown in Figure 2 for each of the 16 new temperature series (colored bars), compared to the official versions (white bars with coverage percentages superposed). Some of the temperature series have known problems, however two clusters of likely trends have been identified; I have named these the GISS cluster (due to similarity with GISTEMP) and the Had4 cluster (for series based on HadCRUT4).

Figure 1: Global temperature trends 1996-2010 Figure 2: Global temperature trends 1996-2010

The global temperature records from these two clusters are shown in Figure 3, along with GISTEMP, NCDC and HadCRUT3 using a 60 month moving average. Use the buttons below the graph to select between 30- and 60- year views. The records are also available as a .CSV file which may be read into any spreadsheet program.

Figure 2: Comparison of official and global temperature series

Figure 3: Comparison of temperature series 1950-2010 1980-2010

The 15 year trend (1996-2010) for the GISS cluster of 6 temperature series is 0.147°C/decade (this is the value for NCDCext, chosen because it is unaffected by the HadSST2 1998 discontinuity). However there has been a tendency towards more frequent La Nina conditions over the period, which has impacted this trend. Foster and Rahmstorf (2011) estimate this influence at -0.026°C/decade, so the underlying warming trend is 0.173°C/decade. This is comparable to the 0.17°C/decade long term trend found by Foster and Rahmstorf. (They also estimate a net cooling effect due to the solar and volcanic influences, however these terms are less well determined and so I have omitted them.)

The 15 year trend (1996-2010) for the Had4 cluster of 3 temperature series, each of which includes the new HadSST3 bias corrections, is 0.179°C/decade (for HAD4ext). Including the El Nino term, the underlying warming trend is 0.205°C/decade.

The two representative trends quoted above are conservative estimates and include a known cool bias due to smoothing - the other trends variously include known cool and warm biases. There is additional evidence for each of these biases, but they have not yet been quantified.

The problem

I have presented 3 articles on the subject of coverage bias - HadCRUT3: Cool or uncool?, GISTEMP: Cool or uncool? and HadCRUT4: Analysis and critique. The most important issues are as follows:

Poor coverage of both poles leads to a signficant cool bias in the Hadley and NCDC datasets.

This has become an issue since around 1980 because rapid greenhouse warming leads to a significant change in the global distribution of temperatures.

How can this problem be addressed? Two approaches are explored here. We could use a globally complete dataset, such as the GISTEMP extrapolated data, the UAH satellite data, or the NCEP/NCAR reanalysis data to fill in the missing regions of the map. Alternatively, we could simply extrapolate the Hadley or NCDC data to fill in the missing regions of the map. We will consider each of these approaches in turn.

Coverage completion using two datasets

In HadCRUT4: Analysis and critique I showed how the GISTEMP or UAH map data could be used to estimate the coverage bias in the Hadley datasets. The basic method was to work out how much the GISTEMP (or UAH) temperature estimate for any month would be biased if its coverage were reduced to match the corresponding month from the Hadley dataset (with one caveat - the map series need to be adjusted to have the same baseline period).

But if we know how big the bias is, we can correct for it by subtracting the bias term from the biased data, and thus create an unbiased temperature series. To do this we need two temperature series:

A time series, which provides data with good temporal stability, but incomplete geographical coverage, and

A coverage series, which provide data with good geographical coverage. Temporal stability is not required for this series.

Map series are required for both datasets. The maps from the time series are used to determine a coverage mask for each month, and a (biased) global temperature estimate. The maps from the coverage series are used to determine the coverage bias. Temperatures for the coverage map are calculated with and without the mask - the difference between them is an estimate of the coverage bias.

There is an alternative (mathematically equivalent) way of looking at this method. We are taking the map from the coverage series, and adding a constant to every cell so that the mean of the map matches the mean of the times series map over cells where they both have coverage. The average over the whole of the adjusted coverage map then provides the unbiased temperature estimate.

It should be clear that we can add or subtract a constant to every cell in a map for the coverage series - and thus adjust the global temperature for that series up or down for any month - without affecting the result. Thus temporal instability or bias in the coverage series does not affect the results. The only factor which affects the results is the difference in mean temperature between the masked region and the whole map.

I have performed this calculation using 4 time series and 3 coverage series. The time series are as follows:

HadCRUT3: The HadCRUT3 dataset has 80% global coverage over the past 15 years, with poor coverage of the Arctic, Antarctic and also parts of Africa, Asia and Australia.

HadCRUT4: The HadCRUT4 dataset has 84% global coverage over the past 15 years. It improves the land coverage of the Arctic, Asia and Australia, but not the Arctic ocean, Antarctica or Africa.

NCDC: The NCDC dataset has 90% global coverage over the past 15 years. This relatively good coverage is acheived by filling in missing map cells by modelling the long range geographical variations in the data. However this infilling step is only performed at low latitudes, hence the lack of polar coverage seen in Figure 1.

BEST: The BEST dataset only covers the land area (i.e. 29%) of the planet. However this is just another kind of incompleteness, and so can be addressed in the same way as the incompleteness in the Hadley and NCDC datasets. The BEST preliminary data only runs to March 2010, so to estimate the trend to the end of 2010 the trend to the end of the data has been adjusted downward by 0.006°C/decade (this figure being the change in the GISTEMP trend on the same period when the last 9 months are omitted). The BEST team do not distribute map series, so I have assumed complete coverage of land areas only.

The coverage series are as follows:

GISTEMP: The GISTEMP dataset provides ~99% coverage over the period 1996-2010, by allowing each weather station to provide temperatures for a 1200km radius circle about the station rather than an arbitrarily sized grid box. (The 1200km radius was determined by examining temperature variations in regions with plentiful weather stations.) However it is based on much of the same data as the time series datasets, and so does not provide significant additional information.

UAH: The UAH satellite series by contrast is an independent source of temperature data. Coverage is near global, with 5° around the pole interpolated, and the entire globe is sampled by a single instrument. High altitude locations such as the Antarctica and the Andes are problematic however, with some ground contamination of the microwave signals. A bigger issue is temporal stability - there have been numerous adjustments to the data over previous years, and there are still significant uncertainties, see for example Of Satellites and Air, Seeing the BEST part of the Satellite Temperature Record?, and a comparison of UAH and RSS. Also UAH measures lower troposphere (LT) temperatures which are not directly comparable to surface temperatures - from Figure 1 it may be seen that the geographical variation is slightly muted in comparison to GISTEMP. As a result the UAH data is expected to undercorrect any coverage bias.

NCEP/NCAR reanalysis: The NCEP/NCAR reanalysis dataset provides an alternative method of determining global temperatures. Instead of just performing a statistical analysis of the weather station and sea surface observations, the reanalysis attempts to construct a complete model of the state of the Earth's atmosphere at any point in time. This is achieved by using a modern weather model to produce a 'nowcast' for any date in the past using not just the weather stations, but also data from weather balloons, satellites and radar. This data is less useful for determining long term trends because the results are influenced by the introduction of new data sources (in particular the satellite data in 1979; e.g. Chen et al, 2008), but is useful for determining a physically consistent geographical picture.

These datasets allow the construction of 12 composite series. These have been named using a composite of the name of the time series followed by the name of the coverage series, i.e. HAD3giss, HAD4giss, NCDCgiss, BESTgiss, HAD3uah, HAD4uah, NCDCuah, BESTuah, HAD3ncep, HAD4ncep, NCDCncep, BESTncep.

One additional issue arises with the BEST time series data. Since the BEST data is based on land stations and covers land areas only, to make a realistic assessment of bias, the common coverage region of the coverage series map must be defined in the same way. In the GISTEMP land-ocean data and in the UAH and NCEP data, coastal cells contain a hybrid of land and ocean temperatures, which prevent the calculation of an uncontaminated land-only series. This leads to an undercorrection, and so an overestimate of the resulting trends. To address this issue I constructed one additional composite series - BESTgiss2 - in which the common coverage temperature for GISTEMP was constructed from the GISTEMP land-only data (i.e. the dTs 1200km dataset).

Extrapolating a single dataset

The simplest approach to improving the coverage of a temperature series is to fill in any map cell for which no observations are available with the temperature from the nearest cell for which an observation is available. Omitting the cell completely is mathematically equivalent to setting it to the mean value for the whole map. The reasoning for using a nearby value is that this is likely to be a better estimate of the temperature than the value of the more distant cells which comprise the bulk of the map. Like GISTEMP I have imposed a maximum distance limit of 1200km, which is enough to provide near complete coverage for most of the last 60 years.

GISTEMP uses a related but more sophisticated approach of allowing each station to influence an area of radius 1200km around the station, with a weight that decreases with distance. This is a form of kernel smoothing.

These methods are not without their own biases. If the missing region represents an extreme of temperature anomaly, then the global average from the incomplete data will be biased, as shown in Figure 4. Kernel smoothing introduces two effects - firstly it flattens out both high and low extremes (which doesn't introduce a bias in the global average), and secondly it extrapolates from well populated regions into sparsely populated regions (which does). As a result, GISTEMP and GISTEMP-based temperature series (see below) may be expected to have a cool bias.

Figure 4: Bias due to nearest neighbor or kernel smoothing

Can this cool bias be confirmed from the data? GISTEMP provide two datasets based on land temperate stations only: dTs 1200km and dTs 250km, which differ in the area of influence of a given station. The 250km radius dataset should show less smoothing bias over the region where both datasets have coverage. Temperature series were therefore calculated for both series over the region where they share coverage, and the trends on the period 1996-2010 compared, with the following results:

GISTEMP dTs 250km:	0.312°C/decade
GISTEMP dTs 1200km:	0.284°C/decade

Note that this bias only applies to temperatures estimated from land stations, but does not account for regions where there is no coverage and thus no bias estimate is available. Therefore at this stage the total impact of this bias is unknown.

The extrapolation approach allowed the construction of 3 additional series, named using the name of the time series and the suffix 'ext', i.e. HAD3ext, HAD4ext, NCDCext. Extrapolation cannot be applied to the BEST data due to the lack of maps and the small starting coverage.

Results

The trends in °C/decade on the 15 year period 1996-2010 (with the previously noted caveat for the BEST data) are given in the following table for all 16 new temperature series. Values I consider likely to be unreliable are given in italics.

HAD3giss 0.156	HAD3uah 0.149	HAD3ncep 0.191	HAD3ext 0.155
HAD4giss 0.181	HAD4uah 0.174	HAD4ncep 0.218	HAD4ext 0.178
NCDCgiss 0.155	NCDCuah 0.144	NCDCncep 0.185	NCDCext 0.147
BESTgiss 0.169	BESTuah 0.214	BESTncep 0.180	BESTgiss2 0.153

The following details are noteworthy:

HAD3giss duplicates GISTEMP very well. This makes sense, given that we already know that HadCRUT3 and GISTEMP agree well where they share coverage. HAD4giss then provides an estimate of what GISTEMP might look like if it adopts the new HadSST3 ocean temperature data.
HAD3ext and HAD4ext duplicate HAD3giss and HAD4giss well, but without using the GISTEMP data. Given that GISTEMP also extrapolates into the polar regions (although using a different method), it is unsurprising that the results are similar.
HAD3uah and HAD4uah are interesting because they represent a completely independent approach to filling in the missing regions of the globe, and yet give similar results. The undercorrection resulting from the use of satellite data may be seen in the fact that HAD3uah is low and BESTuah is high: The UAH data does not provide enough correction to bring the two results together.
The NCDC data gives marginally lower trends than the HadCRUT3 data for each of the above cases, but is otherwise very similar.
BESTgiss2 gives an indication of what a global BEST dataset may look like using a current-generation SST dataset. The other BEST temperature series are biased high due to the coastal contamination issue.
All of the 'good' datasets (i.e. the GISS cluster, the Had4 cluster and BESTgiss2) suffer either from smoothing bias because they rely on nearest neighbour interpolation or the kernel-smoothed GISTEMP data, or from undercorrected coverage bias because they use the UAH data.

The outlier temperature series (i.e. the ones I suspect are unreliable) are shown in Figure 5. BESTuah highlights the problem of undercorrection when using UAH for infilling - the BEST land-only series overestimates recent warming, and UAH does sufficiently correct for this. The NCEP corrected series also show higher rates of warming over recent decades - this could reflect the fact that they avoid the smoothing bias of the other series, but my working hypothesis is that there is an unindentified problem in using the reanalysis data in this way.

Figure 4: Outlier temperature series Figure 5: Outlier temperature series

The remaining series, BESTgiss2, is shown in Figure 6. This provides an indication of what a future BEST land-ocean temperature series might look like (assuming that the HadSST3 bias corrections are not included). This series shows more pronounced warming since 2002 than GISTEMP.

Figure 5: BESTgiss2 land-ocean series Figure 6: The BESTgiss2 temperature series

Conclusions

Coverage bias signficantly impacts recent temperature trends. The methods used here to estimate and correct for that bias are rudimentary, but present a coherent picture of continued warming. It is striking that three different approaches, when applied to either the HadCRUT3 or NCDC data, yield a record which is very similar to GISTEMP. The same approaches applied to the HadCRUT4 data lead to a greater warming trend owing to the inclusion of the HadSST3 bias corrections.

Taking into account the effect of the El Nino cycle on recent trends, the warming rate of the largest cluster of datasets is consistent with longer term trends. If the HadSST3 adjustments are also correct, the underlying warming rate probably exceeds 0.2°C/decade. There are still known cool biases in each of these time series.

We have not taken any account the impact of a possible increase in aerosol cooling. This raises the worrying possibility that the underlying warming rate has been accelerating, and has been masked by aerosol emissions and the biases in the temperature series. Further developements on SST adjustments and aerosol impacts will hopefully clarify the situation.

0 0

Printable Version | Link to this page

Comments

Comments 1 to 14:

KR at 01:27 AM on 4 July, 2012
Kevin C - Thanks for this series of posts. It really reinforces the points that, when the full set of data is considered, the identified trends are: * Visible in all of the collected datasets, * Consistent from one dataset to another, and * The trends are significant, well above noise levels.
0 0
Paul Magnus at 10:52 AM on 6 July, 2012
Question: So is the rate of GW increasing significantly now? Must be as sea level rise is accelerating by quite a bit, but there must be a lage. So what warming years are responsible for the increase in slr rate happening now? Is it a 2 5 or 10yr lag?
0 0
Tristan at 12:48 PM on 6 July, 2012
Paul: As best we can tell, the underlying temperature trend has remained roughly constant over the past three decades. The recent short-term sea level rise is mostly the result of all the water that was dumped over South America, Pakistan and Aus during the 2009-11 period making its way back to the ocean.
0 0
Kevin C at 15:15 PM on 6 July, 2012
Agreed. Also, the SST bias, if right, has been building steadily over a longer period. While it would bring the instrumental record over the last few decades into very good agreement with the CMIP3 results, it wouldn't produce an acceleration of warming over the last couple of decades. One other update: I've now implemented polyharmonic spline interpolation to fill in the missing regions, which allows a crude estimate of the smoothing bias in GISTEMP and the *giss and *ext methods. This is very preliminary, but my current best guess is that the smoothing bias exists but is small - about 0.005C/decade for 1996-2011. So my best guessed are ~0.18C/decade without the SST corrections, or 0.21C/decade with. Polyharmonic spline interpolation takes about 2hrs for 60 years of data (compared to ~20s for nearest neighbour). Kriging would be better but slower still. The next step is to test the skill of the various reconstructions by omitting known regions.
0 0
Kevin C at 15:44 PM on 6 July, 2012
OK, here's an even more tentative early-morning result. In the final graph, BESTgiss2 shows an much bigger uptick around 2007 (but is a 60 month smooth, so must come from 2009/2010) than any of the other records. My new HAD4spline shows the same uptick. So it looks as though temperatures in 2009/2010 may be significantly affected by the smoothing bias, and both Kriging and polyharmonic splines are in agreement over this. While the difference in trend on 1996-2010 is only 0.005C/decade, the difference on 2002-2010 is 0.02C/decade.
0 0
Tristan at 15:45 PM on 6 July, 2012
Kevin Your posts inspire me to learn more numerical techniques so I can try to do some of the things you do, thanks for making me want to reach further.
0 0
Kevin C at 16:00 PM on 6 July, 2012
Thanks. If you're interested, I can make python code available for all of this. It should be fairly easy to get most of it running on Linux/Mac. For windows you'd need to change a few command scripts to Windows command language. The one really painful part is extracting and regridding GISTEMP, which needs a fortran compiler.
0 0
Tristan at 18:28 PM on 6 July, 2012
Kevin Thanks for the generous offer. Unfortunately I'm still a ways from having the requisite expertise to do anything fun with it. I need to hit the books (again).
0 0
Kevin C at 23:41 PM on 6 July, 2012
Ah - don't trust early morning results (I should know better). Here's the 60-month moving average comparison of nearest neighbour (HAD4ext) and splines (HAD4s) with BEST. They are almost indistinguishable after 1960 - the trend difference since 2002 is a slight lowering about that date in HAD4s. BESTgiss2 shows a bigger uptick and short term trend because it is lower around 2002. (It would be very interesting to get to the bottom of that.) However the interesting thing is the difference before 1958. The spline version looks more like GISTEMP, eliminating the difference caused by the SST corrections. I'm guessing the change is coincident with the introduction of the Antartic stations. However why this should counteract the SST correction (except by coincidence) is baffling.
0 0
Paul Magnus at 07:02 AM on 7 July, 2012
Yes Tristan. The rate I refer to though is the average over the last couple of decades ie the 3mm compareded to the 1mm previous to that....
0 0
Paul Magnus at 15:07 PM on 8 July, 2012
Here we have the graph of ghg well, co2 emissions.( I am sure that total ghg emissions is going to be much steeper) we can see that there is a marked up tick around 1945. http://thinkprogress.org/wp-content/uploads/2012/07/chart751.png I think that that might correlate quite closely to the the up tick in sea level rise rate acceleration, The next surge will be soon and must be related to the albedo effect in the article and it's effect on Greenland ice sheets. What seems likely in the south is massive collapse and disintegration. So the ice going tthere will result in a jump in sea level rise rathere than a rapid increase in rate,
0 0
Kevin C at 06:54 AM on 21 July, 2012
OK, I've got Kriging working. It's not very much slower than splines. Interesting results: The Kriging results look very similar to nearest neighbour with a 1200km cutoff. I sort-of expected that from the theory, but the agreement is really quite good. Kriging gives global coverage, but as you move further away from the nearest observed cell, the weights for all the cells in the map approach equality - in other words, cells very far from any observation get set to the average of the observations, which is the same as leaving the cell unobserved. And it happens automatically. (Actually, this does depend on being able to solve the complete system of equations. BEST can't do this, because they are dealing with individual stations, not a coarse grid, so instead they impose a distance cutoff like GISTEMP in order to obtain a sparse matrix.) That also means that the GISTEMP 1200km kernel smoothing should give results fairly close to Kriging. Also, the divergence shown in Had4s in #9 before 1960 is an artifact of the spline extrapolation. The splines work well when coverage is good, but once you go back far enough to lose the Antarctic stations the spline method starts giving erratic. Unlike Kriging, the splines will generate more extreme values as the extrapolation distance increases.
0 0
KR at 08:50 AM on 21 July, 2012
Kevin C - I would be curious as to behavior of the various techniques with hold-out checks: choose a sparse set of hold-out stations, run the spline/kridging/nearest neighbor with cutoff methods, and see just how close they come (and with what bias) to the hold-outs. How hard would that kind of analysis be?
0 0
Kevin C at 21:21 PM on 8 August, 2012
KR: Excellent question. I've done it using the NCEP data as a basis, and masking the map series down to match the coverage of the corresponding month from either Hadley dataset. (Before 1965 I loop the temperatures from years 1965-1975.) This is a harder and more realistic challenge than a random holdout, because the correlation structure means that extrapolation over larger distances is harder than smaller distances. I can then compare the interpolated reconstruction with the value from the complete data. Results: 1. Once the Antarctic stations become available, the Krig, nn1200km and spline methods all give much lower noise and bias than ignoring the missing cells. 2. Going back before the Antarctic stations are available, the spline approach becomes noisy. Unlike the others, it does not degrade gracefully as coverage reduces. It tends to produce extremes a long way from the nearest station, where the other methods are both conservative and revert to the global mean if you get too far from the nearest station. 3. Going back to 1880 the performance of the Krig and nn1200km reconstructions degrade gracefully with time. At 1880 they don't provide much benefit, but they never make things worse either. Kriging is formally more correct, and the effective area of influence is determined by the data itself, but in practice there is not much to choose. Thus I think the GISTEMP results are already pretty close to optimal (given the SST data they are using). Caerbannog's observation that you can get a similar result to GISS by just (for land data) using the CRU method and 20 degree boxes highlights this result - almost any global reconstruction method gives the same answer. This kind of test, where you try and find tough challenges which can potentially falsify your approach, is precisely the thing which differentiates real science from blog science. It goes back to Popper: A theory which has survived multiple severe tests is a fitter theory than one which has yet to face a severe test. Applying severe tests like this is the kind of thing I need to do to make this publishable.
0 0