Bakken flares enough gas to heat 0.5 million homes

(npr)

Produces enough oil to heat 10 million. That is, to anyone stupid enough to simply burn it for heat, when its monetary value as fuel is quadruple that.

The issue: natural gas being flared (burned as waste) in the new oil fields of the Bakken shale in North Dakota (USA).

The numbers: (North Dakota/2011)

Heat content CO2 content Value
(i) Oil production 153 million barrels 936 PJ (9.36*1017 Joules) 64.8 million tons $14.5 billion
(ii) Gas production 97 billion ft3 105 PJ 5.3 million tons $384 million
(iii) Gas flaring 50 billion ft3 54 PJ 2.7 million tons $196 million
ratio (iii)/(i) 5.7% 4.2% 1.4%

(Heat contents: 5.8 million btu per barrel crude oil, 1023 btu per ft3 natural gas. At 2011 mean prices of $94.88/barrel (WTI spot), $3.95/ft3 gas (at wellhead). CO2 intensities of 161, 117 kg/million btu respective)

(Note this 2011 data is quite obsolete. The most recent data is an annualized oil production rate of 273 million barrels, nearly twice as high. I don't have gas flaring numbers more recent than 2011).

A recent NYT article explains the (probably obvious) economics behind this:

[NYT] In North Dakota, Flames of Wasted Natural Gas Light the Prairie

The oil companies say economic reality is driving the flaring in the Bakken, the biggest oil field discovered in the United States in four decades. They argue that they cannot afford to pay for pipelines and processing plants to capture and sell the gas until they actually drill oil wells and calculate how much gas will bubble out of the oil. And reinjection of the carbon dioxide, commonly done in conventional oil fields, is more difficult and expensive in less permeable shale fields.

...

With oil prices high amid strong global demand and leases as short as five years for land in the Bakken, drillers have found it more profitable to just grab the oil and burn the gas. Building out the infrastructure to handle gas would substantially raise costs and slow development, and efforts so far to use the gas for electrical generation have had limited success because it contains components that burn too hot.

“I’ll tell you why people flare: It’s cheap,” said Troy Anderson, lead operator of a North Dakota gas-processing plant owned by Whiting Petroleum. “Pipelines are expensive: You have to maintain them. You need permits to build them. They are a pain.”

The oil expansion is growing faster than the ability to build gas pipelines.

It goes beyond natural gas. According to the WSJ, they don't even have pipelines to get the oil out -- the main product. With oil pipes under construction, they are moving crude by freight rail, at an extra cost estimated at $5-10 per barrel.

[WSJ] Riding the Dakota Oil Boom

North Dakota's output has grown in the last three years from a trickle to nearly 450,000 barrels a day—trailing only Texas, Alaska and California—and could double by the middle of the decade, according to analyst and industry projections. But pipelines in the region already are operating at capacity, and major new lines aren't expected to start going into service until 2013.

In response, companies are building rail terminals. Rail terminals can be developed quickly, giving them an advantage for now over pipelines. The North Dakota Pipeline Authority estimates a doubling in rail-terminal capacity next year alone to more than 700,000 barrels a day.

Most of the oil gets to the train depots by truck, picked up from thousands of wells scattered off dirt roads. At night, natural gas being flared from the oil wells casts an eerie glow over the vast prairie lands.

Note that, like flaring, the absence of oil pipelines has a substantial cost in energy waste. To push a ton of oil 1,000 miles by rail, about 2 gallons of diesel are burned -- about 1% of the cargo.

Shipping coal vs. shipping electrons

Which is cheaper?

  • To ship one ton of coal 1,000 miles (by freight rail), or
  • To transmit one ton of coal-electricity 1,000 miles (on high-voltage power lines)

Intriguingly, moving coal is much cheaper than moving eletrons, even with the most efficient HVDC lines.

The average cost (precisely, revenue to railroads) for shipping coal is 2.50 cents/ton-mile, as of 2010 [1]. More fine-grained cost data is available from the EIA [2], from sampling of rail waybills. There's a large variance, ranging from 1.07 cents/ton-mile (Wyoming->Iowa) to 7.40 cents/ton-mile (West Virginia intra-state). On a cursory look, the broad trends seem to be (i) midwest freight (flat terrain) is cheaper than Applachian; and (ii) long routes are cheaper per mile than short ones.

An average ton of US coal has 19.583 million btu of heat. The average US coal plant uses 10,415 btu per kWh (i.e., is 32.8% efficient), for a yield of 1,880 kWh(elec.) per ton coal. So, normalized to 1,000 miles: the average shipping cost is $25/ton or $13.30/MWh(e). On cheaper midwest routes, such as Wyoming->Illinois or Wyoming->Michigan, the cost is $6.70/MWh, $7.82/MWh, respectively. An expensive Appalchian route, West Virginia->Pennsylvnia, is $24.89/MWh. (Again, this is per 1,000 miles of distance). (More advanced coal plants, like this recent German project, are about 30% more efficient; this assumption would lower per-MWh rail costs by 30%).

Here are some representative cost figures [4] for long-distance, high-capacity transmission. For 1,000 miles, even the cheapest HVDC solution (+/- 800 kV) is rather more expensive than the $13.30/MWh(e) freight equivalent. Compared to midwestern rail freight ($6-7/MWh), it's more than twice as expensive!

Michael Bahrman, P.E. (ABB): "HVDC Transmission: An economical complement to AC transmission"

As it stands right now, the cost gap is even bigger than this: ultra-high-voltage transmission isn't really deployed in the US. As this map from NPR shows [5], most long-distance transmission is 345 kV AC -- the most expensive on that graph, up to ten times the cost of freight coal. (This map doesn't include lower-voltage lines below 345 kV, which are far more common on short distance scales [6][7]).

NPR: Visualizing the U.S. Electric Grid

This puts some logic behind the absurd amount of coal freight in the US. [1] puts coal as 43% of US rail freight (by weight). From the 1.1 billion tons of US coal mined annually, more than half a trillion ton-miles are logged -- an average migration of 514 miles. The bulk of this by rail, at costs ($10-20/ton) as high as the mined cost of coal itself. One map puts the distances in perspective: there is Wyoming coal being burned outside of Altanta.

EIA: Coal Transportation Rates to the Electric Power Sector

[1] Association of American Railroads | Railroads and Coal

[2] PDF EIA | Estimated rail transportation rates for coal, basin to state, STB data (main site: [3])

[3] EIA | Coal Transportation Rates to the Electric Power Sector

[4] ABB | HVDC Transmission: An economical complement to AC transmission

[5] NPR | Visualizing the U.S. Electric Grid

[6] FEMA | United States Transmission Grid

[7] PDF NARUC | Transmission 101

Solar subsidies in Massachusetts (worked example)

A new solar PV calculator from MIT's Sustainable Design Lab. Demonstrates (correctly) how Cambridge, MA residents can save money with solar PV panels.


http://www.fastcodesign.com/1670986/infographic-how-much-money-will-solar-panels-save-you

Now to see where these "savings" come form. This MIT calculator is conveniently transparent about these facts.

  • 70% of annual "revenues" are a state-level subsidy, the Solar Renewable Energy Certificiate ($6,090 out of $8,902).
  • Roughly 60% of the remaining $2,812/year is subsidy. Thought called "utility generation value", this is really the retail value: $2,812 for 22,494 kWh works out to 12.5 c/kWh. The wholesale rate is 4.7 c/kWh in New England (2011 average). This is an indirect subsidy, called net metering: forcing utilities to value PV electricity at non-market rates, like at retail rate instead of wholesale.
  • 30% of the cost is a federal tax subsidy (FTC)

What would the finances look like without subsidies? This sample customer

  • Pays $101,720, for a PV system that averages only 2.6 kW (22,494 kWh/year)
  • Earns $1,050/year selling at market rate
  • Sees a payoff time of 97 years

Present value calculations (25 year PV lifetime, 5% discount):

value or cost present value
solar system cost -$101,720 -$101,720
tax credit (subsidy) +31,516 +31,516
electricity value (wholesale) +$1,050/year +$15,173
electricity value (retail) +$2,812/year +40,615
wholesale-retail difference (subsidy) +$1,762/year +25,449
SREC value (subsidy) +6,090/year +87,960
All subsidies PV +$144,926
Net PV without subsidies -$86,554
Net PV with subsidies +$58,371

In particular, in present value terms, the subsidy is 954% of the market value of the electricity being subsidized.

Blog is archived

No more new posts. (Most) old posts remain available in archives -- see right-hand toolbar.

"Thorium" scam widely linked, hits Slashdot

[Slashdot] 8 Grams of Thorium Could Replace Gasoline In Cars

Some devious misanthrope has latched onto the thorium hype for his own personal scam. Dr. (allegedly) Charles Stevens claims to be developing a thorium-powered-car, whose nature is impossible to pin down beneath the incoherent mismash of idiotic technobable... i.e.:

However, the use of thorium is controversial because, as with uranium, it is used as a nuclear power source. Indeed, the internal heat of the Earth largely is attributed to the presence of thorium and uranium.

The key to the system developed by inventor Charles Stevens, CEO and chairman of Connecticut-based Laser Power Systems, is that when silvery metal thorium is heated by an external source, it becomes so dense its molecules give off considerable heat.

Small blocks of thorium generate heat surges that are configured as a thorium-based laser, Stevens tells Ward’s. These create steam from water within mini-turbines, generating electricity to drive a car.

The key to the system developed by inventor Charles Stevens, CEO and chairman of Connecticut-based Laser Power Systems, is that when silvery metal thorium is heated by an external source, it becomes so dense its molecules give off considerable heat.

Natural thorium has little radioactivity, Stevens says. What isotopes there are could be blocked by aluminum foil, so the power unit’s 3-in. (7.6-cm) thick stainless-steel box should do the trick.

Stevens has worked out you’d require a 227kg, 250MW thorium engine in order to power a typical road car. Within that system 1 gram of thorium produces the equivalent of 7,500 gallons of gasoline. So if you fit the Thorium engine with 8 grams of Thorium, it will run the vehicle for its entire lifetime without needing to be refueled while all the time not producing any emissions. The engine lasts so long in fact, that it could be taken from one vehicle and used in another as and when they wear out.

Stevens agrees, emphasizing his system is “subcritical.” This means no nuclear reaction occurs within the thorium. It remains in the same state and is not turned into uranium 233, which happens only if thorium is sufficiently super-heated to generate a fission reaction. “It’s very safe,” he says.

http://wardsauto.com/ar/thorium_power_car_110811/

Please don't subject yourself of the anguish of trying to interpret this gibberish! He alternately describes this as not a fission reaction (although he doesn't appear know to what fission is, describing it as 232Th --> 233U...), but claims energy densities which strictly apply to fission reactions only. Claims thorium "molecules" [sic] give off heat when they become "dense". Claims there is lasing involved somewhere.

The scammer has incorporated himself as "Laser Power Systems, LLC." His previous startup, "Helyxzion, LLC", was in genomics.

[LinkedIn] Charles Stevens

Laser Power Systems, LLC

Helyxzion, LLC

Texas grid declares Level 1 Emergency as ten thousand megawatts of wind power stands paralyzed

Heat wave (NOAA)

Tuesday (two days ago):

[KHOU] Texas now under level 1 emergency; Residents asked to conserve more electricity

HOUSTON – ERCOT put out a notice at 2:30 p.m. Tuesday afternoon saying the state’s reserve levels dropped below 2,300 megawatts, putting into effect an Energy Emergency Alert level 1.

“We are requesting that consumers and businesses reduce their electricity use during peak electricity hours from 3 to 7 p.m. today, particularly between 4 and 5 p.m. when we expect to hit another peak demand record,” said Kent Saathoff, vice president of system planning and operations. “We do not know at this time if additional emergency steps will be needed.”

More blackouts this afternoon (i.e. Thursday):

[Reuters] Texas grid warns of rolling outages on heat

Extreme heat and soaring power demand may force the Texas power grid operator to impose rolling outages on Thursday afternoon to prevent a wider blackout as residents struggle with a record-breaking heatwave.

The Electric Reliability Council of Texas, ERCOT, moved to curtail power to some industrial customers Thursday afternoon as a way to boost surplus power to keep residential air conditioners running over the hottest part of the afternoon.

The state -- broiling under a relentless streak of 100-plus degree temperatures and drought -- has set three power consumption records this week, straining power plants.

And where are Texas' 9,727 MW of wind turbines in this crisis? Out to lunch. One picture says it all (note the distinct scales):

http://www.ercot.com/gridinfo/generation/ (August 2011)

Note how wind power is strongly anticorrelated with the afternoon demand peak! Wind power "reliably fails" whenever electricity is most needed.

The American Wind Energy Association is here to give us their "spin" on the situation:

[AWEA blog] Wind helps meet new Texas record for electricity demand

Commented American Wind Energy Association Manager of Transmission Policy Michael Goggin, "At a time when the extreme heat prevailing in Texas is pushing the utility system close to its limits, wind generation is making a valuable and much-needed contribution to system reliability."

At the risk of revisiting the obvious, these were the operating power levels of Texas' four nuclear power reactors over the same week, according to the NRC:

power reactor capacity (net) 7/27 7/28 7/29 7/30 7/31 8/1 8/2 8/3
Comanche Peak #1 (PWR) 1,209 MWe 100% 100% 100% 100% 100% 100% 100% 100%
Comanche Peak #2 (PWR) 1,158 MWe 100% 100% 100% 100% 100% 100% 100% 100%
South Texas #1 (PWR) 1,280 MWe 100% 100% 100% 100% 100% 100% 100% 100%
South Texas #2 (PWR) 1,280 MWe 100% 100% 100% 100% 100% 100% 100% 100%
4,927 MWe

[NRC] Power Reactor Status Reports for 2011

[IAEA] United States of America: Nuclear Power Reactors - Alphabetic

Update (8/11): One week later, more of the same:

Informative article in WSJ:

[WSJ] Texas Power Grid Falls Short

Very strange technical critique of the molten salt reactor -- by a PhD engineer, supposedly

I just want to point out this nonsense piece that's been making the rounds. It is a long, technical, detailed critique of the molten-salt reactor (MSR) concept, from someone who claims to be a PhD engineer (with expertise in "renewables"). It is absurd hogwash, full of utterly insane misconceptions. But it looks superficially credible, so I expect this "expert" will be widely quoted in the near future.

http://daryanenergyblog.wordpress.com/ca/part-8-msr-lftr/

The 8th in a series (!).

I'm not a PhD engineer, or any sort of engineer at all. Part of this blog's extremely small readership is such engineers. To those of you, here's a quick summary of some of the things in this critique which are complete nonsense, so you can judge for yourselves (I'll go into a bit more detail after the summary):

Summary of some of the biggest howlers

  • Claims MSRs have "Isotope Separation Plants" which separate 233U and 232U (the trace contaminant)
  • Warns of hazardous fission products, such as thorium isotope "T-232" [sic], which supposedly is a disadvantage of thorium-fuelled reactors because of its 14 billion year half-life
  • Warns that electrolyzing nuclear fuel salts is energy-intensive
  • Warns that heat inputs in fluoride reprocessing are energy-intensive
  • Asserts that thorium MSRs are constrained to a lower temperature limit of 1,110 °C, the melting point of pure ThF4. Concludes MSRs must be built entirely from ceramics
  • "Obviously, once we exhaust the world’s U-235 stockpiles, LFTR’s and any other Thorium fuelled reactors will cease to function."
  • Argues against using molten fuel salt as a working fluid in a gas turbine

Well, that pretty much says it all, don't you agree?

I'll go into a bit of detail on individual points, pointing out where in the 23-page screed they are from (so no one else has to waste time).

Isotope separation plants

§8.4:

The Isotope Separation plant and waste output

[...] Also the supporters of the LFTR seem to assume that this ISP can operate with 100% efficiency (i.e remove all the radioactive poisons). This would be very technically challenging, especially in the LFTR case given the importance (if you followed the points made by IEER earlier) about separating out of U-232 (and its Thallium-208 payload) from U-233.

He has the misconception that 233U/232U isotopic enrichment is necessary for MSR operation. Spends many paragraphs speculating on this imaginary thing, finally concluding it will consume up to 25% of an MSR's electric output:

All in all my suspicion is that our heat exchanger would struggle to produce thermal efficiencies any greater than 70-75%. Assume a good high efficiency Gas driven Brayton cycle the other end (55-60%) so that yields us an overall efficiency of 38% – 45%, oh! but we almost forgot about that isotope separation plant and its net energy inputs, say we deduct 5-10% of reactor power output to account for running that, so overall between 29% – 40%, with a 35% overall efficiency being my best WEG

Dangerous fission products

§8.3:

Certainly the fission products from a Thorium reactor are a worry, Technetium-99 has a half life of 220,000 years, uranium-232 produces thallium-208 (a nasty wee gamma emitter), Selenium-79 (another gamma emitter with a 327,000 year half-life), even Thorium-232 is a problem with its half life of 14 Billion years (and while the T-232 isn’t a major worry, all the time during this 14 Billion years it will be decaying and producing stuff that is!).

Names the long-lived fission products as the most "worrying". Thinks thorium-232 is a fission product (abbreviates it "T-232"). Is very worried about its daughter products -- they're continuously generated over 14 billion years! Thinks this is somehow a disadvantage of the thorium reactor (the one that consumes 232Th dug up out of the ground).

Where will we find so much energy?

Sounds the alarm about the energy intensiveness of using electrolysis in reprocessing (§8.4):

Notably, the LFTR supporters have suggested (see here) using electrolysis to help improve the filtering efficiency of their plant. An excellent idea, it would solve a number of problems, but unfortunately electrolysis systems practically eat electricity! Where’s all that electricity going to come from?

And (same section), the energy intensity of heating fuel salt to high temperatures in reprocessing:

As the working fluid will be coming off the exhaust from the heat exchange cycle it will be relatively cool (in the MSRE it was at around 570 °C) yet some of these processing stages will require the fluid to be heated back up to 1,600 °C. Where’s that energy going to come from?

Perhaps he is confused about the difference between temperature and heat? There is enough energy coming out of a nuclear reactor at full power to heat a ton of fuel salt by a thousand degrees in one second. (...within an order of magnitude. Don't have temperature-dependent heat capacities on hand, so this number is rough.) That a multi-gigawatt-thermal MSR "only" operates at 600° C or at 800 °C, is hardly a statement against its enormous power!

Insane temperatures

One section (§8.8) of his critique is based on the bizarre error that thorium MSRs are limited to operating at above 1,110 °C, the melting point of pure ThF4:

The dendrite problem above demonstrates that the LFTR/LFUR has a relatively narrow thermal window. Its filtering plant will not work if the temperature of the fluid drops much below a certain threshold and the danger of fuel solidification raises the risk of the reactor being damaged. With UF4 the solidification temperature is 1,036 °C and its vapourisation temperature is 1,417 °C. [...] With TF4 our “window” is 1110 – 1,680 °C, but again we can potentially move this by lowering the pressure (or raising it if we want to go the other way…not that we do!). A low vapour pressure also creates a few potential problems in terms of keeping the reactor sealed (air is more likely to leak in if the pressure inside is less than atmospheric…possibly starting a fire!) and maintaining a good flow rate from our pumps.

I don't know what confusion of his provoked this nonsense. Maybe he hasn't done his basic research, that all MSR proposals involve solvating actinide fluorides in other fluoride salts -- mixtures of LiF, NaF, BeF2, ZrF4, and/or others -- with the mixture having far lower melting points than actinide fluorides. Or maybe he's under the illusion that individual components of a chemical solution precipitate out at their pure melting points. At any rate, his chain of reasoning starts from this major error and leads to others:

With the LFTR however, I doubt you could operate one made out of any Nickel alloy, contrary to everything said on the internet. Bare in mind I’m thinking in terms of a good lengthy service life with a sensible factor of safety, not a flimsy test reactor in a lab (with a 100 mile exclusion zone!).

[...]

Thus the pressure vessel of any LFTR would likely have to formed out of Ceramics (very expensive and difficult to form, especially given how critical getting an air tight seal is given the graphite core) and key internal components out of Refractory metals, as would be the case for certain high temperature parts of any ISP (in both the LFTR and LFUR cases) given talk of operating temperatures in the range of 1600 °C.

[...]

So my instinct from a materials science point of view would be to drop the LFTR idea altogether and focus instead on a LFUR. While this isn’t able to use the Thorium cycle, the point was raised earlier about how the Thorium cycle isn’t all its cracked up to be. Its going to be a lot easier to build a LFUR than a LFTR, cheaper (relatively speaking) and likely safer too. Of course it does come at the disadvantage of a slightly awkward acronym! but overall that would be my focus of attention.

In reality, an MSR runs at fuel temperatures of only 700 °C (at least the first generation), which is why "internet" people like Kirk Sorensen, PhD assert that Hastelloy alloys are perfectly acceptable structural material.

What is breeding?

His presentation of the thorium fuel cycle, is, well, I don't know what to make of it!

(§8.3)

...and the fact we’ll still need supplies of Uranium to get Thorium reactors going again whenever we have to turn it off (which will happen at least once a year or so during its annual maintenance shutdown).

[...]

Obviously, once we exhaust the world’s U-235 stockpiles, LFTR’s and any other Thorium fuelled reactors will cease to function. The LFTR fans usually groan at this point and state that “all we need is a little plutonium”. Now while I’m quite sure that in the fantasy world which the LFTR fans inhabit Plutonium is available in any good hardware store but back in the real world, it’s a little harder to come by! As with the HTGR’s using Thorium (if its possible) would certainly help stretch things out….a bit! But not by nearly as much as the supporters of Thorium reactors would have you believe.

It's not a gas turbine if it's not a gas

Unlike Dr. PhD, I'm not an engineer, and my background knowledge does not help me understand how a Brayton cycle can run on liquid. I just don't get how it would spin, what with the difficulty of getting a liquid to expand. But because the author says he has a "PhD in a thermodynamics-related field", and is trained as a mechanical engineer -- I assume he understands something that I do not.

Another misconception is that a LFTR or LFUR can operate on an open cycle with a gas turbine. While true, it could be run this way, there are a host of practical reasons not to do it. Not least of them the fact that our turbine would have to be designed to withstand having a mixture of molten salt and fluorided fuel passed through it at very high temperatures. This would be tricky to say the least, likely requiring the use of those super expensive refractory metals, and while using such materials to make the odd turbine blade is one thing, an entire turbine casing is an entirely different matter. It would likely cost much more than the reactor itself!

Of course, the concept he is supposedly debunking, it seems he has somewhat miscomprehended its nature. The Energy from Thorium discussion isn't talking about running molten salt through a turbine as a working fluid. They're actually talking about running atmospheric air through a gas turbine ("open cycle") -- analogous to a jet engine or a gas-fired internal combustion turbine. (The air being heated by the reactor, through a gas/liquid heat exchanger). But hey, if we actually read and understood the thinks we were ridiculing, that wouldn't be much fun, would it?

Bug in Google Calculator breaks "Capacity Factor" posts

To help myself out in arithmetic-intensive calculations (and to make them more convenient to readers), I often use Google's 'Calculator' feature -- a nifty tool which is units-aware ("10 btu/hour") and can perform units conversions, as well as other convenient shortcuts such as scientific notation ("1e9"), natural-language words ("billion"), abbreviations ("GeV"), and mathematical constants ("Avogadro's number"). Here's a short example using several of them, illustrating how delightfully expressive this calculator language is. Unfortunately, this feature no longer works the same way it did before, creating broken links in many of my old comments. One of the bugs involves Google being "smart" and substituting words with textually "similar" ones (which mean completely different things), so that

1 TWh/year in watts

is reinterpreted as "1 kWh/year in watts" (kWh being a far more common search phrase) -- introducing an error of 109. Previously, the abbreviation "TWh" would be correctly interpreted as "terawatt hour". For added confusion, this "smart search" is not consistent in its buginess: e.g., slightly tweak the previous query to

1 TWh/hour in watts

And suddenly 'TWh' is back to "terawatt-hour"! Quite an insidious bug.

I would worry for physics students who use this tool. You input the expression correctly, and the wrong answer comes out. Worse, the wrong answer comes out where previous versions of the Calculator yielded the right answer. And most-worstest, this bug is dangerously silent: often the result retains the correct units, so it looks superficially correct. You only see that something went horribly wrong if you read the "parse" preceding the result, which often you won't because

  • It is very verbose on even small inputs, full of natural-language words and superfluous parentheses
  • It is nothing more than a 1-to-1 rewording of the expression you just entered (...except when it's disastrously wrong)

Anyway, a heads-up and an apology: there are some confusing broken links in my archived posts, due to this issue.

Guest post: A curious case of cherry-picking data for the greater good

On the subject of Fukushima hysteria, here is a guest post by Alexey Goldin. It's a rebuttal of claims, by activists Joseph Mangano and Janette Sherman, that lethal amounts of fallout have reached North America, and that there is a "dramatic increase" in infant mortality in US cities. (!) Of course they have no evidence for a causal link, just correlation. Of course they didn't explain how on earth the microscopic amount of fallout observed in the US could cause any health effects, let alone lethal acute radiation poisoning; or how the obvious symptoms of such a deadly disease are going completely unnoticed. They haven't merely not-demonstrated causation -- they haven't even hit plausibility.

In this vein I think it is overgenerous to consider the numbers behind these epidemiological claims at all; what would a statistically significant correlation mean, if anything? Extraordinary claims demand extraordinary evidence. Nevertheless, Alexey takes on the epidemiological correlation at face value, and finds it substanceless. According to his analysis there is no statistical significance to the alleged "dramatic increase in mortality". It is noise.


A curious case of cherry-picking data for the greater good

Alexey Goldin

A large number of websites (for example, see this) recently furiously discussed a finding by Janette Sherman and Joseph Mangano claiming that radioactive fallout from Fukushima resulted in a statistically significant increase in child mortality. As in my daily job I rely very heavily on being able to learn some facts around the world around from very noisy data using statistical methods, I got curious. I was especially interested to learn what "statistically significant" means.

I am not a statistician but I play one at work. If a real statistician will analyze these data in a more careful way I'd be be grateful, but I hope my analysis is rigorous enough. It is overkill for debunking, but I hope it can be used as an introduction to R for some curious people who might be interested to check the next outrageous claims for themselves and demonstrate R capabilities for analysis.

The data are available from the Center for Disease Control, but not in a very convenient format. Great thanks to the Capacity Factor blog for compiling the data for 8 cities used in the Janette Sherman and Joseph Mangano analysis. The csv file childMort.csv is included in attached zip file. To repeat my analysis you do not need anything beyond opensource statistical package R and some standard libraries (lubridate, xts, plyr, sandwich,lmtest), installable by using the install.packages function.

Using functions defined in the included file mort.R one can read data like this:

> childM <- get.data("childMort.csv")

The average mortality for selected 8 cities for 10 weeks after Fukushima:

> dp10 <- mean(childM$Deaths["2011-03-20::2011-05-28"])
> dp10

12.5

And before:

> dm4 <- mean(childM$Deaths["2011-02-26::2011-03-19"])
> dm4

9.25

The plot of mortality vs time:

Indeed, if you squint enough you'll see that post-Fukushima mortality is higher. It looks grim. However it is not obvious why the authors of the study use 10 weeks after March 19 but only 4 before. There is a lot of data available before and using more data always allows you to estimate averages with better precision. If we use all 12 weeks of 2011 before March 19, surely we can get a better estimate of average mortality then using only 4 weeks. Let's try:

> dm12 <- mean(childM$Deaths["2011-01-01::2011-03-19"])
> dm12

12.25

12.25 is much closer to post-Fukushima average weekly value of 12.5.

Let's not stop here and use all of the data available, starting from 2007:

> dm2007 <- mean(childM$Deaths["2007-01-01::2011-03-19"])
> dm2007

13.7899543378995

It definitely looks like average post-Fukushima mortality is lower. You could say "But it is just a statistical fluke!" and would be (partially) right.

Partially, because there is a clear downward trend in child mortality which Fukushima (visually) does not seem to influence very much. But we will return to this trend a bit later.

Let's see if there is more evidence for conclusion-based evidence drawing. If you loaded my code you should have dataframes "cities" and "cities.cdc" defined in your workspace. Here are the cities used in Sherman and Mangano analysis.

> library(maps)
> map(database = "state", regions = c("CA", "OR", "WA", "ID"))
> points(cities.cdc$long, cities.cdc$lat, pch = 16, col = "blue")
> points(cities$long, cities$lat, pch = 16, col = "red")

Blue dots show the cities available in CDC database but not included in the study. Red dots are the cities in the study. Usually every scientist is trying to use as much data as available. There might be a reason for omitting cities south of San Jose, but what are the reasons for omitting Spokane, WA and Tacoma, WA, but including Boise, ID? What is the reason for this?

I downloaded 2011 data for both cities (if you load my code you'll have variables spokane and tacoma). The averages after Fukushima and before:

> mean((spokane + tacoma)["2011-03-20::2011-05-28"])

1.5

> mean((spokane + tacoma)["2011-02-26::2011-03-19"])

1.66666666666667

And it looks like the average death rate is decreasing for these two cities. This might be a good reason to not include them.

At this point it looks quite likely that Sherman and Mangano picked up their indicator (ratio of average death rate for 10 weeks later after the event to average rate for 4 weeks before the event) to be able to claim relationship between child mortality and Fukushima event that may not be there. But let's investigate how good their indicator is. We suspect that they artificially constructed this indicator to spike on March 19. Would it spike on other dates?

The following function averages mortality on next 10 weeks, previous 4 weeks and returns the value. When its value reaches 1.35, average 10 week mortality exceeded previous 4 weeks mortality by 35%.

> tr.indicator <- function(lr, nback = 4, nforward = 10) {
>     lr4 <- rollapply(lr$Deaths, nback, mean, align = "right", 
>         na.pad = TRUE)
>     lr10 <- lag(rollapply(lr$Deaths, nforward, mean, align = "left", 
>         na.pad = TRUE), 1)
>     as.xts(lr10/lr4)
> }
> rind <- tr.indicator(childM)
> plot(rind, main = "Sherman-Mangano indicator")
> abline(h = 1.35)

The horizontal line is exactly where value of indicator is 1.35, or where the average number of deaths for the next 10 weeks is higher then average number of deaths for the previous 4 weeks. You see that it spikes at March 19, 2011 but it also spikes (and sometimes significantly higher) on numerous other occasions.

What about Jan 2011? And what about late 2009? Where infants dying in large numbers then? Could something be happening that we missed? Were there numerous unannounced reactors melting down without anyone noticing? Why Sherman and Mangano were silent on Jan 2011? Or maybe Jan 2011 (just like March 19) was just a statistical fluke?

Let's try another approach. We take our data and resample them, by taking them in random order. Any time dependent pattern will be scrambled, and whatever remains after applying the same 10 week - 4 week indicator is just an artifact of random distribution.

> set.seed(1)
> s1 <- sample(as.numeric(childM$Deaths), nrow(childM), replace = TRUE)
> sxts1 <- xts(s1, order.by = index(childM))
> names(sxts1) <- "Deaths"
> rind1 <- tr.indicator(sxts1)
> plot(rind1, main = "Sherman-Mangano indicator for random sample")
> abline(h = 1.35)

It is obvious that this indicator very often sets off alarms when there is nothing to predict at all, the data we fed to it were random by design.

Let's try to estimate the probability of seeing this indicator exceeding some value of 1.35 on any particular day. We can do it with the following simple method. Let's generate a thousand random sequences which are 14 weeks long with numbers drawn from observed mortality data, calculate our indicator and count how often we exceed value of 1.35:

> mk.boot14 <- function(cm) {
>     d1 <- sample(as.numeric(cm$Deaths), 14, replace = TRUE)
>     d14 <- as.xts(d1, order.by = index(cm)[1:14])
>     names(d14) <- "Deaths"
>     d14
> }
> set.seed(2)
> N <- 1000
> res <- rep(NA, N)
> for (i in 1:N) {
>     d14 <- mk.boot14(childM)
>     res[i] <- coredata(tr.indicator(d14)[4, 1])
> }
> plot(density(res), main = "Sherman-Mangano indicator density distribution")
> abline(v = 1.35)
> s <- sum(res >= 1.35)/N
> s

0.062

A total 0.062 of all samples in a completely random sequence have next 10 weeks average 35% higher then previous 4 weeks average. This is not considered significant even when researcher is not hand picking data subset (cities) which to analyze and not picking his indicator to peak at predetermined date which is probably what was happening (consciously or unconsciously) in this Sherman-Mangano "study".

In fact with significance of 0.062 by selecting from 16 independent samples one is almost guaranteed to find a sample which will demonstrate 35% higher death rate in next 10 weeks compared to previous 4.

Can we take into account the general very fortuitous downward trend in infant mortality? We could modify our bootstrap procedure to take it into account but for educational purposes we will take another approach, described in this great tutorial. The data follows Poisson distribution, therefore we are using Poisson regression to estimate trend and impact of Fukushima factor.

We put our data in dataframe, add a Fukushima factor equal to 1 after March 19, 2011 and 0 before and add a column for linear trend.

> childM$F <- 0
> childM$F["2011-03-20::2011-06-12"] <- 1
> dt <- data.frame(childM)
> ind <- index(childM)
> trend <- (julian(ind) - julian(ind[1]))/365.25
> dt$trend <- trend
> fit <- glm(formula = Deaths ~ trend + F, family = poisson, data = dt)
> library(lmtest)
> library(sandwich)

Here is the fit summary via coeftest:

> ct <- coeftest(fit, vcov = sandwich)
Mortality data fit results
Estimate Std. Error z value Pr(> |z|)
(Intercept) 2.82 0.04 78.51 0.00
trend -0.09 0.02 -5.90 0.00
F 0.08 0.10 0.81 0.42

We see that while trend (fortunately) has a very high significance (better then 1e-8), significance of the F (Fukushima) term is 42 %. This means that without anything happening at all on March 19 we would have this value for coefficient F or higher with a probability of 42 % due to simply random effects. This is because we used not only 4 weeks before the event, but all available data to estimate background death rate. Intercept is a natural log of average death rate.

To estimate average mortality for weeks following Fukushima disaster using all available data and taking into account downward trend we can fit the model using all before-Fukushima data and use this model to predict averages for post-Fukushima. We use argument "weights" to limit our model to the period when F=0 (before Fukushima) and do not use F term in this model:

> w <- 1 - dt$F
> fit.0 <- glm(formula = Deaths ~ trend, family = poisson, data = dt, 
>     weights = w)
> pr <- predict(fit.0, data = dt, type = "response")
> mean(pr[dt$F > 0])

11.1069765487832

The number above is the expected average weekly infant mortality assuming linear trend is correct. Of course, linear trend can not last forever as it would mean mortality would become negative. But it may be a reasonable estimate for a short term ( if you play with the data try to fit quadratic model as well, using additional term I(trend^2). ) So the expected average daily mortality using all data from 2007 is, unfortunately, higher then which we were led to believe. Post-Fukushima excess is much less significant, which is demonstrated in the significance table above.

To plot original mortality data together with fits:

> plot(childM$Deaths)
> prd <- xts(predict(fit, type = "response"), order.by = index(childM))
> lines(prd, col = "red")
> prd.0 <- xts(predict(fit.0, type = "response", data = dt), order.by = index(childM))
> lines(prd.0, col = "blue")

The model with Fukushima factor is plotted with red color, without it -- with blue. They are very close, overlapping everywhere but after Fukushima. You can see that there is just not enough data to see the difference.

At this point it is worthwhile to question either the scientific integrity or statistical competence of Sherman and Mangano. They might be decent people and believe in what they say, but allow themselves to say "small lies" in a service of "Greater Truth". This never ends up well. Because they are likely to kill some unstable people with their small lies.

Now, if someone could find me the data used to "prove" that there are leukemia clusters next to nuclear plants....

The promised attached zip with the data and code is here.

Russian nuclear industry hurt by fatal plane crash

A commenter calls attention to the recent airplane disaster in Russia. Several leading nuclear-power experts lost their lives (they were on a delegation to inspect a prospective nuclear supplier).

http://uvdiv.blogspot.com/2011/06/nuclear-phaseouts-spreading-japanese.html?showComment=1308665098289#c5223789704525534645

Please pass the word

http://www.cnn.com/2011/WORLD/europe/06/21/russia.plane.crash/index.html?hpt=hp_

Several prominent members of Russian nuclear community lost their lifes

  • Sergey Ryzhov, Reactor Chief Designer, Gidropress
  • Gennady Banyuk, his deputy, Gidropress
  • Nikolay Trunov, Reactor Designer and Department Head, Gidropress
  • Andrey Trofimov, Chief Technology Officer, Afrikantov Bureau
  • Valery Lyalin, Department Head, Rosatom

Reporting from India newspapers:

[Times of India] Designer of Indian N-reactor killed in Russian plane crash

The whole leadership of the designers units of Russia's state nuclear corporation was killed when a Tu-134 passenger plane crash landed in the northern republic of Karelia.

[The Hindu] Kudankulam reactor designers among those killed in Russia aircrash

Three top officials of Russia’s main nuclear reactor design company, Gidropress, were killed in the aircrash along with two other senior nuclear engineers. Gidropress CEO and Designer General Sergei Ryzhov, Deputy CEO and Chief Designer Gennady Banyuk and Chief Designer Nikolai Trunov were all involved in designing two VVER-1000 (Version V-412) nuclear reactors for the the first stage of the Kudankulam power project in Tamilnadu. Another four reactors of this type are to be built at Kudankulam under second and third stages of the plant’s expansion.

VVER-1000/VVER-1200 is a PWR that is currently exported to several countries (with, I think, integrated fuel supply agreements). Notably including, China, India (which has neither the industry nor enrichment plants for domestic LWRs), and Iran (Bushehr NPP).