Speed Bumps: Why It’s So Hard to Catch Cheaters in Track and Field

Earlier this month, London’s Sunday Times and German broadcaster ARD published a joint investigation on doping in track and field that included an analysis of 12,000 leaked blood tests from 5,000 athletes between 2001 and 2012. The tests had been carried out by the IAAF, track and field’s international governing body. Two respected experts in doping methods said blood tests of 800 of the athletes were “highly suggestive of doping or at the very least abnormal.” Ten runners who won medals in endurance events at the 2012 Summer Olympics in London had suspicious test results. And a startling 80 percent of Russian medalists recorded tests that showed likely doping. The vast majority of athletes with suspicious tests were never sanctioned.

Former Team Members Accuse Famed Coach Alberto Salazar of Breaking Drug Rules

Some top runners and others who’ve worked with Salazar allege that he experimented with testosterone and pressured athletes to use prescription medications they didn’t need to gain a performance benefit. Read the story.

Salazar Disputes Allegations — Some of Which Were Never Made

In a lengthy response to stories by ProPublica and the BBC, Salazar addresses the allegations of former athletes and staff that he broke drug rules. Read the story.

On Saturday, the 2015 track and field world championships kick off and, of course, some athletes who are doping will vie for medals. Most will not be caught; only 1 to 2 percent of tests in international Olympic sports result in sanctions each year. If doping is so rife in track and field, why are athletes penalized so rarely? It’s partly because many suspicious tests don’t quite reach the high evidence bar to be considered officially positive. But it’s also because doping athletes tend to employ methods that make drug testing extremely difficult. As Paul Scott, head of Scott Analytics, which provides testing services in multiple sports has put it: “Drug testing has a public reputation that far exceeds its capabilities.”

Here’s a look at why drug tests will never snare every cheater.

Looking for a (tiny) edge

Top-tier track and field has become so competitive that the margin of victory is often vanishingly small. In the men’s 100 meters at the last Olympics, the difference between gold and silver was .12 seconds, less than the time it would take you to blink if a flashlight were shined in your face. The difference between silver and bronze was less than half that.

The tiny gap between winning and losing has led athletes to look for what they call marginal gains, whether that comes from extra sleep, better equipment or cheating. It also means that athletes needn’t take the industrial strength drugs that some baseball players and Soviet Bloc athletes famously took. The most popular doping agents today are synthetic versions of natural hormones: testosterone and human growth hormone — which aid muscle building and workout recovery — and EPO, which causes the body to produce more oxygen-carrying red blood cells. Athletes have learned they can take small amounts — known as “microdosing”— to evade detection and still get the benefits.

Why is it so difficult to detect?

For starters, accurately measuring the presence of tiny concentrations of drugs — particularly synthetic versions of natural hormones — is difficult. For the sake of calling a test positive, it’s even more difficult. Consider the ubiquitous anti-doping test known as the T/E ratio. “T” is testosterone and “E” is another hormone called epitestosterone, a natural product of steroid metabolism that provides no benefit. Most people have a T/E ratio of 1-to–1. But there is natural variation among people, so the World Anti-Doping Agency (WADA) set the T/E ratio limit at 4-to–1. If a test goes above that, it is deemed suspicious and testing for synthetic testosterone ensues. This gives an athlete with a typical T/E ratio room to dope before hitting 4-to–1, and even small amounts of testosterone provide benefit. To make matters worse for drug testers, for many people, an elevated T/E ratio will quickly return to normal, even as the benefits of the drug are just beginning. Christiane Ayotte, director of the WADA-accredited lab in Montreal, knows that athletes are slipping through the porous T/E screen every day. She once said that she “cannot retire until we’ve found a better probe.” This is one reason why it’s important that athletes are tested repeatedly and, ideally, outside of competition without advance notice.

Still, athletes focused on cheating can find a way. A clever doper might add epitestosterone to the testosterone they take to keep their ratio within allowable limits. That was what made “the cream” of BALCO-scandal fame so difficult to detect.

Even if the T/E ratio test were perfect, some athletes would slip through simply because their genes alter how testosterone shows up in their urine. In these athletes, their T/E ratio wouldn’t rise even if they took testosterone, and for some, it actually falls. In the first study documenting this, two-thirds of Koreans and 10 percent of Swedes tested had the sail-past-drug-testing physiology.

So what can be done?

There’s a testing method called CIR — carbon isotope ratio testing — that does not rely on the T/E ratio and can distinguish between natural and synthetic testosterone. Ayotte has caught athletes in random testing with CIR even though they had normal T/E ratios. CIR measures the ratio of types of carbon atoms in urine — which differs between natural and synthetic testosterone. Sprinter Justin Gatlin, a favorite to win the 100 at the world championships, was sanctioned based on a CIR test even though he didn’t go above a T/E of 4-to–1. But the CIR test is costly and laborious, so it’s typically only done to follow up a suspicious T/E ratio. And it’s also far from foolproof. An athlete might stick to small doses because the test isn’t sensitive enough to detect the synthetic testosterone in urine at low concentrations. It’s also likely that kitchen chemists who work in sports doping are engineering synthetic testosterone with just the right ratio to beat the CIR tests. The hope of drug testers is that by combining various testing methods they raise the chances of catching a habitual cheater.

What about the Athlete Biological Passport?

The biological passport is a newer method of doping detection that tracks particular blood variables for individual athletes over time. It debuted in 2009 and has been updated since then. By monitoring things like the percentage of new red blood cells and the amount of oxygen-carrying hemoglobin, the passport documents physiologic trends for each athlete. In this way, a baseline profile is established — basically a minimum and maximum value for each variable for that athlete. The athlete could get in trouble if a future test shows a variable well outside the profile.

Before the passport, testers needed to detect a drug — or the chemicals that the drug breaks down into — in the body. The passport simply documents the effects of the drug. Thus, it has a longer testing window, and can detect previously undetectable doping methods, like when an athlete transfuses their own blood. Known as “blood doping,” athletes remove and refrigerate their blood, wait until their bodies regenerate the blood supply, then transfuse the refrigerated blood. The athlete ends up with a significant advantage: a lot more oxygen-carrying red blood cells. Because it’s their own blood, none of the older doping tests would pick it up. The passport, though, would document the reaction of the athlete’s body. When stored blood is reinjected, the athlete’s body would ramp down production of red blood cells. If the opposite occurred — an athlete’s body produced an unusually high proportion of new red blood cells — it could indicate the use of injected EPO, which signals the body to make them.

After the biological passport was introduced in cycling, the percentage of tests that showed unusual proportions of new red blood cells was more than cut in half, suggesting the test was having some deterrent effect. Lance Armstrong famously posted a series of his drug tests from 2008 and 2009 to prove that he wasn’t doping after his comeback. He didn’t fail any single test, but taken as a timeline, the tests look like the signature of blood transfusions. (When Armstrong confessed doping to Oprah in 2013, he still denied doping post-comeback.)

Okay, then why wasn’t he sanctioned?

So many devils in so many details. It is highly unlikely that Armstrong’s blood profile could’ve occurred naturally. And yet, it wasn’t so conspicuous as to reach the level required for a definitive positive.

In order for such a review to be triggered, the offending test result has to be so unusual that there’s a 99.9 percent chance that it’s a true positive. So if there’s only a 99 percent chance, that’s not good enough. This means there’s room for athletes who are very likely, but not conclusively, doping to slip through. In one study, for example, when the 99 percent probability was used, 10 of 11 subjects who were transfusing blood as part of the anti-doping research were caught through biological passport testing. But there was also one false positive. When the probability limit was set to 99.9 percent, only eight of 11 doping subjects were caught, but with no false positives. (And these subjects weren’t making specific efforts to avoid detection, as pro athletes often do.) Anti-doping is like the criminal justice system in the sense that it is constructed to keep the number of false positives to a minimum at the cost of letting some false negatives slip through what are, in fact, pretty big cracks.

Yikes, so the burden of proof is really on the testers.

Yes, and because the passport constitutes a very indirect form of drug testing — unlike some pre-employment testing which looks for direct metabolites of drugs like cocaine — athletes get the chance to try to explain abnormal results. And there actually are some good explanations. Blood count measurements can vary 10 percent or more just based on how hydrated an athlete is, the time of day, even the athlete’s body position during the test, not to mention training at altitude or sleeping in tents that simulate altitude.

And don’t forget natural human variation. Populations of elite athletes tend to include at least some people with physiology that is rather extreme compared to most normal people. For example, one recent U.S. gold medalist naturally has a T/E ratio of 11-to–1, and a cross-country skier who won seven Olympic medals famously had 50 percent more red blood cells than his peers due to a rare genetic mutation. The line for a positive test has to be set very conservatively because we know there are natural outliers, particularly among pro competitors. Even athletes with clearly abnormal results sometimes walk away clean, and some of them actually are clean.

What about testing for human growth hormone?

Similar difficulties, only worse. Back in 2013, the NFL and the NFL players’ union were bickering over whether to implement HGH testing. Rarely discussed was the fact that the test probably wasn’t going to catch anyone anyway. The most common test for HGH is called the “isoform test.” The isoform test looks for a ratio of different weights (or isoforms) of growth hormone in the body. One isoform weighs 20 kilodaltons and the other 22 kilodaltons. Synthetic HGH comes only in the 22-kilodalton variety. So if an athlete injects synthetic HGH, the drug throws off the ratio of isoforms in the body, and the test looks for the altered ratio. But the ratio corrects itself in hours. Plus, to account for natural variation, the limit for a positive test is set way beyond normal, which means an athlete using HGH would really have to get unlucky with test timing to get caught. In a study, even the subjects who were intentionally doped with HGH for research did not quite reach positive-test territory.

When the NFL and the players’ union were having their spat, nobody mentioned that of 10,000 HGH tests around the world, only a dozen had come back positive. Importantly, one of those tests — a cross-country skier’s — was overturned by the Court of Arbitration for Sport, which handles the final appeal if an athlete contests a doping sanction. The court deemed the “decision limit” for a positive test had not been sufficiently proven as scientifically valid. In its ruling, even the CAS panel seemed to acknowledge the skier—who had a very abnormal test result—was probably cheating. But the court wanted more scientific evidence proving that there was a 99.99 percent chance that the skier’s test was a true positive.

For HGH in particular, there is a better test called the “biomarker test,” which looks for changes in blood parameters after HGH injection. But that has generally been sparsely used due to the lack of a steady supply of testing kits.

Is there any hope?

Yes and no. As anti-doping authorities collect more biological passport data, they will have a better picture of what abnormal results look like, and can set the bar for a positive test less conservatively. And new biological markers that can be tested for evidence of doping will surely be discovered. But it is unlikely that anti-doping will reach the point where an athlete who is microdosing and carefully engineering their blood profile can’t potentially slip through unnoticed.

It helps that anti-doping authorities are constantly adding tests. Last year, biological passport profiling for steroids was added to the system that was already looking for blood doping. They’ve also employed DNA analysis to determine when athletes have submitted someone else’s urine. (Think former Minnesota Vikings running back Onterrio Smith and his “Whizzinator.”) Plus, samples from major championships are now stored for 10 years so that they can be re-tested with new methods. The IAAF recently suspended 28 athletes after re-testing samples from previous track and field championships. Those suspensions could result in two American athletes — Kara Goucher and Shalane Flanagan —being upgraded from 10K bronzes in the world championships and Olympics, respectively, to silvers.

Still, even as technology has improved, the proportion of worldwide samples that test positive remains at about 1 to 2 percent year after year. The dopers and anti-dopers may be in technological lockstep. Perhaps the greatest innovation in modern anti-doping is the rise of investigations that lead to “non-analytical positives,” as with Lance Armstrong, who liked to note that he’d never failed a test.

If you find the testing situation in Olympic sports depressing, remember that the WADA-approved testing regimen is the absolute gold standard in sports. Major League Baseball comes the closest — but not all that close — among the major pro sports leagues. The reason more athletes in those leagues aren’t being sanctioned for doping probably isn’t because it isn’t occurring.

Enjoy the game.

Michael J. Joyner is a physiologist and expert in human performance at the Mayo Clinic in Rochester, Minnesota. The views expressed here are his own.

Correction, Aug. 20, 2015: This article originally said blood samples taken at major track championships are stored for eight years. They are stored for 10 years.