Speed matters during an outbreak, and tight-lipped Chinese officials do not have a good track record of sharing information about diseases, air pollution, or natural disasters. But public health officials at WHO and the CDC have to rely on these very same health officials for their own disease monitoring. So maybe an AI can get there faster. “We know that governments may not be relied upon to provide information in a timely fashion,” says Kamran Khan, BlueDot’s founder and CEO. “We can pick up news of possible outbreaks, little murmurs or forums or blogs of indications of some kind of unusual events going on.”
Already, since Tuesday, the number of diagnosed coronavirus patients in Wuhan has shot up to 729.Using case data scraped from official reports, a team led by Jonathan Read at Lancaster University plotted a temporal map of the coronavirus’s spread, starting on January 1, when local authorities closed the meat-and-animal market where the virus is believed to have crossed into humans from an unknown source .
outbreaksSnakes?! The Slippery Truth of a Flawed Wuhan Virus Theory
infection ratesScientists Predict Wuhan's Outbreak Will Get Much Worse
Would the Coronavirus Quarantine of Wuhan Even Work?Khan says the algorithm doesn’t use social media postings because that data is too messy. But he does have one trick up his sleeve: access to global airline ticketing data that can help predict where and when infected residents are headed next. It correctly predicted that the virus would jump from Wuhan to Bangkok, Seoul, Taipei, and Tokyo in the days following its initial appearance.
Khan, who was working as a hospital infectious disease specialist in Toronto during the SARS epidemic of 2003, dreamt of finding a better way to track diseases. That virus started in provincial China and spread to Hong Kong and then to Toronto, where it killed 44 people. “There’s a bit of deja vu right now,” Khan says about the coronavirus outbreak today. “In 2003, I watched the virus overwhelm the city and cripple the hospital. There was an enormous amount of mental and physical fatigue, and I thought, ‘Let’s not do this again.’”
To uncover those “ potentially infectious materials ,” the Global Polio Eradication Initiative hosts a big table that lists the dates and locations of wild poliovirus outbreaks, and the times each country did live-virus vaccinations, so labs around the world can scan the database and see whether their samples might have originated in a polio-prone area.
After testing out several predictive programs, Khan launched BlueDot in 2014 and raised $9.4 million in venture capital funding. The company now has 40 employees—physicians and programmers who devise the disease surveillance analytic program, which uses natural-language processing and machine learning techniques to sift through news reports in 65 languages, along with airline data and reports of animal disease outbreaks. “What we have done is use natural language processing and machine learning to train this engine to recognize whether this is an outbreak of anthrax in Mongolia versus a reunion of the heavy metal band Anthrax,” Kahn says.
Once the automated data-sifting is complete, human analysis takes over, Khan says. Epidemiologists check that the conclusions make sense from a scientific standpoint, and then a report is sent to government, business, and public health clients.BlueDot’s reports are then sent to public health officials in a dozen countries (including the US and Canada), airlines, and frontline hospitals where infected patients might end up. BlueDot doesn’t sell their data to the general public, but they are working on it, Khan says.
The firm isn’t the first to look for an end-run around public health officials, but they are hoping to do better than Google Flu Trends , which was euthanized after underestimating the severity of the 2013 flu season by 140 percent. BlueDot successfully predicted the location of the Zika outbreak in South Florida in a publication in the British medical journal The Lancet.