Artificial intelligence programs built by Alibaba () and Microsoft ( ) have beaten humans on a Stanford University reading comprehension test.
“This is the first time that a machine has outperformed humans on such a test,” Alibaba said in a statement Monday.
The test was devised by artificial intelligence experts at Stanford to measure computers’ growing reading abilities. Alibaba’s software was the first to beat the human score.
Luo Si, the chief scientist of natural language processing at the Chinese company’s AI research group, called the milestone “a great honor,” but also acknowledged that it is likely lead to a significant number of workers losing their jobs to machines.
The technology “can be gradually applied to numerous applications such as customer service, museum tutorials and online responses to medical inquiries from patients, decreasing the need for human input in an unprecedented way,” Si said in a statement.
Alibaba has already put the technology to work on Singles Day, the world’s biggest shopping bonanza, by using computers to answer a large number of customer service questions.
In a tweet, Pranav Rajpurkar, one of the Stanford researchers who developed the reading test, called Alibaba’s feat “a great start to 2018” for artificial intelligence.
The Stanford test generates questions about a set of Wikipedia articles.
For example, a human or AI program reads a passage about the history of British TV show Doctor Who and then answers questions like, “What is Doctor Who’s space ship called?” (Spoiler alert: It’s the TARDIS, for non-Doctor Who fans out there.)
Alibaba’s deep neural network model scored 82.44 on the test on January 11, narrowly beating the 82.304 scored by the human participants. A day later, Microsoft’s AI software also beat the human score, with a result of 82.650.
“These kinds of tests are certainly useful benchmarks for how far along the AI journey we may be,” said Andrew Pickup, a spokesman for Microsoft. “However, the real benefit of AI is when it is used in harmony with humans,” he added.
Facebook (), Tencent ( ) and Samsung ( ) have also previously submitted AI models to the Stanford project.