Tuesday, January 26, 2021
  • About Us
  • Contact Us
News i Can
  • Home
  • News
  • Politics
  • Business
  • Tech
  • Travel
  • Sports
  • Video
  • Books and Novels
  • Buy Products
  • Products
No Result
View All Result
News iCan
  • Home
  • News
  • Politics
  • Business
  • Tech
  • Travel
  • Sports
  • Video
  • Books and Novels
  • Buy Products
  • Products
  • About Us
  • Contact Us
No Result
View All Result
News iCan
Home News

This know-it-all AI learns by reading the entire web nonstop

Will Heaven by Will Heaven
September 7, 2020
in News, Tech
0
This know-it-all AI learns by reading the entire web nonstop
0
SHARES
1
VIEWS
Share on FacebookShare on Twitter



Source link

This is a problem if we want AIs to be trustworthy. That’s why Diffbot takes a different approach. It is building an AI that reads every page on the entire public web, in multiple languages, and extracts as many facts from those pages as it can.

Like GPT-3, Diffbot’s system learns by vacuuming up vast amounts of human-written text found online. But instead of using that data to train a language model, Diffbot turns what it reads into a series of three-part factoids that relate one thing to another: subject, verb, object.

Related posts

Daily Memo: US Tariffs on Chinese Goods, Russian Weapons in the Kurils

Daily Memo: Brexit Plan B

December 10, 2020
THE IPO PLAYBOOK

THE IPO PLAYBOOK

December 10, 2020

Pointed at my bio, for example, Diffbot learns that Will Douglas Heaven is a journalist; Will Douglas Heaven works at MIT Technology Review; MIT Technology Review is a media company; and so on. Each of these factoids gets joined up with billions of others in a sprawling, interconnected network of facts. This is known as a knowledge graph.

Knowledge graphs are not new. They have been around for decades, and were a fundamental concept in early AI research. But constructing and maintaining knowledge graphs has typically been done by hand, which is hard. This also stopped Tim Berners-Lee from realizing what he called the semantic web, which would have included information for machines as well as humans, so that bots could book our flights, do our shopping, or give smarter answers to questions than search engines.

A few years ago, Google started using knowledge graphs too. Search for “Katy Perry” and you will get a box next to the main search results telling you that Katy Perry is an American singer-songwriter with music available on YouTube, Spotify, and Deezer. You can see at a glance that she is married to Orlando Bloom, she’s 35 and worth $125 million, and so on. Instead of giving you a list of links to pages about Katy Perry, Google gives you a set of facts about her drawn from its knowledge graph.

But Google only does this for its most popular search terms. Diffbot wants to do it for everything. By fully automating the construction process, Diffbot has been able to build what may be the largest knowledge graph ever.

Alongside Google and Microsoft, it is one of only three US companies that crawl the entire public web. “It definitely makes sense to crawl the web,” says Victoria Lin, a research scientist at Salesforce who works on natural-language processing and knowledge representation. “A lot of human effort can otherwise go into making a large knowledge base.” Heiko Paulheim at the University of Mannheim in Germany agrees: “Automation is the only way to build large-scale knowledge graphs.” 

Super surfer

To collect its facts, Diffbot’s AI reads the web as a human would—but much faster. Using a super-charged version of the Chrome browser, the AI views the raw pixels of a web page and uses image-recognition algorithms to categorize the page as one of 20 different types, including video, image, article, event, and discussion thread. It then identifies key elements on the page, such as headline, author, product description, or price, and uses NLP to extract facts from any text.

Every three-part factoid gets added to the knowledge graph. Diffbot extracts facts from pages written in any language, which means that it can answer queries about Katy Perry, say, using facts taken from articles in Chinese or Arabic even if they do not contain the term “Katy Perry.”

Browsing the web like a human lets the AI see the same facts that we see. It also means it has had to learn to navigate the web like us. The AI must scroll down, switch between tabs, and click away pop-ups. “The AI has to play the web like a video game just to experience the pages,” says Tung.

Diffbot crawls the web nonstop and rebuilds its knowledge graph every four to five days. According to Tung, the AI adds 100 million to 150 million entities each month as new people pop up online, companies are created, and products are launched. It uses more machine-learning algorithms to fuse new facts with old, creating new connections or overwriting out-of-date ones. Diffbot has to add new hardware to its data center as the knowledge graph grows.

Researchers can access Diffbot’s knowledge graph for free. But Diffbot also has around 400 paying customers. The search engine DuckDuckGo uses it to generate its own Google-like boxes. Snapchat uses it to extract highlights from news pages. The popular wedding-planner app Zola uses it to help people make wedding lists, pulling in images and prices. NASDAQ, which provides information about the stock market, uses it for financial research.

Fake shoes

Adidas and Nike even use it to search the web for counterfeit shoes. A search engine will return a long list of sites that mention Nike trainers. But Diffbot lets these companies look for sites that are actually selling their shoes, rather just talking about them.

For now, these companies must interact with Diffbot using code. But Tung plans to add a natural-language interface. Ultimately, he wants to build what he calls a “universal factoid question answering system”: an AI that could answer almost anything you asked it, with sources to back up its response.

Tung and Lin agree that this kind of AI cannot be built with language models alone. But better yet would be to combine the technologies, using a language model like GPT-3 to craft a human-like front end for a know-it-all bot.

Still, even an AI that has its facts straight is not necessarily smart. “We’re not trying to define what intelligence is, or anything like that,” says Tung. “We’re just trying to build something useful.”



Source link

Previous Post

2021 Ram 1500 and HD Limited Night Editions are Tall, Dark, and Handsome

Next Post

NEW Dell G7 15 Gaming Laptop (2020) I Tech Talk

Next Post
NEW Dell G7 15 Gaming Laptop (2020) I Tech Talk

NEW Dell G7 15 Gaming Laptop (2020) I Tech Talk

RECOMMENDED NEWS

This 30+ MPH, $46,000 Aston Martin DB5 Is the Perfect EV for a Speed-Mad Kid

This 30+ MPH, $46,000 Aston Martin DB5 Is the Perfect EV for a Speed-Mad Kid

5 months ago
7 astuces sur iOS 14

7 astuces sur iOS 14

5 months ago
VWO: Last Time Emerging Market Equities Were This Undervalued, Returns Were +29%

VWO: Last Time Emerging Market Equities Were This Undervalued, Returns Were +29%

4 months ago
Scotland’s Golf Courses – Breathtaking Hidden Gems You Need to Include in Your Travel Plans

Scotland’s Golf Courses – Breathtaking Hidden Gems You Need to Include in Your Travel Plans

8 months ago

FOLLOW US

  • 79 Followers
  • 93.2k Subscribers

BROWSE BY CATEGORIES

  • Books and Novels
  • Business
  • News
  • Politics
  • Products
  • Sports
  • Tech
  • Tech Gadgets Video
  • Travel
  • Uncategorized
  • Video

BROWSE BY TOPICS

books electronics mobile phone mobile phone accessories Sports Sports News
PopAds.net - The Best Popunder Adnetwork
ADVERTISEMENT

News Category

  • Books and Novels (661)
  • Business (2,017)
  • News (8,803)
  • Politics (1,356)
  • Products (213)
  • Sports (937)
  • Tech (2,001)
  • Tech Gadgets Video (2,087)
  • Travel (1,832)
  • Uncategorized (1)
  • Video (2,087)

POPULAR NEWS

  • VALE a pena COMPRAR o IPHONE XR em 2020 / 2021?

    VALE a pena COMPRAR o IPHONE XR em 2020 / 2021?

    0 shares
    Share 0 Tweet 0
  • The startup turning human bodies into compost

    0 shares
    Share 0 Tweet 0
  • Meet Windows 7 2020 Edition Concept

    0 shares
    Share 0 Tweet 0
  • The Deutsche Bank whistleblower who gave up $8m is going broke

    0 shares
    Share 0 Tweet 0
  • The War of the Norm

    0 shares
    Share 0 Tweet 0

Related posts

Daily Memo: US Tariffs on Chinese Goods, Russian Weapons in the Kurils

Daily Memo: Brexit Plan B

December 10, 2020
THE IPO PLAYBOOK

THE IPO PLAYBOOK

December 10, 2020

Navigation

  • Home
  • News
  • Politics
  • Business
  • Tech
  • Travel
  • Sports
  • Video
  • Books and Novels
  • Buy Products
  • Products
News

Daily Memo: Brexit Plan B

by Geopolitical Futures
December 10, 2020

Latest News

Daily Memo: Brexit Plan B

2 months ago

THE IPO PLAYBOOK

2 months ago
ADVERTISEMENT

© 2020 www.newsican.com – Premium news & magazine Web site; Designed By SL Creates

No Result
View All Result
  • Home
  • News
  • Politics
  • Business
  • Tech
  • Travel
  • Sports
  • Video
  • Books and Novels
  • Buy Products
  • Products

© 2020 www.newsican.com - Design By SL Creates.

Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?