Google Knowledge Graph

Short description: Knowledge base used by Google to enhance its search engine's results


Knowledge panel data about Thomas Jefferson displayed on Google Search, as of January 2015

The Google Knowledge Graph is a knowledge base from which Google serves relevant information in an infobox beside its search results. This allows the user to see the answer at a glance, as an instant answer. The data is generated automatically from a variety of sources, covering places, people, businesses, and more.[1][2]

The information covered by Google's Knowledge Graph grew quickly after launch, tripling its data size within seven months (covering 570 million entities and 18 billion facts[3]). By mid-2016, Google reported that it held 70 billion facts[4] and answered "roughly one-third" of the 100 billion monthly searches it handled. By May 2020, this had grown to 500 billion facts on 5 billion entities.[5]

There is no official documentation of how the Google Knowledge Graph is implemented.[6] According to Google, its information is retrieved from many sources, including the CIA World Factbook and Wikipedia.[7] It is used to answer direct spoken questions in Google Assistant[8][9] and Google Home voice queries.[10] It has been criticized for providing answers with neither source attribution nor citations.[11]

History

Google announced its Knowledge Graph on May 16, 2012, as a way to significantly enhance the value of information returned by Google searches.[7] Initially available only in English, it was expanded in December 2012 to Spanish, French, German, Portuguese, Japanese, Russian and Italian.[12] Bengali support was added in March 2017.[13]

The Knowledge Graph was powered in part by Freebase.[7]

In August 2014, New Scientist reported that Google had launched a Knowledge Vault project.[14] After publication, Google reached out to Search Engine Land to explain that Knowledge Vault was a research report, not an active Google service. Search Engine Land noted indications that Google was experimenting with "numerous models" for gathering meaning from text.[15]

Google's Knowledge Vault was meant to deal with facts, automatically gathering and merging information from across the Internet into a knowledge base capable of answering direct questions, such as "Where was Madonna born?" As of 2014, the Vault was reported to have collected over 1.6 billion facts, 271 million of which were considered "confident facts", deemed to have a more than 90% chance of being true. It was reported to differ from the Knowledge Graph in that it gathered information automatically instead of relying on crowd-sourced facts compiled by humans.[15]
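As a rough illustration of the idea of "confident facts", the following Python sketch stores facts as subject-predicate-object triples with an estimated probability of being true, keeps only those above a 90% threshold, and uses them to answer a direct question. It is not Google's implementation, which is undocumented; the Fact class, the function names, the predicate labels, and the confidence values are all hypothetical and chosen only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fact:
    """A hypothetical (subject, predicate, object) triple with a confidence score."""
    subject: str
    predicate: str
    obj: str
    confidence: float  # assumed: estimated probability that the fact is true


# Threshold taken from the "more than 90%" figure in the text above.
CONFIDENCE_THRESHOLD = 0.9


def confident_facts(facts, threshold=CONFIDENCE_THRESHOLD):
    """Return only the facts whose confidence exceeds the threshold."""
    return [f for f in facts if f.confidence > threshold]


def answer(facts, subject, predicate):
    """Answer a direct question from confident facts, or return None."""
    candidates = [
        f for f in confident_facts(facts)
        if f.subject == subject and f.predicate == predicate
    ]
    if not candidates:
        return None
    # Prefer the candidate with the highest confidence.
    return max(candidates, key=lambda f: f.confidence).obj


# Illustrative data only; the confidence values are made up.
facts = [
    Fact("Madonna", "born_in", "Bay City, Michigan", 0.97),
    Fact("Madonna", "born_in", "Detroit, Michigan", 0.40),
    Fact("Madonna", "occupation", "singer", 0.99),
]

print(answer(facts, "Madonna", "born_in"))
```

In this sketch, the low-confidence candidate is discarded before the question is answered, so a query with no confident fact returns no answer rather than a guess.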

Criticism

Lack of source attribution

By May 2016, knowledge boxes were appearing for "roughly one-third" of the 100 billion monthly searches the company processed.[11] Dario Taraborelli, head of research at the Wikimedia Foundation, told The Washington Post that Google's omission of sources in its knowledge boxes "undermines people’s ability to verify information and, ultimately, to develop well-informed opinions". The publication also reported that the boxes are "frequently unattributed", such as a knowledge box on the age of actress Betty White, which is "as unsourced and absolute as if handed down by God".[11]

Declining Wikipedia article readership

According to The Register in 2014, the display of direct answers in knowledge panels alongside Google search results caused significant readership declines for Wikipedia, from which the panels obtained some of their information.[16] Also in 2014, The Daily Dot noted that "Wikipedia still has no real competitor as far as actual content is concerned. All that's up for grabs are traffic stats. And as a nonprofit, traffic numbers don't equate into revenue in the same way they do for a commercial media site". After the article's publication, a spokesperson for the Wikimedia Foundation, which operates Wikipedia, stated that it "welcomes" the knowledge panel functionality, that it was "looking into" the traffic drops, and that "We've also not noticed a significant drop in search engine referrals. We also have a continuing dialog with staff from Google working on the Knowledge Panel".[17]

In his 2020 book, Dariusz Jemielniak noted that because most Google users do not realize that many answers to their questions that appear in the Knowledge Graph come from Wikipedia, this reduces Wikipedia's popularity, which in turn limits the site's ability to raise new funds and attract new volunteers.[18]

Bias

The algorithm has been criticized for presenting biased or inaccurate information, usually because it sources information from heavily search-engine-optimized websites. It was noted in 2014 that while there was a Knowledge Graph entry for most major historical or pseudo-historical religious figures such as Moses, Muhammad and Gautama Buddha, there was none for Jesus, the central figure of Christianity.[19][20] On June 3, 2021, a knowledge box identified Kannada as the ugliest language in India, prompting outrage from the Kannada-language community; the state of Karnataka, where most Kannada speakers live, also threatened to sue Google for damaging the public image of the language. Google promptly changed the featured snippet for the search query and issued a formal apology.[21][22]

References

  1. "About knowledge panels - Knowledge Panel Help". https://support.google.com/knowledgepanel/answer/9163198?hl=en. 
  2. "Your business information in the Knowledge Panel". Google Inc. https://support.google.com/business/answer/6331288.
  3. Newton, Casey (December 4, 2012). "Google's Knowledge Graph tripled in size in seven months". CBS Interactive. https://www.cnet.com/news/googles-knowledge-graph-tripled-in-size-in-seven-months/. 
  4. Vincent, James (October 4, 2016). "Apple boasts about sales; Google boasts about how good its AI is". Vox Media. https://www.theverge.com/2016/10/4/13122406/google-phone-event-stats. 
  5. "A reintroduction to our Knowledge Graph and knowledge panels" (in en). 2020-05-20. https://blog.google/products/search/about-knowledge-graph-and-knowledge-panels/. "It’s a system that understands facts and information about entities from materials shared across the web, as well as from open source and licensed databases. It has amassed over 500 billion facts about five billion entities." 
  6. Ehrlinger, Lisa; Wöß, Wolfram (2016). "Towards a Definition of Knowledge Graphs". http://ceur-ws.org/Vol-1695/paper4.pdf. 
  7. Singhal, Amit (May 16, 2012). "Introducing the Knowledge Graph: Things, Not Strings". Google Official Blog. http://googleblog.blogspot.com/2012/05/introducing-knowledge-graph-things-not.html.
  8. Lynley, Matthew (May 18, 2016). "Google unveils Google Assistant, a virtual assistant that's a big upgrade to Google Now". Oath Inc. https://techcrunch.com/2016/05/18/google-unveils-google-assistant-a-big-upgrade-to-google-now/.
  9. Kovach, Steve (October 4, 2016). "Google is going to win the next major battle in computing". Axel Springer SE. https://www.businessinsider.com/why-google-assistant-will-win-the-ai-race-2016-10. 
  10. Bohn, Dieter (May 18, 2016). "Google Home: a speaker to finally take on the Amazon Echo". Vox Media. https://www.theverge.com/2016/5/18/11688376/google-home-speaker-announced-virtual-assistant-io-2016. 
  11. Dewey, Caitlin (May 11, 2016). "You Probably Haven't Even Noticed Google's Sketchy Quest to Control the World's Knowledge". The Washington Post. https://www.washingtonpost.com/news/the-intersect/wp/2016/05/11/you-probably-havent-even-noticed-googles-sketchy-quest-to-control-the-worlds-knowledge/.
  12. Newton, Casey (December 14, 2012). "How Google is taking the Knowledge Graph global". CBS Interactive. https://www.cnet.com/news/how-google-is-taking-the-knowledge-graph-global/. 
  13. "Making it easier to Search in Bengali" (in en-US). Official Google India Blog. https://india.googleblog.com/2017/03/making-it-easier-to-search-in-bengali.html. 
  14. Hodson, Hal (August 20, 2014). "Google's fact-checking bots build vast knowledge bank". https://www.newscientist.com/article/mg22329832-700-googles-fact-checking-bots-build-vast-knowledge-bank/. 
  15. Sterling, Greg (August 25, 2014). "Google "Knowledge Vault" To Power Future Of Search". https://searchengineland.com/google-builds-next-gen-knowledge-graph-future-201640.
  16. Orlowski, Andrew (January 13, 2014). "Google stabs Wikipedia in the front". https://www.theregister.co.uk/2014/01/13/google_stabs_wikipedia_in_the_front. 
  17. Kloc, Joe (January 8, 2014). "Is Google accidentally killing Wikipedia?". https://www.dailydot.com/news/wikipedia-falling-traffic-meaning/. 
  18. Jemielniak, Dariusz; Przegalinska, Aleksandra (18 February 2020). Collaborative Society. MIT Press. ISBN 978-0-262-35645-9. https://books.google.com/books?id=yLDMDwAAQBAJ. 
  19. Schwartz, Barry (July 8, 2014). "Why Does Google Exclude Jesus Christ From The Knowledge Graph". Search Engine Roundtable. https://www.seroundtable.com/google-knowledge-graph-jesus-christ-religion-18814.html. Retrieved May 29, 2016. 
  20. Wolford, Josh (July 8, 2014). "Google Has a Jesus-Shaped Hole in Its Graph". WebProNews. http://www.webpronews.com/google-has-a-jesus-shaped-hole-in-its-graph-2014-07/. Retrieved May 29, 2016. 
  21. "Why Google showed Kannada as 'ugliest language of India': Explained" (in en). Hindustan Times. 4 June 2021. https://www.hindustantimes.com/india-news/why-google-showed-kannada-as-ugliest-language-of-india-explained-101622784047314.html. 
  22. Ives, Mike; Mozur, Paul (4 June 2021). "India's 'Ugliest' Language? Google Had an Answer (and Drew a Backlash).". The New York Times. https://www.nytimes.com/2021/06/04/world/asia/google-india-language-kannada.html.