Software:Knowledge Engine (search engine)

Short description: Search engine project
A screenshot of the Knowledge Graph surrounding a Wikipedia article.
The search example of the Knowledge Engine states "Ad-free, secure, non-profit: Make Wikipedia your default search".[1]

Knowledge Engine (KE) was a search engine project initiated in 2015 by the Wikimedia Foundation (WMF) to locate and display verifiable and trustworthy information from public-information sources[2] in a way that was less reliant on traditional search engines.[3] It aimed to allow readers to stay on Wikipedia.org and other Wikipedia-related projects when looking for additional information rather than returning to proprietary search engines.[3] Its goal was to protect user privacy, to be open and transparent about how a piece of information originates, and to allow access to related metadata.[4]

The development of the project idea was controversial internally, and was not pursued after 2016. Related ideas were applied to the internal cross-wiki search engine for Wikimedia projects.[1]

History

In 2015, WMF applied for a $250,000 grant from the Knight Foundation to support development of the Knowledge Engine. Its grant proposal noted: "Commercial search engines dominate search-engine use of the Internet, and they're employing proprietary technologies to consolidate channels of access to the Internet's knowledge and information."[5] The project was designed in four stages, each scheduled to take about 18 months.[6]

The project planned to draw information from Wikipedia-related projects and eventually to search other sources of public information such as the U.S. Census Bureau.[5] Leaked internal WMF documents stated the "Knowledge Engine By Wikipedia will democratize the discovery of media, news and information—it will make the Internet's most relevant information more accessible and openly curated, and it will create an open data engine that's completely free of commercial interests. Our new site will be the Internet's first transparent search engine, and the first one that carries the reputation of Wikipedia and the Wikimedia Foundation."[2] The new search engine was not expected to immediately replace a general purpose search engine because at first it would only draw on information from Wikipedia and its other free knowledge projects,[5] though it might in time also have included academic and open access sources in its search results.[7] Matt Southern in Search Engine Journal attributed media confusion about the Knowledge Engine's scope to the fact that later WMF statements clarifying the organization's intentions were "quite a contrast to the original grant application documents".[8]

The project was not discussed publicly with the Wikipedia community while the concept was being developed,[9] nor was it part of the existing annual plan.[10] This secrecy was mirrored by a degree of confusion within the organization, and was seen as at odds with the goal of transparency.[2] An initial blogpost by WMF Executive Director Lila Tretikov about the project did not address why the original proposal was so much broader than an internal search engine.[11] Some staff and WMF board members felt the WMF was still not being straightforward with the Wikipedia community.[12] This led to a crisis for the organization[1] and to Tretikov's resignation in February 2016.[11][13]

Design

The goal of the Knowledge Engine was to let readers and editors be less reliant on proprietary search engines when looking for new information.[3] The project proposal asked, "Would users go to Wikipedia if it were an open channel beyond an encyclopedia?"[1]

A screenshot of potential sources used by the Knowledge Engine.
Example of federated data sources potentially used by the Knowledge Engine.

The Knowledge Engine was designed to be open and transparent about how a piece of information originates and allow access to metadata.[4] It would have no advertisements, protect user privacy, and emphasize community building and sharing of information.[4] It would draw information from Wikipedia-related projects and eventually perhaps search other sources of public information such as the U.S. Census Bureau,[5] OpenStreetMap,[14] the Digital Public Library of America,[15] and external sources like Fox News.[1] Jimmy Wales and the WMF stated that the project would focus on improving search on Wikipedia and related Wikimedia projects.[2] The grant application stated that it would "create a model for surfacing high quality, public information on the internet."[2] It also advised that "commercial search engines dominate search-engine use of the internet" and stated that "Google, Yahoo, or another big commercial search engine could suddenly devote resources to a similar project, which could reduce the success of the project."[2]

Development timeline

Information about the project became public only gradually.[2] As early as May 2015, community members asked about the concentration of staff in a new "Search and Discovery" department, though public plans made little or no reference to this work.[2][15] The grant was applied for in mid-2015 and awarded in September, but was publicly announced only in a January 2016 press release.[3]

The project plan had four stages, each scheduled to take about 18 months: Discovery, Advisory, Community and Extension.[6] The initial stage of the project was budgeted to cost $2.5 million,[16] with the project as a whole running to the tens of millions.[1] After a year, the WMF was to evaluate development to date and, at the close of the grant, set plans for the project to continue to the second stage.[6]

Motivation and scope

A screenshot of Google Knowledge with fast facts from a Wikipedia article.
Since mid-2012,[17] Google Search has included fast facts from Wikipedia articles on its search results pages[2] via the Google Knowledge Graph.[17]

A central source of confusion for the project was the extent to which it would directly compete with traditional search engines as a place to search the Web. According to Vice, "the Wikimedia Foundation, the nonprofit that finances and founded Wikipedia, is interested in creating a search engine that appears squarely aimed at competing with Google."[2] According to The Guardian, "there was considerable doubt over what the tool was actually intended to be: a search engine aimed at halting a decline in Wikipedia traffic sent by Google, or simply a service for searching within Wikipedia?"[11]

From 2012, Google Search and other search engines began highlighting brief informational summaries from Wikipedia in knowledge panels alongside search results, reducing traffic to Wikipedia from those search engines.[2] According to Search Engine Watch, this led to a battle for attention,[1] and the Knowledge Engine could have recouped some of that traffic.

Leaked internal documents from early concepts framed the plan more boldly than the final public description.[18] They said the "Knowledge Engine By Wikipedia will democratize the discovery of media, news and information—it will make the Internet's most relevant information more accessible and openly curated, and it will create an open data engine that's completely free of commercial interests. Our new site will be the Internet's first transparent search engine, and the first one that carries the reputation of Wikipedia and the Wikimedia Foundation."[2]

The apparent contradiction between different descriptions of the purpose led to confusion in the media and in the community. In response to speculation, the WMF published a response clarifying its intentions: "We're not building a global crawler search engine ... Despite headlines, we are not trying to compete with other platforms, including Google. As a non-profit we are noncommercial and support open knowledge. Our focus is on the knowledge contributed on the Wikimedia projects. ... We intend to research how Wikimedia users seek, find, and engage with content. This essential information will allow us to make critical improvements to discovery on the Wikimedia projects."[8] Director of Discovery Tomasz Finc added, "we are building an internal search engine, and we are not building a broad one."[1] Jimmy Wales stated that suggestions that the WMF was creating a rival to Google were "trolling", "completely and utterly false", and "a total lie",[2][19] while allowing that the Knowledge Engine might in time include academic and open access sources in its search results.[7]

Matt Southern in Search Engine Journal attributed media confusion about the KE's scope to the fact that this was "quite a contrast to the original grant application documents",[8] an assessment echoed by James Vincent in The Verge,[9] Matt McGee in Search Engine Land,[20] and Jason Koebler in Vice.[21]

Controversy

Many in the community were furious that details of such a large project had been withheld by an organization that prides itself on radical transparency. Wikimedia's public story—that it was never working on a search engine—was directly contradicted by a grant proposal made to the Knight Foundation and leaked internal documents.

 —Jason Koebler, Vice[12]

Large-scale WMF projects are almost always discussed publicly with the Wikipedia community, but this did not happen with the Knowledge Engine development.[9] Wikipedians were unaware of the existence of the project as a concept,[2][22] and the KE project was not mentioned in the WMF's annual plan.[10] According to the English Wikipedia's community newsletter, The Signpost,[23] some community members expressed outrage at the perceived secrecy around it and their lack of ability to give input, and this raised questions about WMF's commitment to transparency with the Wikipedia community.[9]

James Heilman, a member of the WMF's Board of Trustees, noted in The Signpost that while on the Board, he had insisted multiple times that the grant documentation be made public, without success.[19] He was dismissed from the Board in December 2015, and it was suggested that his push for transparency concerning the grant had been a factor in his dismissal—a suggestion rejected by Jimmy Wales.[2] The Wikipedia community re-elected Heilman to the Board in 2017.[24]

Ruth McCambridge said in Nonprofit Quarterly, "Wikipedia editors have been requesting from December for the grant proposal and grant letter for a project that many surmise is a bid to remain technologically cutting-edge by the Wikimedia Foundation, but which may divert resources and attention from other pressing needs of the community."[23]

Commenting on the reluctance to share the grant documents with the community, referencing privacy concerns, McCambridge saw "a major difference in culture and values assumptions" compared to previous Wikimedia practice.[23] McCambridge said that "the power of important strategic decisions" here seemed to rest "between funders and the top of the organizational hierarchy" and was "not shared with volunteer editors."[23]

The WMF initially published only portions of the grant documentation,[25] later making the full grant agreement available in February.[19] Further internal documents were leaked shortly after.[2][9] The full agreement clarified the initial concept for the first stage of the project.[23] Tretikov said she regretted being so late in informing the Wikipedia editing community about the grant.[15]

Longtime Wikipedia editor and journalist William Beutler told Vice Magazine's Jason Koebler, "Leaving aside whether a search engine is a good idea, let alone feasible, the core issue here is about transparency. The irony is that the Wikimedia Foundation failed to observe one of the movement's own core values ...."[21] UK Wikipedia editor Ashley van Haeften told Ars Technica via e-mail that "Lila, Jimmy, and the rest chose to keep the project and the Knight Foundation application and grant a secret until the projects were underway for six months, and even then this only came to light because it was leaked."[18]

Tretikov's initial public post about the Knowledge Engine project did not explain why the original grant proposal had a vision so much grander than the later public plan to develop an internal search engine.[11] Staff who had been uncomfortable about the project's development felt the WMF was not being sufficiently straightforward with the community.[12] According to notes of an internal meeting posted on the WMF's website,[12] a member of the Discovery team said to Tretikov, "My concern is that we still aren't communicating it clearly enough. This morning's blog post is the truth, but not all of the truth. Namely that we had big plans in the past. It would have been much easier to say that we did have big plans, but they were ditched ... we still haven't acknowledged it. We can't deny it."

Erik Möller, deputy director of the WMF until April 2015, described the events as "very much out of control" and "a crisis."[1] Disagreements about the project, and the response to the resulting controversy, led to the departure of many WMF staff members,[26][27] culminating in Tretikov's resignation on February 25, 2016.[11][13]

References

  1. Sentance, Rebecca (March 3, 2016). "Everything you need to know about Wikimedia's 'Knowledge Engine' so far". Search Engine Watch. Archived from the original on January 13, 2017. https://web.archive.org/web/20170113153104/https://searchenginewatch.com/2016/03/03/everything-you-need-to-know-about-wikimedias-knowledge-engine-so-far/.
  2. Koebler, Jason (February 16, 2016). "The Secret Search Engine Tearing Wikipedia Apart". Vice. Archived from the original on February 23, 2016. https://web.archive.org/web/20160223031147/http://motherboard.vice.com/read/wikipedias-secret-google-competitor-search-engine-is-tearing-it-apart.
  3. McGee, Matt (February 15, 2016). "Wikimedia Foundation Secures $250,000 Grant For Search Engine Development". Search Engine Land. Archived from the original on May 23, 2016. https://web.archive.org/web/20160523004937/http://searchengineland.com/wikimedia-foundation-secures-250000-grant-for-search-engine-development-242544.
  4. Singh, Manish (February 16, 2016). "Wikipedia's Upcoming Search Engine to Rival Google; Offer Full Transparency". Gadgets 360. Archived from the original on February 16, 2016. https://web.archive.org/web/20160216181343/http://gadgets.ndtv.com/internet/news/wikipedias-upcoming-search-engine-to-rival-google-offer-full-transparency-803053.
  5. Cuthbertson, Anthony (February 16, 2016). "Wikipedia Takes on Google with New 'Transparent' Search Engine". Newsweek. Archived from the original on February 16, 2016. https://web.archive.org/web/20160216191140/http://www.newsweek.com/wikipedia-takes-google-new-transparent-search-engine-427028.
  6. Crum, Chris (February 15, 2016). "Wikimedia Works On Search Improvements, Says It's Not Competing with Google [Updated]". WebProNews. Archived from the original on July 3, 2016. https://web.archive.org/web/20160703174727/http://www.webpronews.com/knowledge-engine-wikipedia-works-on-new-search-engine-2016-02/.
  7. Greis, Friedhelm (February 15, 2016). "Wirbel um angebliche Wikipedia-Konkurrenz zu Google" (in German). Golem.de. Archived from the original on February 17, 2016. https://web.archive.org/web/20160217092955/http://www.golem.de/news/knowledge-engine-wirbel-um-angebliche-wikipedia-konkurrenz-zu-google-1602-119167.html.
  8. Southern, Matt (February 17, 2016). "Wikimedia Clarifies it is Not Building a Global Web Crawler". Search Engine Journal. Archived from the original on February 18, 2016. https://web.archive.org/web/20160218092927/https://www.searchenginejournal.com/wikimedia-clarifies-it-is-not-building-a-global-web-crawler/156732/.
  9. Vincent, James (February 17, 2016). "Wikimedia says it's not building a search engine to take on Google". The Verge. Archived from the original on September 5, 2017. https://web.archive.org/web/20170905163614/https://www.theverge.com/2016/2/17/11031354/wikipedia-search-engine-wikimedia.
  10. McCormick, Rich (February 26, 2016). "Wikimedia head resigns after leak exposed search engine plans". The Verge. Archived from the original on April 14, 2017. https://web.archive.org/web/20170414112340/http://www.theverge.com/2016/2/26/11118326/wikimedia-head-resigns-search-engine.
  11. Hern, Alex (February 26, 2016). "Head of Wikimedia resigns over search engine plans". The Guardian. Archived from the original on March 28, 2016. https://web.archive.org/web/20160328201350/http://www.theguardian.com/technology/2016/feb/26/wikimedia-head-lila-tretikov-resigns-search-engine-plans.
  12. Koebler, Jason (February 25, 2016). "Wikimedia Foundation Executive Director Resigns Amid a Community Revolt". Vice. Archived from the original on February 26, 2016. https://web.archive.org/web/20160226085015/http://motherboard.vice.com/read/wikimedia-foundation-executive-director-lila-tretikov-resigns.
  13. "Online-Enzyklopädie: Chefin der Wikipedia-Stiftung tritt zurück" (in German). Spiegel Online. February 26, 2016. Archived from the original on March 5, 2016. https://web.archive.org/web/20160305100002/http://www.spiegel.de/netzwelt/web/wikipedia-streit-um-knowledge-engine-lila-tretikov-tritt-zurueck-a-1079448.html.
  14. Shah, Jaymi (February 16, 2016). "Wikimedia Foundation Secures $250,000 Grant For Search Engine Development". Technoledger. Archived from the original on March 2, 2016. https://web.archive.org/web/20160302025607/http://technoledger.com/wikipedia-receives-250k-grant-for-search-engine/. 
  15. Kleinz, Torsten (February 15, 2016). "Wikipedia plant Suchmaschine, aber keinen Google-Konkurrenten" (in German). Heinz Heise. Archived from the original on February 17, 2016. https://web.archive.org/web/20160217093345/http://www.heise.de/newsticker/meldung/Wikipedia-plant-Suchmaschine-aber-keinen-Google-Konkurrenten-3104073.html.
  16. Orlowski, Andrew (February 12, 2016). "Reluctant Wikipedia lifts lid on $2.5m internet search engine project". The Register. Archived from the original on September 1, 2017. https://web.archive.org/web/20170901083007/http://www.theregister.co.uk/2016/02/12/wikipedia_grant_build_search_engine_knight_foundation/. 
  17. Orlowski, Andrew (January 14, 2014). "Google stabs Wikipedia in the front". The Register. Archived from the original on November 13, 2017. https://web.archive.org/web/20171113040947/http://www.theregister.co.uk/2014/01/13/google_stabs_wikipedia_in_the_front/.
  18. Mullin, Joe (February 29, 2016). "Wikimedia Foundation director resigns after uproar over "Knowledge Engine"". Ars Technica. Archived from the original on March 1, 2016. https://web.archive.org/web/20160301082152/http://arstechnica.com/tech-policy/2016/02/head-of-wikimedia-foundation-resigns-as-tensions-with-editors-mount/.
  19. Tual, Morgane (February 16, 2016). "Un projet de moteur de recherche sème la discorde chez Wikipedia" (in French). Le Monde. http://www.lemonde.fr/pixels/article/2016/02/16/un-projet-de-moteur-de-recherche-seme-la-discorde-chez-wikipedia_4866293_4408996.html.
  20. McGee, Matt (February 16, 2016). "Wikimedia Foundation: "We're Not Building A Global Crawler Search Engine"". Search Engine Land. Archived from the original on February 17, 2016. https://web.archive.org/web/20160217211818/http://searchengineland.com/wikimedia-foundation-were-not-building-a-global-crawler-search-engine-242620. 
  21. Koebler, Jason (February 16, 2016). "Wikimedia: We're Really Really Not Building a Search Engine". Vice. Archived from the original on February 17, 2016. https://web.archive.org/web/20160217211815/http://motherboard.vice.com/read/wikimedia-were-really-really-not-building-a-search-engine.
  22. Singh, Manish (February 16, 2016). "Knowledge Engine: Wikimedia Foundation takes aim at Google with $3.5m search project". ABC News (Australia). Archived from the original on February 16, 2016. https://web.archive.org/web/20160216091557/http://www.abc.net.au/news/2016-02-15/wikimedia-foundation-aims-to-take-on-google-in-search/7168840. 
  23. McCambridge, Ruth (February 16, 2016). "Knight Foundation Grant Request Tears at Wikipedia's Community". Nonprofit Quarterly. Archived from the original on February 24, 2016. https://web.archive.org/web/20160224093633/https://nonprofitquarterly.org/2016/02/16/knight-foundation-grant-request-tears-at-wikipedias-community/.
  24. Kolbe, Andreas (June 7, 2017). "Golden handshakes of almost half a million at Wikimedia Foundation". The Register. Archived from the original on October 10, 2017. https://web.archive.org/web/20171010104346/https://www.theregister.co.uk/2017/06/07/golden_handshakes_at_wikipedia/.
  25. Orlowski, Andrew (February 11, 2016). "Move over, Google. Here's Wikipedia's Search Engine – Full of On-Demand Smut". The Register. Archived from the original on July 8, 2017. https://web.archive.org/web/20170708195253/http://www.theregister.co.uk/2016/02/11/wikipedia_search_engine/. 
  26. Price, Rob (February 26, 2016). "The executive director of the nonprofit behind Wikipedia just resigned". Business Insider. Archived from the original on February 28, 2016. https://web.archive.org/web/20160228232942/http://www.businessinsider.com/wikimedia-foundation-executive-director-lila-tretikov-resigns-wikipedia-knowledge-engine-2016-2. 
  27. Noisette, Thierry (February 26, 2016). "Crise à la fondation Wikimedia : sa directrice démissionne" (in French). L'Obs. http://rue89.nouvelobs.com/2016/02/26/crise-a-fondation-wikimedia-directrice-demissionne-263290. 
