Software:GraphLab

From HandWiki
Turi
Developer(s)Carnegie Mellon University
Stable release
v2.2 / July 1, 2013 (2013-07-01)
Written inC++
Operating systemLinux, macOS
TypeMachine learning platform
LicenseProprietary
Websiteturi.com

Turi is a graph-based, high performance, distributed computation framework written in C++. The GraphLab project was started by Prof. Carlos Guestrin of Carnegie Mellon University in 2009. It is an open source project using an Apache License. While GraphLab was originally developed for machine learning tasks, it has also been developed for other data-mining tasks.[1][2]

Motivation

As the amounts of collected data and computing power grow (multicore, GPUs, clusters,clouds), modern datasets no longer fit into one computing node. Efficient distributed parallel algorithms for handling large-scale data are required. The GraphLab framework is a parallel programming abstraction targeted for sparse iterative graph algorithms. GraphLab provides a programming interface, allowing deployment of distributed machine learning algorithms.[3] The main design considerations behind the design of GraphLab are:

  • Sparse data with local dependencies
  • Iterative algorithms
  • Potentially asynchronous execution

GraphLab toolkits

On top of GraphLab, several implemented libraries of algorithms:

  • Topic modeling - contains applications like LDA, which can be used to cluster documents and extract topical representations.[4]
  • Graph analytics - contains applications like pagerank and triangle counting, which can be applied to general graphs to estimate community structure.[5]
  • Clustering - contains standard data clustering tools such as Kmeans[6]
  • Collaborative filtering - contains a collection of applications used to make predictions about users interests and factorize large matrices.[7]
  • Graphical models - contains tools for making joint predictions about collections of related random variables.[8]
  • Computer vision - contains a collection of tools for reasoning about images.[9]

Award winning software

A solution based on Graphlab collaborative filtering library won the 5th place in the ACM Yahoo! KDD CUP challenge, track1, out of more than 1000 participants. LeBuShiShu team used a mixture of 12 different algorithms and deployed 10,000 CPU hours on the BlackLight supercomputer.[10] Most of the utilized algorithms and techniques are now part of the GraphLab Collaborative FIltering Toolkit.

Turi

Turi (formerly called Dato and before that GraphLab Inc.) is a company that was founded by Prof. Carlos Guestrin from University of Washington in May 2013 to continue development support of the GraphLab open source project. Dato Inc. raised a $6.75M Series A from Madrona Venture Group and New Enterprise Associates (NEA). They raised a $18.5M Series B from Vulcan Capital and Opus Capital, with participation from Madrona and NEA.[11] On August 5, 2016, Turi was acquired by Apple Inc. for $200,000,000.[12][13]

References

  1. Joseph Gonzalez, Yucheng Low, Haijie Gu, Danny Bickson, Carlos Guestrin (2012). "PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs." Proceedings of Operating Systems Design and Implementation (OSDI).
  2. Yucheng Low, Joseph Gonzalez, Aapo Kyrola, Danny Bickson, Carlos Guestrin and Joseph M. Hellerstein (2012). "Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud." Proceedings of Very Large Data Bases (PVLDB).
  3. Y. Low, J. Gonzalez, A. Kyrola, D. Bickson, C. Guestrin and J. Hellerstein. GraphLab: A New Framework for Parallel Machine Learning. In the 26th Conference on Uncertainty in Artificial Intelligence (UAI), Catalina Island, USA, 2010
  4. "GraphLab: Distributed Graph-Parallel API: Topic Modeling". http://docs.graphlab.org/topic_modeling.html. 
  5. "GraphLab: Distributed Graph-Parallel API: Graph Analytics". http://docs.graphlab.org/graph_analytics.html. 
  6. "GraphLab Clustering Library". http://www.select.cs.cmu.edu/code/graphlab/clustering.html. 
  7. "GraphLab: Collaborative filtering library using matrix factorization methods". http://www.select.cs.cmu.edu/code/graphlab/pmf.html. 
  8. "GraphLab: Distributed Graph-Parallel API: Graphical Models". http://docs.graphlab.org/graphical_models.html. 
  9. "GraphLab: Distributed Graph-Parallel API: Computer Vision". http://docs.graphlab.org/computer_vision.html. 
  10. Yao Wu, Qiang Yan, Danny Bickson, Yucheng Low, Qing Yang. Efficient Multicore Collaborative Filtering. In ACM KDD CUP workshop 2011.
  11. Gage, Deborah (2015-01-08). "GraphLab, Now Dato, Raises $18.5M for Machine-Learning Applications". WSJ Blogs. https://blogs.wsj.com/venturecapital/2015/01/08/graphlab-now-dato-raises-18-5m-for-machine-learning-applications/. 
  12. Clover, Juli. "Apple Acquires Machine Learning and AI Startup Turi". http://www.macrumors.com/2016/08/05/apple-acquires-ai-startup-turi/. 
  13. "Exclusive: Apple acquires Turi in major exit for Seattle-based machine learning and AI startup" (in en-US). 2016-08-05. http://www.geekwire.com/2016/exclusive-apple-acquires-turi-major-exit-seattle-based-machine-learning-ai-startup/. 

External links