Software:Sqoop

From HandWiki
Apache Sqoop
Apache Sqoop logo.svg
Developer(s)Apache Software Foundation
Initial release1 June 2009; 15 years ago (2009-06-01)
Final release
1.4.7 / December 6, 2017; 6 years ago (2017-12-06)
RepositorySqoop Repository
Written inJava
Operating systemCross-platform
TypeData management
LicenseApache License 2.0
Websitesqoop.apache.org

Sqoop is a command-line interface application for transferring data between relational databases and Hadoop.[1]

The Apache Sqoop project was retired in June 2021 and moved to the Apache Attic.[2]

Description

Sqoop supports incremental loads of a single table or a free form SQL query as well as saved jobs which can be run multiple times to import updates made to a database since the last import. Imports can also be used to populate tables in Hive or HBase.[3] Exports can be used to put data from Hadoop into a relational database. Sqoop got the name from "SQL-to-Hadoop".[4] Sqoop became a top-level Apache project in March 2012.[5]

Informatica provides a Sqoop-based connector from version 10.1. Pentaho provides open-source Sqoop based connector steps, Sqoop Import[6] and Sqoop Export,[7] in their ETL suite Pentaho Data Integration since version 4.5 of the software.[8] Microsoft uses a Sqoop-based connector to help transfer data from Microsoft SQL Server databases to Hadoop.[9] Couchbase, Inc. also provides a Couchbase Server-Hadoop connector by means of Sqoop.[10]

See also

References

  1. "Hadoop: Apache Sqoop". https://sqoop.apache.org. 
  2. "moving Sqoop to the Attic". http://mail-archives.apache.org/mod_mbox/sqoop-user/202106.mbox/browser. 
  3. "Apache Sqoop - Overview". https://blogs.apache.org/sqoop/entry/apache_sqoop_overview. 
  4. "Introducing Sqoop". https://blog.cloudera.com/blog/2009/06/introducing-sqoop/. 
  5. "Apache Sqoop Graduates from Incubator". https://blogs.apache.org/sqoop/entry/apache_sqoop_graduates_from_incubator. 
  6. "Sqoop Import". Pentaho. 2015-12-10. http://wiki.pentaho.com/display/EAI/Sqoop+Import. "The Sqoop Import job allows you to import data from a relational database into the Hadoop Distributed File System (HDFS) using Apache Sqoop." 
  7. "Sqoop Export". Pentaho. 2015-12-10. http://wiki.pentaho.com/display/EAI/Sqoop+Export. "The Sqoop Export job allows you to export data from Hadoop into an RDBMS using Apache Sqoop." 
  8. "Big Data Analytics Vendor Pentaho Announces Tighter Integration with Cloudera; Extends Visual Interface to Include Hadoop Sqoop and Oozie". Database Trends and Applications (dbta.com). 2012-07-27. http://www.dbta.com/Editorial/News-Flashes/Big-Data-Analytics-Vendor-Pentaho-Announces-Tighter-Integration-with-Cloudera-Extends-Visual-Interface-to-Include-Hadoop-Sqoop-and-Oozie-84025.aspx. "Pentaho’s Business Analytics 4.5 is now certified on Cloudera’s latest releases, Cloudera Enterprise 4.0 and CDH4. Pentaho also announced that its visual design studio capabilities have been extended to the Sqoop and Oozie components of Hadoop." 
  9. "Microsoft SQL Server Connector for Apache Hadoop". https://www.microsoft.com/en-us/download/details.aspx?id=27584. 
  10. "Couchbase Hadoop Connector". http://www.couchbase.com/develop/connectors/hadoop. 

Bibliography

External links