5
0
mirror of https://github.com/apache/sqoop.git synced 2025-05-02 23:21:22 +08:00
Go to file
2013-06-16 13:22:51 -07:00
client SQOOP-973: Sqoop2: Merge host, port and app to single server URL in Sqoop shell 2013-06-05 15:02:46 -07:00
common SQOOP-995: Sqoop2: Add cloning ability to model classes 2013-05-23 01:16:00 -07:00
connector SQOOP-1080: Sqoop2: Investigate Jenkins test failures 2013-06-16 06:40:35 -07:00
core SQOOP-1081: destroy() method of ConnectorManager has not been implemented yet. 2013-06-13 20:28:01 -07:00
dev-support Printing out defect if specified 2013-06-16 13:22:51 -07:00
dist SQOOP-1070: Shell scripts should not rely on $PATH if $JAVA_HOME is set 2013-06-09 08:04:59 -07:00
docs SQOOP-973: Sqoop2: Merge host, port and app to single server URL in Sqoop shell 2013-06-05 15:02:46 -07:00
execution SQOOP-1067: Sqoop2: Tests should not use /tmp/ directory for storing data 2013-06-15 15:27:50 -04:00
repository SQOOP-1043: Sqoop2: Improve error messages in Derby repository 2013-05-13 00:23:19 -04:00
server SQOOP-941: Sqoop2: Do not send sensitive values from server to client 2013-03-23 08:03:10 -07:00
spi SQOOP-659: Design metadata upgrade procedure 2013-04-18 21:15:28 -07:00
submission SQOOP-1002: Sqoop2: Mapreduce local mode detection is not working correctly on Hadoop 2.x 2013-05-11 21:22:20 -04:00
test SQOOP-1077: Integration: Create Netezza provider 2013-06-16 02:08:37 -04:00
.gitattributes SQOOP-651: adding .gitattributes to prevent CRLF and LF mismatches for source and text files 2012-10-25 15:40:22 -07:00
.gitignore Ignoring patch-process directory 2013-06-16 07:28:59 -07:00
CHANGELOG.txt Updating CHANGELOG.txt files with recent fixes 2013-04-15 18:55:12 -07:00
LICENSE.txt SQOOP-786: Include jars from WAR archive in LICENSE.txt file 2012-12-17 14:07:23 -08:00
NOTICE.txt SQOOP-1005: Sqoop2: Update year in the notice file to 2013 2013-06-15 15:15:59 -04:00
pom.xml Adding patch-process to RAT ignore 2013-06-16 07:38:18 -07:00
README.txt SQOOP-772: OutOfMemory from document generation 2012-12-17 13:24:28 -08:00

= Welcome to Sqoop

Apache Sqoop is a tool designed for efficiently transferring bulk data between
Apache Hadoop and structured datastores such as relational databases. You can use
Sqoop to import data from external structured datastores into Hadoop Distributed
File System or related systems like Hive and HBase. Conversely, Sqoop can be used
to extract data from Hadoop and export it to external structured datastores such
as relational databases and enterprise data warehouses.

== Documentation

Sqoop ships with documentation, please check module "docs" for additional materials.

More documentation is available online on Sqoop home page:

http://sqoop.apache.org/

== Compiling Sqoop

Sqoop uses the Maven build system, and it can be compiled and built running the
following commands:

  mvn compile # Compile project
  mvn package # Build source artifact
  mvn package -Pbinary # Build binary artifact

Sqoop is using Sphinx plugin to generate documentation that have higher memory
requirements that might not fit into default maven configuration. You might need
to increase maximal memory allowance to successfully execute package goal. This
can done using following command:

  export MAVEN_OPTS="-Xmx512m -XX:MaxPermSize=512m"

Sqoop currently supports multiple Hadoop distributions. In order to compile Sqoop
against a specific Hadoop version, please specify the hadoop.profile property in
Maven commands. For example:

  mvn package -Pbinary -Dhadoop.profile=100

Please refer to the Sqoop documentation for a full list of supported Hadoop
distributions and values of the hadoop.profile property.