5
0
mirror of https://github.com/apache/sqoop.git synced 2025-05-03 02:51:00 +08:00
Commit Graph

245 Commits

Author SHA1 Message Date
Andrew Bayer
44ffd1e75b SQOOP-207. Support import as Avro Data Files.
(Tom White via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150047 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:40 +00:00
Andrew Bayer
5eaafd3243 SQOOP-225. Checkstyle module for detecting trailing white spaces.
This patch adds a checkstyle module to detect trailing white
spaces. It also removed various current instances of trailing
white spaces in the code.

From: Ahmed Radwan <ahmed@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150046 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:39 +00:00
Andrew Bayer
fe9c0666b9 SQOOP-91. Empty "columns" clause leads to null pointer exception
From: Jonathan Hsieh <jon@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150045 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:38 +00:00
Andrew Bayer
bccae3464a SQOOP-224. Table name issue with PostgreSQL.
Recently the PostgresqlManager was updated to escape all identifier
names. This change addresses a couple of places where the identifier
was either not being escaped, or was being lower-case converted as
per the previous logic.

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150044 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:38 +00:00
Andrew Bayer
eac50d0e24 SQOOP-212. Hive import for existing table does not work.
This patch fixes a bug that prevents importing data into
an existing hive table with the 'hive-overwrite' argument set.

From: Ahmed Radwan <ahmed@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150043 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:38 +00:00
Andrew Bayer
cb911f34b4 SQOOP-67. NPE when column name list contains spaces
- e.g. sqoop -import --columns "col1, col2"

From: Jonathan Hsieh <jon@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150042 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:37 +00:00
Andrew Bayer
c0ca0c100b SQOOP-219. Duplicate columns arguments results in code generation compilation error
From: Jonathan Hsieh <jon@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150041 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:37 +00:00
Andrew Bayer
913034315b SQOOP-48: Import bug when splitting over unsigned bigint column
From: Jonathan Hsieh <jon@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150040 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:37 +00:00
Andrew Bayer
b84aaf3ab0 SQOOP-215. Support for codec aliases.
(Tom White via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150039 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:37 +00:00
Andrew Bayer
fe4f54a413 SQOOP-195. Documentation tip on specifying MR job name.
Documentation change to highlight how to use hadoop generic
options to set the MR job name.

From: Ahmed Radwan <ahmed@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150038 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:36 +00:00
Andrew Bayer
0870f9499a SQOOP-158. Additional methods for generated classes.
Adding setter-methods and a field-based equals-implementation to
the generated classes. These new methods enhance the usage of the
generated classes.

(Michael Häusler via ahmed)

From: Ahmed Radwan <ahmed@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150037 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:36 +00:00
Andrew Bayer
dfd9021662 SQOOP-201. Support for arbitrary compression codec.
(Tom White via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150036 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:36 +00:00
Andrew Bayer
7fa8b68fab SQOOP-206. HBase import to use plugin InputFormat.
(Peter Hall via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150035 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:35 +00:00
Andrew Bayer
da297a784b SQOOP-202. sqoop-metastore should be able to handle missing pid files
Signed-off-by: Jonathan Hsieh <jon@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150034 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:35 +00:00
Andrew Bayer
6acded9bb9 SQOOP-204. Allow connector specific bounds query.
This change introduces a new method in ConnManager that allows the
various implementations to optionally override it and specify
custom bounds query used for calculating splits during free form
query based imports.

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150033 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:35 +00:00
Andrew Bayer
f9f45cf55d SQOOP-205. Add getJob() to JobBase
(Garrett Wu via Aaron Kimball)

From: Aaron Kimball <akimball83@gmail.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150032 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:34 +00:00
Andrew Bayer
0efc5a4d55 SQOOP-100. Sqoop to support populating Hive table partitions.
Initial patch by Frank Maritato.

From: Jonathan Hsieh <jon@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150031 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:34 +00:00
Andrew Bayer
0a795d32e4 SQOOP-193. Update Docs to describe --hive-drop-import-delims argument
From: Jonathan Hsieh <jon@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150030 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:34 +00:00
Andrew Bayer
34b7c72165 SQOOP-118. Sqoop should work without HBase where possible.
From: Jonathan Hsieh <jon@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150029 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:33 +00:00
Andrew Bayer
f38e40e760 SQOOP-129. Newlines in RDBMS fields break Hive
From: Jonathan Hsieh <jon@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150028 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:33 +00:00
Andrew Bayer
79fd6b4d84 SQOOP-191. Supplying programmatic configuration.
(Aaron Kimball via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150027 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:33 +00:00
Andrew Bayer
aa41d0635e SQOOP-192. Limit HSQLDB export to single inserts.
(Aaron Kimball via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150026 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:32 +00:00
Andrew Bayer
7677d39b79 SQOOP-187. Allow skipping end-of-record delimiter.
The SqoopRecord.toString() and SqoopRecord.toString(DelimiterSet) methods
always append an end-of-record delimiter. Sqoop uses its own OutputFormat
when rendering these to text files, so that the user's delimiters are
preserved.

Other users could use this OutputFormat when working with SqoopRecord
instances in their own MapReduce code, but it would also be nice to "play
nice" with TextOutputFormat in the event that the intent is
newline-terminated records.

This patch allows users to suppress end-of-record delimiter generation when
formatting records with toString.

(Aaron Kimball via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150025 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:32 +00:00
Andrew Bayer
3bbb4a6314 SQOOP-170. Capturing causal exception for logging.
From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150024 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:32 +00:00
Andrew Bayer
67dce5cc36 SQOOP-185. Tool plugins should not reuse names.
(Aaron Kimball via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150023 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:31 +00:00
Andrew Bayer
a8dd09ddbe SQOOP-183. File rename fails for generated files.
(Angus He via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150022 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:31 +00:00
Andrew Bayer
cf117e6500 SQOOP-179. Adding setField() method to SqoopRecord
This change introduces a setField(fieldName, fieldVal) method for
SqoopRecord instances which would allow an arbitrary programmatic
"setter" function without requiring reflection.

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150021 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:31 +00:00
Andrew Bayer
5ee493ee0c SQOOP-178. Support for tool plugins.
The change will look for ToolPlugin definitions in the
sqoop.tool.plugins configuration entry, or conf/tools.d. Each
ToolPlugin returns a list of ToolDesc entries, which are then
registered with SqoopTool.register() before the user's arguments
are parsed. The user can then run 'sqoop <custom-tool> args...'
as if it were part of the natural Sqoop system.

(Aaron Kimball via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150020 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:30 +00:00
Andrew Bayer
349f707f39 SQOOP-180. Classpath construction in script.
(Aaron Kimball via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150019 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:30 +00:00
Andrew Bayer
9db0f3b4a9 SQOOP-181. Improve formatting of help messages.
(Aaron Kimball via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150018 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:30 +00:00
Andrew Bayer
9219504fd8 SQOOP-166. Postgresql identifiers should be quoted
(James Grant via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150017 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:30 +00:00
Andrew Bayer
9b10f9b2c4 SQOOP-168. NPE due to null binary columns.
(Scott Foster via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150016 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:29 +00:00
Andrew Bayer
5774ddd6db SQOOP-165. Improved log message for unknown types.
(Lars Francke via Arvind Prabhakar)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150015 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:29 +00:00
Andrew Bayer
6f2e5c683a SQOOP-162. Changes for vendor-specific subclasses.
Minor changes to AsyncSqlExecThread to use execute instead of executeUpdate
and to DBRecordReader to allow subclasses to access the configuration object.

From: Ahmed Radwan <ahmed@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150014 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:29 +00:00
Andrew Bayer
a00a320bfa SQOOP-153. Enable tests against real cluster.
From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150013 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:28 +00:00
Andrew Bayer
0b1f47c459 SQOOP-152. Support for test against cluster.
This change allows Sqoop unit tests to be run against a real cluster.

(Konstantin Boudnik via arvind)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150012 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:28 +00:00
Andrew Bayer
d920b8a0e8 SQOOP-140. Control max. number of fetched records.
This change adds the ability of specifying the max. number of fetched records
from the database. This will solve problems that may arise when importing
large tables.

(Michael Häusler via ahmed)

From: Ahmed Radwan <ahmed@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150011 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:28 +00:00
Andrew Bayer
de2fc6c2b3 SQOOP-164. Allow unit tests to use external dbs.
Modified the thirdparty tests to pick host URL from system properties.

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150010 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:27 +00:00
Andrew Bayer
3f8252a28c SQOOP-154. Fix connection leak in OracleManager.
From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150009 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:27 +00:00
Andrew Bayer
49613bb5b7 SQOOP-159. Fixing HBase test failures.
Changes include explicitly setting the Zookeeper client port and increasing
the memory limit from 256m to 512m in build.xml.

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150008 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:27 +00:00
Andrew Bayer
8812957f98 Increment version number and prev.git.hash after branching for release.
From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150007 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:26 +00:00
Andrew Bayer
4e6351f372 SQOOP-148. Use catalog views for OracleManager.
This change updates the OracleManager to use catalog views for resolving
the necessary metadata.

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150006 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:26 +00:00
Andrew Bayer
70caf779b0 SQOOP-142. Document requirements for direct import
Updated the documentation with details on direct mode execution
requirements.

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150005 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:26 +00:00
Andrew Bayer
e33bdbced1 SQOOP-111. Documentation fix.
Sqoop user guide inaccurately claims that Hive does not support escaping
of characters. This change updates the user guide to fix this and make the
claim based on the current capabilities of Hive.

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150004 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:25 +00:00
Andrew Bayer
fbb283f54d SQOOP-143. Simplify test configuration.
This change removes the test that asserts the presence of a non-default hosts
file configuration. It also adds the necessary comments to the PostgresqlTest
to allow configuring the server for default hosts file configuration.

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150003 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:25 +00:00
Andrew Bayer
1d5b7011a9 SQOOP-124. Improve error handling during export.
This change introduces the ability to use a staging table for intermediate
storage during execution for regular export jobs in insert mode. This allows
all of exported data to first be populated in the staging table and then
inserted into the destination table in a single transaction. Thus if a failure
were to occur during export, it is less likely to corrupt the destination
table data. Moreover, the staging table is emptied before the export
job starts populating it, which ensures that re-running the job does not
require any special clean up.

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150002 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:25 +00:00
Andrew Bayer
23cebe14ba SQOOP-139. Doc update for SQOOP-125 changes.
This fix mainly corrects a minor option naming inconsistency in the
documentation.

From: Ahmed Radwan <ahmed@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150001 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:24 +00:00
Andrew Bayer
2de36d7aab SQOOP-141. BlobRef accessor returns incorrect data.
This change fixes the BlobRef implementation to return the appropriate stream
source or byte array from the BytesWritable instance by taking into
consideration the actual data length.

(Peter Hall via arvind)

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1150000 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:24 +00:00
Andrew Bayer
cc288b6784 SQOOP-126. Support for loading options from file.
This change allows Sqoop to load options from an options file. An
options file is specified using --options-file. All options that
are otherwise specified on the command line should be specified
in this file in the order they would otherwise appear on the command
line. Options files can contain empty lines and comments for
readability. More than one options file may be used for a single
tool invocation if so preferred. Leading and trailing spaces are
ignored unless they appear within single or double quotes. Quoted
options extending into multiple lines are not supported.

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1149999 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:24 +00:00
Andrew Bayer
819c1dbb0b SQOOP-138. Fixing intermittent IVY failure.
This change fixes the problem due to which on certain systems IVY
is unable to download the hbase artifacts from maven repository.
It also includes some clean up of documentation and build files
that relate to the removal of shim layer mechanism.

From: Arvind Prabhakar <arvind@cloudera.com>

git-svn-id: https://svn.apache.org/repos/asf/incubator/sqoop/trunk@1149998 13f79535-47bb-0310-9956-ffa450edef68
2011-07-22 20:04:23 +00:00