sqoop 常见错误以及处理方式

时间:2023-03-09 15:49:05
sqoop 常见错误以及处理方式

  

Oracle: Connection Reset Errors
错误代码
// :: INFO mapred.JobClient: Task Id : attempt_201105261333_0002_m_000002_0, Status : FAILED
java.lang.RuntimeException: java.lang.RuntimeException: java.sql.SQLRecoverableException: IO Error: Connection reset
at com.cloudera.sqoop.mapreduce.db.DBInputFormat.setConf(DBInputFormat.java:)
at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:)
at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:)
at org.apache.hadoop.mapred.Child$.run(Child.java:)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:)
at org.apache.hadoop.mapred.Child.main(Child.java:)
Caused by: java.lang.RuntimeException: java.sql.SQLRecoverableException: IO Error: Connection reset
at com.cloudera.sqoop.mapreduce.db.DBInputFormat.getConnection(DBInputFormat.java:)
at com.cloudera.sqoop.mapreduce.db.DBInputFormat.setConf(DBInputFormat.java:)
...  more

这个问题的产生原因是由于缺少一个生成随机数的设备
找到 $JAVA_HOME/jre/lib/security目录下的java.security文件
修改securerandom.source=file:/dev/../dev/urandom

Oracle: Case-Sensitive Catalog Query Errors
下面是错误代码
// :: INFO manager.OracleManager: Time zone has been set to
GMT
// :: DEBUG manager.SqlManager: Using fetchSize for next
query:
// :: INFO manager.SqlManager: Executing SQL statement:
SELECT t.* FROM addlabel_pris t WHERE =
// :: DEBUG manager.OracleManager$ConnCache: Caching
released connection for jdbc:oracle:thin:
// :: ERROR sqoop.Sqoop: Got exception running Sqoop:
java.lang.NullPointerException
java.lang.NullPointerException
at com.cloudera.sqoop.hive.TableDefWriter.getCreateTableStmt(TableDefWriter.java:)
at com.cloudera.sqoop.hive.HiveImport.importTable(HiveImport.java:)
at com.cloudera.sqoop.tool.ImportTool.importTable(ImportTool.java:)
at com.cloudera.sqoop.tool.ImportTool.run(ImportTool.java:)
at com.cloudera.sqoop.Sqoop.run(Sqoop.java:)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:)
at com.cloudera.sqoop.Sqoop.runSqoop(Sqoop.java:)
at com.cloudera.sqoop.Sqoop.runTool(Sqoop.java:)
at com.cloudera.sqoop.Sqoop.runTool(Sqoop.java:)
at com.cloudera.sqoop.Sqoop.main(Sqoop.java:)

解决方式:用户名和表名全用大写

Oracle: ORA- error (SQL command not properly ended)
如果没有指定的话,sqoop会默认使用oracle.jdbc.OracleDriver
Omit选项 --driver oracle.jdbc.driver.OracleDriver 然后重新提交

MySQL: Import of TINYINT() from MySQL behaves strangely
jdbc会把tinyint()认为是java.sql.Types.BIT,然后sqoop就会转为Boolean了,悲剧吧
解决方法:在连接上加上一句话tinyInt1isBit=false
jdbc:mysql://localhost/test?tinyInt1isBit=false

另一种解决方式是
hive使用 --map-column-hive foo=tinyint非hive使用--map-column-java foo=integer