Using Big Data Connectors

You can use Oracle Big Data Connectors and Oracle Copy to Hadoop (a feature of Big Data SQL) to load data from an Big Data Service cluster into an Oracle Cloud database instance and to copy from an Oracle Cloud database instance to a Big Data Service cluster. The database can be an Oracle Autonomous Database or a co-managed Oracle Database service, as shown in the following table:

Type of Database Features Supported for Copying Data
Oracle Autonomous Database You can use the following with Oracle Shell for Hadoop Loaders (OHSH) to copy data between a Big Data Service cluster and an Autonomous Database instance.
  • Oracle Loader for Hadoop (OLH)
  • The Copy to Hadoop (CP2HADOOP) feature of Oracle Big Data SQL
Co-managed Oracle Database You can use the following with Oracle Shell for Hadoop Loaders to copy data between a Big Data Service cluster and a co-managed Oracle Database instance.
  • Oracle SQL Connector for HDFS (OSCH)
  • Oracle Loader for Hadoop
  • The Copy to Hadoop feature of Oracle Big Data SQL

Features

Big Data connectors and features are pre-installed on your Big Data Service clusters. The Copy to Hadoop feature of Oracle Big Data SQL is also already installed on your cluster.

The following features are pre-installed on every node of your cluster:

  • Oracle Shell for Hadoop Loaders

    Oracle Shell for Hadoop Loaders (OHSH) is a helper shell that provides a simple-to-use command line interface to Oracle Loader for Hadoop, Oracle SQL Connector for HDFS, and Copy to Hadoop.

  • Copy to Hadoop

    Copy to Hadoop (CP2HADOOP) is a feature of Oracle Big Data SQL, for copying data from an Oracle database to HDFS.

  • Oracle Loader for Hadoop

    Oracle Loader for Hadoop (OLH) is a high-performance loader for loading data from a Hadoop cluster into a table in an Oracle database.

  • Oracle SQL Connector for Hadoop Distributed File System (HDFS)

    Oracle SQL Connector for HDFS (OSCH) enables an Oracle external table to access data stored in HDFS files or in a table in Apache Hive. Use this connector only for loading data into a co-managed Oracle Database service.

    Note

    Oracle SQL Connector for HDFS is supported only for connecting to a co-managed Oracle Database service. It is not supported for connecting to Oracle Autonomous Database.
  • Oracle Instant Client for Linux

    Oracle Instant Client enables development and deployment of applications that connect to Oracle Database.

Set TNS Settings for Connecting to a Database

Configuring Big Data Service to Run Connector Examples

Before running the examples included with your Oracle Shell for Hadoop Loaders installation, perform the following configuration steps. Use the oracle operating system user.

Running the Connector Examples

Run these examples to see different ways to load data.