Skip to content

oracle/oci-hdfs-connector

Repository files navigation

Oracle Cloud Infrastructure HDFS Connector for Object Storage

About

oci-hdfs-connector provides the public HDFS connector that integrates with the Oracle Cloud Infrastructure Object Storage Service.

The project is open source and maintained by Oracle Corp. The home page for the project is here.

Installation

See the documentation for details.

Examples

Examples can be found here.

Documentation

Full documentation, including prerequisites and installation and configuration instructions, can be found here.

HDFS Connector provides support for delegation token configurator that injects a delegation token on every request. This can be done by setting the delegation token file path in an environment variable OCI_DELEGATION_TOKEN_FILE or by setting the Hadoop property fs.oci.delegation.token.filepath

API reference can be found here.

Help

See the “Questions or Feedback?” section here.

Version

The first three numbers of HDFS connector version represent the Hadoop version that the connector was compiled and tested against. The next three numbers are for the platform, major and minor versions respectively.

  • Minor version is incremented when there are no breaking changes
  • Major version is incremented when there are breaking changes which might affect some users
  • Platform version is incremented when there are bigger important breaking changes which would affect many users

For example, in HDFS connector version 3.3.1.x.y.z, 3.3.1 is for the Hadoop version being used and x.y.z refers to the {platform}.{major}.{minor} version of the HDFS connector.

Build

Building HDFS connector relies on Maven artifacts that are provided by the Java SDK. To obtain the artifacts, you must download the Java SDK and build it locally. You can then build the HDFS connector.

Important: The Java SDK file version that you download from the Oracle Releases page must match the HDFS connector version, which you can find in the hdfs-connector/pom.xml file in the tag block that has the groupId attribute com.oracle.oci.sdk.

Changes

See CHANGELOG.

Known Issues

You can find information on any known issues with the connector here and under the “Issues” tab of this GitHub repository. You can find information on any known issues with the SDK here and under the “Issues” tab of this GitHub repository.

Potential data corruption issue for OCI HDFS Connector with RefreshableOnNotAuthenticatedProvider

Details: If you are using version 3.2.1.1 or earlier of the OCI HDFS Connector and you use a RefreshableOnNotAuthenticatedProvider (e.g. InstancePrincipalsCustomAuthenticator, or generally for Resource Principals or Instance Principals) you may be affected by silent data corruption.

Workaround: Update the OCI HDFS Connector to version 3.2.1.3 or later. For more information about this issue and workarounds, see Potential data corruption issue for OCI HDFS Connector with RefreshableOnNotAuthenticatedProvider.

Direct link to this issue: Potential data corruption issue with OCI HDFS Connector on binary data upload with RefreshableOnNotAuthenticatedProvider

Contributing

This project welcomes contributions from the community. Before submitting a pull request, please review our contribution guide

Security

Please consult the security guide for our responsible security vulnerability disclosure process

License

Copyright (c) 2016, 2024, Oracle and/or its affiliates. All rights reserved. This software is dual-licensed to you under the Universal Permissive License (UPL) 1.0 as shown at https://oss.oracle.com/licenses/upl or Apache License 2.0 as shown at http://www.apache.org/licenses/LICENSE-2.0. You may choose either license.

See LICENSE for more details.