INTERNETOFTHINGS JOBS         BIGDATA INDIA JOBS         Home         Register       Sign In


 
Company Info
AMAZON
Brisbane, CA, United States

Company Profile


Engineer Hive


col-narrow-left   

Job ID:

5366

Location:

Houston, TX, United States 

Category:

Big Data Developer, C++ #, Hadoop, Hive
col-narrow-right   

Job Views:

328

Posted:

06.12.2017
col-wide   

Job Description:

Apache Hive is a compute framework that provides a SQL-based interface for interacting with data in Hadoop, as well as other distributed file systems. Hive is the most widely adopted component by enterprise customers for their ETL workloads and mission-critical applications.

 

Recent and upcoming projects include:

  • Adding support for Spark as a new execution engine in Hive
  • Enabling Hive workloads to run against Kudu storage engine for improved consistency guarantees and increased performance
  • Implementing a memory manager for the Parquet file format to improve scalability and lower resource consumption

Job Requirements:

  • 8+ years of experience writing production-quality code (C/C++, Java, and/or Scala)
  • Experience with scalable distributed multi-node environments with understanding of scaling, performance and scheduling
  • Deep knowledge of system architecture, including process, memory, storage and networking management is highly desired
  • Experience with Hadoop ecosystem and related technologies or database internals. Hive, Spark, or HDFS development knowledge and committership is a strong plus.
  • Excellent communication skills



Home My Account Find Jobs Post Resumes Search Resumes Post Jobs Contact About Us Sitemap terms & cond Privacy policy