Untangling Apache Hadoop YARN

Part 1: Cluster and YARN Basics
Ray Chiang is a Software Engineer at Cloudera. Dennis Dawson is a Senior Technical Writer at Cloudera.
Categories: Hadoop, MapReduce, YARN

In this multipart series, fully explore the tangled ball of thread that is YARN. YARN (Yet Another Resource Negotiator) is the resource management layer for the Apache Hadoop […]

Using Apache Hive on Docker

Apache Hive is a data warehouse framework for storing, managing, and querying large data sets. The Hive query language, HiveQL, is a SQL-like language. Hive stores data in HDFS by default, and a Hive table may be used to define structure on the data. Hive supports two kinds of tables: managed tables and external tables. A managed table is […]