hive

Introduction to Hive Bucketed Table

2022-08-24

Start Hive Beeline CLI

This code snippet provides an example of starting the Hive Beeline CLI on Linux. Beeline is the successor of Hive CLI. In the shell script, the environment variable $HIVE_HOME is the home folder of the Hive installation on the system. In a cluster environment, it usually refers to the Hive client installation on an edge server.

Output:

```
$HIVE_HOME/bin/beeline -u jdbc:hive2://
Connecting to jdbc:hive2://
Hive Session ID = 65a40cd9-02ce-4965-93b6-cff9db461b70
Connected to: Apache Hive (version 3.1.3)
Driver: Hive JDBC (version 3.1.3)
Transaction isolation: TRANSACTION_REPEATABLE_READ
Beeline version 3.1.3 by Apache Hive
0: jdbc:hive2://>
```

2022-08-20

Configure HiveServer2 to Enable Transactions (ACID Support)

2022-08-20

Hive SQL - Merge Statement on ACID Tables

Hive supports the standard ANSI SQL MERGE statement from version 2.2. However, it can only be applied to tables that support ACID transactions. To learn more about ACID support in Hive, refer to the article: Hive ACID Inserts, Updates and Deletes with ORC.

Sample table

This code snippet merges into a sample table named testdb.crudtable, which has two records before the merge. The staging table was created using the following statements:

```
create table crudtablestg (id int, value string, op string);
insert into crudtablestg values (1,'AA','U'),(2,'B','D'),(3,'C', 'I');
```

It has one additional column named op to indicate the delta changes:

- U: updates
- D: deletes
- I: inserts (i.e. new records)

Syntax

```
MERGE INTO <target table> AS T USING <source expression/table> AS S
ON <boolean expression1>
WHEN MATCHED [AND <boolean expression2>] THEN UPDATE SET <set clause list>
WHEN MATCHED [AND <boolean expression3>] THEN DELETE
WHEN NOT MATCHED [AND <boolean expression4>] THEN INSERT VALUES <value list>
```

Output

After the merge, record 1 is updated, record 2 is deleted, and record 3 is inserted into the table.
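As a minimal sketch, a merge that applies the staging rows to the target could look like the statement below. The excerpt does not show the schema of testdb.crudtable, so the column layout (id int, value string) is an assumption, as is using id as the join key.

```sql
-- Assumption: testdb.crudtable is an ACID (transactional) table
-- with columns (id int, value string); the excerpt does not show its DDL.
MERGE INTO testdb.crudtable AS T
USING crudtablestg AS S
ON T.id = S.id
-- Rows flagged 'U' overwrite the matching target row.
WHEN MATCHED AND S.op = 'U' THEN UPDATE SET value = S.value
-- Rows flagged 'D' remove the matching target row.
WHEN MATCHED AND S.op = 'D' THEN DELETE
-- Rows flagged 'I' that have no match are added as new records.
WHEN NOT MATCHED AND S.op = 'I' THEN INSERT VALUES (S.id, S.value);
```

With the three staging rows above, this sketch would update id 1, delete id 2, and insert id 3, which matches the described outcome. Note that the column on the left side of SET is written without the T alias.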

2022-08-19

Error: Failed to load class org.apache.spark.sql.hive.thriftserver.SparkSQLCLIDriver

2020-12-27

Create Temporary Table - Hive SQL

Create Table as SELECT - Hive SQL

Create Bucketed Sorted Table - Hive SQL

Create Partitioned Table - Hive SQL

Create Table Stored as CSV, TSV, JSON Format - Hive SQL

Create Table with Parquet, Orc, Avro - Hive SQL

Create, Drop, and Truncate Table - Hive SQL

2020-08-24

Create, Drop, Alter and Use Database - Hive SQL

2020-08-24

Apache Hive 3.1.2 Installation on Windows 10

2020-08-10

Hive: Exception in thread "main" java.lang.NoSuchMethodError: com.google.common.base.Preconditions.checkArgument(ZLjava/lang/String;Ljava/lang/Object;)V

2020-04-20

Differences between Hive External and Internal (Managed) Tables

2020-02-22

Schema Merging (Evolution) with Parquet in Spark and Hive

2020-02-02

Select top N records in SQL / Hive

The syntax for selecting the top N records differs slightly between databases, and it may also differ from the ISO standard.
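To illustrate the difference, the sketch below returns 10 rows from a hypothetical table named my_table (the table name and the id column are placeholders, not taken from the article) in three common dialects.

```sql
-- Hive (also MySQL and PostgreSQL): LIMIT clause
SELECT * FROM my_table ORDER BY id LIMIT 10;

-- SQL Server: TOP keyword
SELECT TOP 10 * FROM my_table ORDER BY id;

-- ISO SQL:2008 form, supported by several databases such as Oracle 12c+ and Db2
SELECT * FROM my_table ORDER BY id FETCH FIRST 10 ROWS ONLY;
```

The ORDER BY clause is included because, without it, which N rows come back is generally not guaranteed.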

2019-11-18

Big Data Tools on Windows via Windows Subsystem for Linux (WSL)

2019-05-19

Sqoop

Apache Hive 3.1.1 Installation on Windows 10 using Windows Subsystem for Linux

2019-05-18

.NET for Apache Spark Preview with Examples

2019-04-26

HiveServer2 Cannot Connect to Hive Metastore Resolutions/Workarounds

2019-04-15

Configure a SQL Server Database as Remote Hive Metastore

2019-04-14

Connect to Hive via HiveServer2 JDBC Driver

2019-04-14

Java Programming

Spark - Save DataFrame to Hive Table

2019-03-27

Apache Hive 3.0.0 Installation on Windows 10 Step by Step Guide

2019-03-25