hadoop
52 items tagged with "hadoop"
Articles
Get Started with Apache Kylin - OLAP for Big Data
Hadoop 3.4.0 winutils for Windows 10 (x64)
Build Latest Hadoop on Windows 10 natively via Docker
Hadoop build error - Cound not find a SASL library (GSASL (gsasl) or Cyrus SASL (libsasl2)
Hive ACID Inserts, Updates and Deletes with ORC
Hive SQL - Aggregate Functions Overview with Examples
Install Hadoop 3.3.2 in WSL on Windows
Install Ambari 2.7.6 on Windows via WSL to Provision Hadoop Cluster
Install Hadoop 3.3.1 on Windows 10 Step by Step Guide
This detailed step-by-step guide shows you how to install the latest Hadoop v3.3.1 on Windows 10. It leverages Hadoop 3.3.1 winutils tool and WSL is not required. This version was released on June 15 2021.
Hadoop 3.3.1 winutils
Hive - Rename Table
Windows 10安装Hadoop 3.3.0教程
Install HBase in WSL - Pseudo-Distributed Mode
Detailed step-by-step guide about installing HBase 2.4.1 pseudo-distributed cluster with Hadoop 3.2.0 in Windows Subsystem for Linux (WSL) Ubuntu distro.
Hadoop Daemon Log Files Location
java.net.SocketException: Permission denied
Apache Hive 3.1.2 Installation on Linux Guide
Install Hadoop 3.3.0 on macOS
This article provides step-by-step guidance to install Hadoop 3.3.0 on macOS. Hadoop 3.3.0 was released on July 14 2020. It is the first release of Apache Hadoop 3.3 line. There are significant changes compared with Hadoop 3.2.0, such as Java 11 runtime support, protobuf upgrade to 3.7.1, scheduling of opportunistic containers, non-volatile SCM support in HDFS cache directives, etc.
Load File into HDFS through WebHDFS APIs
Apache Hive 3.1.2 Installation on Windows 10
Install Hadoop 3.3.0 on Linux
Install Hadoop 3.3.0 on Windows 10 Step by Step Guide
This detailed step-by-step guide shows you how to install the latest Hadoop v3.3.0 on Windows 10. It leverages Hadoop 3.3.0 winutils tool and WSL is not required. This version was released on July 14 2020. It is the first release of Apache Hadoop 3.3 line. There are significant changes compared with Hadoop 3.2.0, such as Java 11 runtime support, protobuf upgrade to 3.7.1, scheduling of opportunistic containers, non-volatile SCM support in HDFS cache directives, etc.
Hadoop 3.3.0 winutils
Install Hadoop 3.3.0 on Windows 10 using WSL
Hive: Exception in thread "main" java.lang.NoSuchMethodError: com.google.common.base.Preconditions.checkArgument(ZLjava/lang/String;Ljava/lang/Object;)V
Ingest Data into HDFS from NAS or Windows Shared Folder
Fix for Hadoop 3.2.1 namenode format issue on Windows 10
Compile and Build Hadoop 3.2.1 on Windows 10 Guide
Install Hadoop 3.2.1 on Windows 10 Step by Step Guide
This detailed step-by-step guide shows you how to install the latest Hadoop (v3.2.1) on Windows 10. It also provides a temporary fix for bug HDFS-14084 (java.lang.UnsupportedOperationException INFO).
How to Kill Running Jobs in Hadoop
The following code snippet shows how to list and kill Hadoop jobs including (MapReduce and YARN jobs).
List Hadoop running jobs
Hadoop provides a number of CLIs. hadoop job command can be used to retrieve running job list. You can also use YARN resource manager UI to view the jobs too.
Check HDFS folder size in Shell / Hadoop
Hadoop provides a number of CLIs that can be used to perform many tasks/activities. This code snippet shows you how to check file/folder size in HDFS.