Big Data Tools on Windows 10
This series provides detailed tutorials about installing big data tools such as Hadoop, Spark, Sqoop, Zeppelin, Hive, Sqoop, etc. on Windows 10 using Java and Windows native libraries.
* Logos used in the picture are registered trademarks of Apache or Microsoft.
Hive 3.1.2 was released on 26th Aug 2019. It is still the latest 3.x release and works with Hadoop 3.x.y releases. In this article, I’m going to provide step by step instructions about installing Hive 3.1.2 on Windows 10. * Logos are registered trademarks of Apache Hive and Microsoft Windows.
Install Apache Spark 3.0.0 on Windows 10
Spark 3.0.0 was release on 18th June 2020 with many new features. The highlights of features include adaptive query execution, dynamic partition pruning, ANSI SQL compliance, significant improvements in pandas APIs, new UI for structured streaming, up to 40x speedups for calling R user-defined ...
This detailed step-by-step guide shows you how to install the latest Hadoop v3.3.0 on Windows 10. It leverages Hadoop 3.3.0 winutils tool and WSL is not required. This version was released on July 14 2020. It is the first release of Apache Hadoop 3.3 line. There are significant changes compared with Hadoop 3.2.0, such as Java 11 runtime support, protobuf upgrade to 3.7.1, scheduling of opportunistic containers, non-volatile SCM support in HDFS cache directives, etc.
This article provides detailed steps about how to compile and build Hadoop (incl. native libs) on Windows 10. The following guide is based on Hadoop release 3.2.1. *The yellow elephant logo is a registered trademark of Apache Hadoop; the blue window logo is registered trademark of Microsoft.
This detailed step-by-step guide shows you how to install the latest Hadoop (v3.2.1) on Windows 10. It also provides a temporary fix for bug HDFS-14084 (java.lang.UnsupportedOperationException INFO).
In this article, I’m going to demo how to install Hive 3.0.0 on Windows 10. Before installation of Apache Hive, please ensure you have Hadoop available on your Windows environment. We cannot run Hive without Hadoop. I recommend to install Hadoop 3.x to work with Hive 3.0.0. There are two ...
Install Apache Sqoop in Windows
This page summarizes the steps required to install Apache Sqoop (v1.4.7) in Windows 10 environment. Sqoop is an ETL tool for Hadoop,which is designed to efficiently transfer data between structured (RDBMS), semi-structured (Cassandra, Hbase and etc.) and unstructured data sources (HDFS).
This page summarizes the steps to install Hadoop 3.0.0 on your Windows environment. Reference page: https://wiki.apache.org/hadoop/Hadoop2OnWindows https://hadoop.apache.org/docs/r1.2.1/cluster_setup.html info A newer version of installation guide for latest Hadoop 3.2.1 is available. I ...
Install Zeppelin 0.7.3 on Windows
This post summarizes the steps to install Zeppelin 0.7.3 in Windows environment. GIT Bash Command Prompt Windows 10 Download the latest binary package from the following website: http://zeppelin.apache.org/download.html In my case, I am saving the file to folder: F:\DataAnalytics Open ...