By using this site, you acknowledge that you have read and understand our Cookie policy, Privacy policy and Terms .

Welcome

Click the button below to become a member of our professional community.

person_add Register account_circle Log in


Hot sites

.NET Framework

Everything about .NET framework.

Java Programming

Java Programming related.

Angular

Angular related

ASP.NET Core

Blog posts about ASP.NET Core

Scripting

PowerShell, Bash, ksh, sh, Perl and etc. 

Teradata

Tutorials and informations about Teradata.

Hadoop

Articles about Apache Hadoop

Spark

Articles about Apache Spark

Power BI

Posts and tutorials about Power BI.

Code snippets

Code snippets for various programming languages/frameworks.

local_offer hadoop local_offer yarn local_offer hdfs

visibility 24558
comment 30
thumb_up 0
access_time 2 years ago

This page summarizes the steps to install Hadoop 3.0.0 in your Windows environment. Reference page: https://wiki.apache.org/hadoop/Hadoop2OnWindows ...

open_in_new View

local_offer .net core local_offer entity-framework

visibility 11997
comment 4
thumb_up 0
access_time 2 years ago

SQLite is a self-contained and embedded SQL database engine. In .NET Core, Entity Framework Core provides APIs to work with SQLite. This page provides sample code to create a SQLite database using package Microsoft.EntityFrameworkCore.Sqlite . Create sample project ...

open_in_new View

local_offer angular local_offer lite-log

visibility 8486
comment 3
thumb_up 0
access_time 2 years ago

Problem When you follow Angular CLI installation guide in Windows, you may encounter the following error: ng is not recognized as an internal or external command The resolutions are available in the following link: ...

open_in_new View

local_offer python local_offer spark

visibility 10654
comment 0
thumb_up 0
access_time 11 months ago

This post shows how to derive new column in a Spark data frame from a JSON array string column. I am running the code in Spark 2.2.1 though it is compatible with Spark 1.6.0 (with less JSON SQL functions). Prerequisites Refer to the following post to install Spark in Windows. ...

open_in_new View

local_offer SQL Server local_offer python local_offer spark local_offer pyspark

visibility 6566
comment 0
thumb_up 0
access_time 9 months ago

Spark is an analytics engine for big data processing. There are various ways to connect to a database in Spark. This page summarizes some of common approaches to connect to SQL Server using Python as programming language. ...

open_in_new View

local_offer hadoop local_offer hive

visibility 9298
comment 11
thumb_up 0
access_time 9 months ago

If you have been following my website, you would know I’ve published a number of articles about installing big data tools/framewo...

open_in_new View

local_offer python local_offer spark local_offer pyspark local_offer hive

visibility 6741
comment 0
thumb_up 0
access_time 8 months ago

From Spark 2.0, you can easily read data from Hive data warehouse and also write/append new data to Hive tables. This page shows how to operate with Hive in Spark including: Create DataFrame from existing Hive table Save DataFrame to a new Hive table Append data ...

open_in_new View

local_offer python local_offer spark local_offer pyspark

visibility 2579
comment 0
thumb_up 0
access_time 8 months ago

Data partitioning is critical to data processing performance especially for large volume of data processing in Spark. Partitions in Spark won’t span across nodes though one node can contains more than one partitions. When processing, Spark assigns one task for each partition and each worker threa...

open_in_new View

local_offer hadoop local_offer linux local_offer WSL

visibility 7590
comment 18
thumb_up 0
access_time 7 months ago

In my previous post , I showed how to configure a single node Hadoop instance on Windows 10. The steps are not too difficult to follow if you have Java programming backgr...

open_in_new View

local_offer python local_offer spark local_offer pyspark

visibility 3544
comment 0
thumb_up 0
access_time 5 months ago

In Spark, SparkContext.parallelize function can be used to convert Python list to RDD and then RDD can be converted to DataFrame object. The following sample code is based on Spark 2.x. In this page, I am going to show you how to convert the following list to a data frame: data = [(...

open_in_new View

...

local_offer kontext local_offer Azure

visibility 81
comment 0
thumb_up 0
access_time 2 days ago

Kontext website is now upgraded to ASP.NET Core 3.0 with many new features.

open_in_new View

local_offer powershell

visibility 3
comment 0
thumb_up 0
access_time 3 days ago

This code snippet shows how to calculate time differences.

open_in_new View

local_offer teradata local_offer SQL

visibility 2
comment 0
thumb_up 0
access_time 3 days ago

This code snippet shows how to convert string to date.

open_in_new View

local_offer hadoop local_offer shell

visibility 0
comment 0
thumb_up 0
access_time 3 days ago

The following code snippet shows how to list and kill Hadoop jobs including (MapReduce and YARN jobs).

open_in_new View

local_offer mssql local_offer t-sql

visibility 4
comment 0
thumb_up 1
access_time 3 days ago

In different databases, the syntax of selecting top N records are slightly different. They may also differ from ISO standards.

open_in_new View

local_offer teradata local_offer SQL

visibility 5
comment 0
thumb_up 0
access_time 3 days ago

In different databases, the syntax of selecting top N records are slightly different. They may also differ from ISO standards.

open_in_new View

local_offer SQL local_offer hive

visibility 24
comment 0
thumb_up 0
access_time 3 days ago

In different databases, the syntax of selecting top N records are slightly different. They may also differ from ISO standards.

open_in_new View

local_offer python local_offer spark-2-x

visibility 6
comment 0
thumb_up 0
access_time 3 days ago

In Spark, SparkContext.parallelize function can be used to convert list of objects to RDD and then RDD can be converted to DataFrame object through SparkSession.

open_in_new View

local_offer scala local_offer spark-2-x

visibility 6
comment 0
thumb_up 0
access_time 3 days ago

In Spark, SparkContext.parallelize function can be used to convert list of objects to RDD and then RDD can be converted to DataFrame object through SparkSession.

open_in_new View

local_offer hadoop local_offer shell

visibility 3
comment 0
thumb_up 0
access_time 3 days ago

Hadoop provides a number of CLIs that can be used to perform many tasks/activities. This code snippet shows you how to check file/folder size in HDFS.

open_in_new View