Skip to content

Latest commit

 

History

History
23 lines (15 loc) · 1.03 KB

README.md

File metadata and controls

23 lines (15 loc) · 1.03 KB

Fundamental-Pig

This repository is used for learning Apache Pig on fundamental level.

Apache Pig

Apache Pig is a high-level platform for creating programs that run on Apache Hadoop. The language for this platform is called Pig Latin. Pig can execute its Hadoop jobs in MapReduce, Apache Tez, or Apache Spark.

Apache Pig Official Website

https://pig.apache.org/

MapReduce

A MapReduce job usually splits the input data-set into independent chunks which are processed by the map tasks in a completely parallel manner. The framework sorts the outputs of the maps, which are then input to the reduce tasks. Typically both the input and the output of the job are stored in a file-system.

Hortonworks Sandbox Virtual Quickstart

Used Hortonworks Sandbox 2.2 version for runnig pig scrips.

Hortonworks Sandbox link

https://hortonworks.com/downloads/

Youtube Tutorial Link

You can access youtube tutorial which created for this fundamental learning hands on. https://www.youtube.com/playlist?list=PLrwkeifIVAoTRflOnLYjmA-H5u8bpQ1Fa