Project information
- Category: Big Data Skillset
- Description: Midterm Exam Course
- Project date: October, 2022
- Project URL: Github
Implement Big Data Tools to simple problem solving
The Aglonema Farmers Group named Javaglonema Millennial which is located in Paten, Tridadi, Sleman, DI Yogyakarta. Javaglonema Milenial already has 83 farmer members. The farmer group requires integrated plant asset data owned by each member to estimate the achievement of the overall target number of plants.
Big Data Skillset:- Extract & Transform Data.
- Load Data.
- Perform Data Analysis.
- Data storytelling.
- Extract & Transform Data
- Load Data
- Perform Data Analysis
- Data storytelling
Tools: Microsoft Excel/Google Spreadsheet
Extract data from data retrieval in July 2022 with csv format.
Tools: Java Mapreduce, Apache Hadoop, Apache Pig
Load data to hadoop environment use pig language.
pig -x local
LOAD 'D:\BigData\Dataset\aglonema.csv' (folder directory) as (nama_anggota:chararray, jenis_aglonema:chararray, jumlah:int);
dump aglonema;
Tools: Java Mapreduce, Apache Hadoop, Apache Pig
see the number of each type of plant sorted by the number of plant descendingly
grunt> asetinfo = FOREACH(GROUP aglonema BY jenis_aglonema) GENERATE group AS nama_aglonema, SUM(aglonema.jumlah) as jumlah_per_jenis;
grunt> describe asetinfo;
asetinfo: {nama_aglonema: chararray, jumlah_per_jenis: long}
grunt> sorted = ORDER asetinfo BY jumlah_per_jenis DESC;
grunt> dump sorted;
see plant assets owned by each member
grunt> asettani = GROUP aglonema BY nama_anggota;
grunt> dump asettani;
The largest aglonema plant production asset in Javaglonema Millennial is
red legacy with 107 plants and the smallest aglonema plant production assets
is red cherry with 1 plant.
The member who has the largest number and type of plants is Agung Prasetya. On the other hand, the member who has the fewest number and type of plants is Zamroni.