Project information

  • Category: Big Data Skillset
  • Description: Midterm Exam Course
  • Project date: October, 2022
  • Project URL: Github

Implement Big Data Tools to simple problem solving

The Aglonema Farmers Group named Javaglonema Millennial which is located in Paten, Tridadi, Sleman, DI Yogyakarta. Javaglonema Milenial already has 83 farmer members. The farmer group requires integrated plant asset data owned by each member to estimate the achievement of the overall target number of plants.

Big Data Skillset:
  1. Extract & Transform Data.
  2. Load Data.
  3. Perform Data Analysis.
  4. Data storytelling.
Step:
  1. Extract & Transform Data

  2. Tools: Microsoft Excel/Google Spreadsheet
    Extract data from data retrieval in July 2022 with csv format.


  3. Load Data

  4. Tools: Java Mapreduce, Apache Hadoop, Apache Pig
    Load data to hadoop environment use pig language.
    pig -x local
    LOAD 'D:\BigData\Dataset\aglonema.csv' (folder directory) as (nama_anggota:chararray, jenis_aglonema:chararray, jumlah:int);
    dump aglonema;


  5. Perform Data Analysis

  6. Tools: Java Mapreduce, Apache Hadoop, Apache Pig

    see the number of each type of plant sorted by the number of plant descendingly
    grunt> asetinfo = FOREACH(GROUP aglonema BY jenis_aglonema) GENERATE group AS nama_aglonema, SUM(aglonema.jumlah) as jumlah_per_jenis;
    grunt> describe asetinfo;
    asetinfo: {nama_aglonema: chararray, jumlah_per_jenis: long}
    grunt> sorted = ORDER asetinfo BY jumlah_per_jenis DESC;
    grunt> dump sorted;


    see plant assets owned by each member
    grunt> asettani = GROUP aglonema BY nama_anggota;
    grunt> dump asettani;


  7. Data storytelling

  8. The largest aglonema plant production asset in Javaglonema Millennial is red legacy with 107 plants and the smallest aglonema plant production assets is red cherry with 1 plant.

    The member who has the largest number and type of plants is Agung Prasetya. On the other hand, the member who has the fewest number and type of plants is Zamroni.