Our Process

Explore the process of our project!

The process of our project generally follows the Data Science process taught in DS105. However, due to the nature of our project requiring the usage of multiple APIs, we have decided to include “Integration” as the third step of our project. As such, our process follows the structure as shown below:

Our Process Diagram

The sections below will go into further detail about how each of the different types of data - Pokémon, Biome and Weather - are collected, cleaned, and analyzed for our project.


Tip: Click on the headers below to expand and explore our process in detail.

Pokémon Data
Pokémon Data Collection

This section details the methodology for retrieving and processing Pokémon data from the PokeAPI. The approach ensures efficient data collection, structuring, and storage while utilizing parallel processing techniques to optimize performance.

Asynchronous API Requests with ThreadPoolExecutor
Extracting and Organizing Pokémon Identifiers
Comprehensive Data Collection for Each Pokémon
Error Handling
Pokémon Data Processing
Data Extraction and Transformation
Statistical Calculations
Dominant Colour Extraction from Images
Data Storage and Accessibility
Considerations and Challenges
Biome Data
Data Collection

This section focuses on collecting and analyzing biome data by determining the center coordinates of each ecoregion for the 14 biomes present in the Ecoregions 2017 Resolve dataset from Google Earth Engine (GEE). By identifying the central coordinates, we can use these coordinates to find out more data (e.g., weather data) using other APIs.

Use of a Service Account
Rationale for Using .py Script
Weather Data
Data Collection
Data Collection
Rationale
Data Processing
Integration
Visualisation