Analytics and Application Group Project
Welcome to the DataDynamos project! This repository contains all the code and models developed as part of our data science project, covering data preparation, analysis, and prediction.
We have provided all our code and models within a single, comprehensive Jupyter Notebook file: DataScience.ipynb. This notebook is a collectionof code and showcases the entire process, including:
- Data Preparation: Loading, cleaning, and preprocessing data for analysis.
- Data Analysis: Exploratory data analysis and visualization (differentiation of sites, kpis, clustering).
- Data Prediction: Building and evaluating predictive models.
The notebook is designed for easy navigation, providing some explanations and visual outputs at each stage.
To run the notebook and replicate our results, follow these steps:
-
Clone this repository:
git clone https://github.com/Zeyd263/DataDynamos.git cd DataDynamos -
Ensure you have
condainstalled on your system to manage dependencies.
We have included an environment file (environment.yml) to simplify dependency management. This file ensures that all necessary packages are correctly installed.
-
Create and activate the environment:
conda env create -f environment.yml conda activate Dynamos
-
Launch Jupyter Notebook:
jupyter notebook DataScience.ipynb
This will load the environment Dynamos with all required dependencies.
This project is part of a university group project created for the course Analytics and Applications (AA) by the following contributors:
- Abdesselam Zeyd Boutchich
- Carlotta May
- Marlon Spiess
- Steffen Niesmann
- Yakup Kula
Have fun exploring our project!