SamayYantra

Weather Prediction of Beutenberg using time series forecasting with deep learning

Samay Yantra

GOALS of the Project

In ancient times, people looked at the sky, felt the humidity, and weighed several other factors to predict the weather of the coming days, and many of those predictions proved accurate. Today we can measure so many weather variables that prediction has become too complex for the human brain to handle. Deep learning takes the next step and brings ease to this domain: time series forecasting with deep neural networks pushes the frontier, letting us forecast many future attributes from historical data. The predictions are not the only output; the data can also feed more advanced analysis and research.

The secondary goal of the project is to apply cutting-edge MLOps to real problems.

The tertiary goal of the project is to implement it in traditional object-oriented Python rather than Jupyter Notebooks, so that it resembles real-world ML codebases and we can learn accordingly.

Technology

1. PyTorch

2. Scikit-Learn

3. Pandas

4. Numpy

5. MLflow

Data Collection Process :

The raw data was recorded by the weather station of the Max Planck Institute for Biogeochemistry in Jena, Germany. The Jena Weather dataset is made up of many different quantities (such as air temperature, atmospheric pressure, humidity, wind direction, and so on) that were recorded every 10 minutes over several years. This dataset covers data from January 1st, 2004 to December 31st, 2020. The data used here is a copy published for academic purposes as a Kaggle dataset. Link: kaggle/Weather Station Beutenberg Dataset. The primary data is stored as a single .csv file, which is later processed into a processed.csv file used for training.
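To make the 10-minute cadence concrete, here is a minimal sketch of how such a series could be thinned to hourly readings and cut into (input, target) windows for supervised training. The function name `make_windows`, the window sizes, and the synthetic readings are assumptions made for illustration; the project's own preprocessing may differ.

```python
import numpy as np


def make_windows(series: np.ndarray, input_len: int, horizon: int):
    """Split a 1-D series into (inputs, targets) pairs for supervised training."""
    if series.ndim != 1:
        raise ValueError("expected a 1-D series")
    n_samples = len(series) - input_len - horizon + 1
    if n_samples <= 0:
        raise ValueError("series is too short for the requested window sizes")
    # Each row of `inputs` holds `input_len` consecutive readings.
    inputs = np.stack([series[i : i + input_len] for i in range(n_samples)])
    # Each row of `targets` holds the `horizon` readings that follow that input.
    targets = np.stack(
        [series[i + input_len : i + input_len + horizon] for i in range(n_samples)]
    )
    return inputs, targets


# Synthetic stand-in for one day of 10-minute readings (6 per hour * 24 h = 144).
readings = np.arange(144, dtype=np.float32)

# Keep one reading per hour (every 6th sample); an assumed, optional thinning step.
hourly = readings[::6]

# Use 6 hours of history to predict the next hour.
X, y = make_windows(hourly, input_len=6, horizon=1)
print(X.shape, y.shape)
```

Thinning before windowing keeps the number of training samples manageable, at the cost of discarding sub-hourly variation.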

Special Note :

The data is stored using DVC (Data Version Control), so the repository can be used flexibly: the data is not added directly to the repo but fetched from a remote source, e.g. AWS S3, Google Drive, etc. In this case, the data is stored on Google Drive.

Directory Structure :

The project follows a strict data science project structure.

.
└── root/
    ├── .dvc/
    ├── config/
    ├── mlruns/
    ├── models/
    ├── notebooks/
    ├── results/
    └── src/
        ├── data
        ├── features
        ├── models
        └── visualization
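As a quick sanity check after cloning, a small script can report which of the directories listed above are missing. This is only a sketch: the helper `missing_dirs` is hypothetical and not part of the repository, and the demo runs against a temporary directory rather than a real checkout.

```python
import tempfile
from pathlib import Path
from typing import List

# Directory names copied from the tree above, relative to the project root.
EXPECTED_DIRS = [
    ".dvc", "config", "mlruns", "models", "notebooks", "results",
    "src/data", "src/features", "src/models", "src/visualization",
]


def missing_dirs(root: Path) -> List[str]:
    """Return the expected directories that do not exist under `root`."""
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]


# Demo: build every directory except the last one, then run the check.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    for name in EXPECTED_DIRS[:-1]:
        (root / name).mkdir(parents=True)
    missing = missing_dirs(root)
    print(missing)
```

Against a real checkout, the same function could be called as `missing_dirs(Path("."))` from the root directory.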

Installation and Usage :

Installation

Create a Virtual Environment : Tutorial
Clone the repository by running this command.

git clone https://github.com/sagnik1511/Tabular-AutoML.git

Open the directory with cmd.
Copy this command in terminal to install dependencies.

pip install -r requirements.txt

Installing the packages from requirements.txt may fail with an error if your MS Visual C++ Build Tools are outdated. You can fix this problem using this.

Approach :

Go to the root directory using cd command.
The first step is to download the actual data into the project. Copy and run this command.

dvc pull

If you want to run the training process, simply change the configuration in config/pt_training.yaml and then run this command. Keep in mind that you have to stay in the root directory.

python -m src.training.pytorch_trainer
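For readers who want a feel for what a config-driven PyTorch training run involves, the following is a minimal sketch, not the repository's trainer. The `config` keys, the `TinyForecaster` class, and the synthetic sine-wave data are all assumptions; the actual keys in config/pt_training.yaml are not shown in this README.

```python
import torch
from torch import nn

# Hypothetical settings standing in for values read from a YAML config file.
config = {"input_len": 6, "hidden_size": 16, "lr": 1e-2, "epochs": 5}


class TinyForecaster(nn.Module):
    """One-layer LSTM followed by a linear head that predicts the next value."""

    def __init__(self, hidden_size: int):
        super().__init__()
        self.lstm = nn.LSTM(input_size=1, hidden_size=hidden_size, batch_first=True)
        self.head = nn.Linear(hidden_size, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, input_len, 1) -> prediction: (batch, 1)
        out, _ = self.lstm(x)
        return self.head(out[:, -1, :])  # use the last time step only


torch.manual_seed(0)

# A synthetic sine wave stands in for a single weather variable.
series = torch.sin(torch.linspace(0, 12.0, 200))
n = len(series) - config["input_len"]
X = torch.stack([series[i : i + config["input_len"]] for i in range(n)]).unsqueeze(-1)
y = series[config["input_len"]:].unsqueeze(-1)

model = TinyForecaster(config["hidden_size"])
optimizer = torch.optim.Adam(model.parameters(), lr=config["lr"])
loss_fn = nn.MSELoss()

losses = []
for epoch in range(config["epochs"]):
    optimizer.zero_grad()
    loss = loss_fn(model(X), y)  # full-batch training for simplicity
    loss.backward()
    optimizer.step()
    losses.append(loss.item())
    print(f"epoch {epoch + 1}: mse={loss.item():.4f}")
```

Keeping hyperparameters in one dictionary mirrors the idea of editing a config file instead of the training code.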

Further usage will be updated soon…

Results:

You can visit the reports directory, where all the runs are stored. Currently, due to some privacy issues, the MLflow runs are not shared here.

Thanks for visiting 😀

Do STAR if you find it useful


Stolen programs from Rohit Mehra's Time Machine from Krish(2006) for future weather prediction at Beutenberg
