Project 2: Applied Data Analysis and Machine Learning (FYS-STK3155)

This repository contains the code, notebooks, and report for Project 2 in the course FYS-STK3155 - Applied Data Analysis and Machine Learning at the University of Oslo, during the Autumn 2025 semester. The project builds on concepts from Project 1 by shifting from analytical solutions (e.g., matrix inversion in regression) to iterative optimization techniques and introduces neural networks for both regression and classification tasks.

Group Members

Francesco Giuseppe Minisini — francegm@uio.no
Stan Daniels
Carolina Ceccacci
Teresa Ghirlandi

Overview

The primary goal of this project is to implement and analyze optimization algorithms and machine learning models from scratch, focusing on regression and binary classification problems. We develop gradient-based optimizers to solve ordinary least squares (OLS) and Ridge regression, then extend this to a custom feed-forward neural network (FFNN) with back-propagation. For classification, we adapt the FFNN and implement logistic regression, testing them on real-world datasets.

Key objectives include:

Implementing various optimizers and analyzing their convergence rates, stability, and sensitivity to hyperparameters like learning rate (η), momentum, and regularization (λ).
Building an FFNN with flexible architecture, activation functions, and output layers for regression (linear) vs. classification (sigmoid/softmax).
Evaluating models on synthetic and real datasets, comparing custom implementations to PyTorch baselines.
Investigating trade-offs: e.g., adaptive optimizers like Adam converge faster but may overfit without regularization; ReLU activations prevent vanishing gradients better than sigmoid but can cause "dying ReLU" issues.
Hyperparameter tuning: Learning rates, batch sizes, epochs, λ, and network depths.

The project highlights the evolution from linear models to deep learning, emphasizing computational efficiency and practical challenges like gradient explosion/vanishing.

Datasets

Regression Datasets:
- Both clean and noisy Runge Function
Classification Dataset:
- Hand written digits, MINST dataset.

Data preprocessing includes scaling (StandardScaler), train-test splits (80/20), and one-hot encoding for classification.

Methods Implemented

Optimizers: Custom implementations using NumPy and Autograd for gradients. Includes plain GD (full batch, slow for large data), SGD (mini-batches for noise injection and faster iterations), Momentum (accelerates in relevant directions), Adagrad (adaptive per-parameter learning rates), RMSprop (improves Adagrad for non-stationary objectives), and Adam (combines Momentum and RMSprop with bias correction).
Feed-Forward Neural Network (FFNN): A modular class supporting arbitrary layers (e.g., [input_dim, hidden1, hidden2, output_dim]). Forward pass computes activations; back-propagation updates weights/biases via chain rule. Supports sigmoid (smooth but vanishing gradients), ReLU (fast, non-linear), and Leaky ReLU (fixes dying ReLU). Regularization via L2 penalty.
Logistic Regression: SGD-based with sigmoid output and cross-entropy loss; includes L2 regularization to prevent overfitting.
Evaluation Metrics:
- Regression: MSE (mean squared error), R² (coefficient of determination; 1 is perfect fit).
- Classification: Accuracy, precision/recall, confusion matrices (via scikit-learn for visualization).
Libraries Used: Core computations in NumPy/SciPy; plotting with Matplotlib; benchmarking with scikit-learn; automatic differentiation with Autograd.

Repository Structure

The repository is organized for modularity: core models in separate files, task-specific scripts for experiments, utilities for shared functions, and output directories for results.

Here is the complete file tree (recursively listed based on the repository contents):

├── Code
│   ├── MINST_complexity.py
│   ├── __init__.py
│   ├── autograd_gradient_check.py
│   ├── codeexport.config.psd1
│   ├── comparison_NN_vs_linear_methods.py
│   ├── complexity_analysis.py
│   ├── config.py
│   ├── convergence_and_complexity_comparison.py
│   ├── gradient_descent_analysis.py
│   ├── l1_l2_analysis.py
│   ├── plotting
│   │   ├── __init__.py
│   │   ├── plot_clasification_complexity.py
│   │   ├── plot_gd_alalysis.py
│   │   ├── plot_gdmethods_heatmaps.py
│   │   └── render_confusion_matrix.py
│   ├── src
│   │   ├── FFNN.py
│   │   ├── OLS_utils.py
│   │   ├── __init__.py
│   │   ├── activation_functions.py
│   │   ├── cost_functions.py
│   │   ├── scheduler.py
│   │   └── torch_utils.py
│   └── testing_specific_model.py
├── Makefile
├── README.md
├── Report.pdf
└── requirements.txt

No additional subdirectories or hidden files (e.g., .git) are present beyond this structure. The setup promotes reusability—e.g., FFNN can be imported independently.

Installation and Requirements

Python 3.8+.
Dependencies: numpy, scipy, matplotlib, scikit-learn, autograd.

Install with:

pip install -r requirements.txt

How to Run

Clone the repo:

git clone https://github.com/STK3155-25H/Project-2.git
cd Project-2

Execute tasks:
- Full run: make all.

To control the output directories, modify the config.py folder

Outputs save to figures/ and results/; console shows metrics.

Results Summary

Regression: Adam achieves lowest MSE (e.g., 0.03 on Franke) with η=0.005, λ=0.01; GD slowest. FFNN with 2 hidden layers beats Ridge on non-linear data (R² 0.92 vs. 0.81).
Classification: FFNN ~96% accuracy with ReLU (vs. sigmoid's 94% due to gradients); logreg ~93%. Heatmaps show λ>0.05 reduces overfitting.
See report.pdf for tables/figures (e.g., convergence: Adam plateaus in 50 epochs vs. GD's 200).

Name		Name	Last commit message	Last commit date
Latest commit History 110 Commits
Code		Code
.DS_Store		.DS_Store
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Project 2: Applied Data Analysis and Machine Learning (FYS-STK3155)

Group Members

Overview

Datasets

Methods Implemented

Repository Structure

Installation and Requirements

How to Run

Results Summary

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Project 2: Applied Data Analysis and Machine Learning (FYS-STK3155)

Group Members

Overview

Datasets

Methods Implemented

Repository Structure

Installation and Requirements

How to Run

Results Summary

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages