Mini projects

2023

An Alternative to the Log-Likelihood with Entropic Optimal Transport

less than 1 minute read

Published:

This paper explores the entropic optimal transport (EOT) loss and its estimator in parameter estimation, comparing its advantages over traditional likelihood methods, such as improved robustness, faster convergence, and resilience to bad local optima, with a focus on theoretical justification and experimental validation in Gaussian Mixture Models.

See paper | GitHub Repository

Discrete Morse Theory for Relative Cosheaf Homology

less than 1 minute read

Published:

This paper aims to generalize discrete Morse theory in the context of relative cosheaf homology on filtrations of finite simplicial complexes, enabling faster computations. These methods are extended to persistent cosheaf homology for longer filtrations.

See paper

Robustness in Neural ODEs and SDEs

less than 1 minute read

Published:

Recent studies show that Neural ODEs are more robust against adversarial attacks than traditional DNNs, but as complexity increases, concerns about robustness and expressivity arise, prompting exploration of stochastic noise regularization.

See paper | GitHub Repository

2022

Spectral Methods for Clustering in Finance

less than 1 minute read

Published:

This report gathers tools from spectral graph theory to analyze stock market relation graphs, focusing on spectral embedding for positioning companies in Euclidean space and exploring graph entropy to classify graphs and detect regime changes, with a generalization to directed weighted graphs and in-depth explanations of the underlying concepts and algorithms.

See paper

2021

Forest Classification - Kaggle Challenge

less than 1 minute read

Published:

This Kaggle challenge involved a classification problem (with 7 different classes) based on a dataset of forest parcels. The data consisted of 55 columns, including 11 numerical variables and 2 categorical variables (with 4 and 40 classes, respectively). To tackle this problem, we employed strategies detailed chronologically in this report. After an initial data exploration phase and attempts at dimensionality reduction, we tested several classic algorithms and proceeded with optimizations where possible.

See paper

2019