👋🏼 Hi! I'm Alberto!

👨🏻‍💻 I'm a Computer Vision and AI engineer with over two years of experience developing and optimizing AI models for real-world applications. My expertise spans multimodal learning, video understanding, and deploying scalable, production-ready models, shaped through hands-on research and industry experience.

🔬 Currently, I'm pursuing my Master's Thesis at the CVML lab at the National University of Singapore (NUS), working on advanced video moment retrieval models.

📚 Feel free to explore my website to learn more about my projects and insights from my journey in AI.


Publications

Projects

Welcome to my project portfolio! Here is a selection of the projects I have worked on. 🚀

CARLA-SB3-RL: An Open-Source Framework for Deep Reinforcement Learning using StableBaselines3 in the CARLA simulator
GitHub

⭐ 70+ stars on GitHub
This project provides an out-of-the-box training and evaluation environment for deep reinforcement learning (DRL) in the CARLA simulator, built on the StableBaselines3 library. I developed it as part of my Bachelor's thesis.

Multi-camera, multi-person tracking with YOLOv7 for locating people

A multi-camera pedestrian tracking system that fuses views from multiple cameras observing the same scene. The system uses YOLOv7 for initial person detection, projects the detected bounding boxes onto a common ground plane using homography transformations, and applies filtering, clustering, and tracking algorithms to determine people's positions accurately in real time.
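The ground-plane projection step can be illustrated with a minimal sketch in plain Python. It is not the project's code: the function names are hypothetical, the homography matrices below are made-up values, and using the bottom-center of each bounding box as the person's foot point is an assumption about how a box is reduced to a single ground location.

```python
def foot_point(box):
    """Bottom-center of an (x1, y1, x2, y2) box, assumed to be where the person touches the ground."""
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2.0, y2)


def project_to_ground(H, point):
    """Apply a 3x3 homography H (nested lists, row-major) to an image point (u, v)."""
    u, v = point
    x = H[0][0] * u + H[0][1] * v + H[0][2]
    y = H[1][0] * u + H[1][1] * v + H[1][2]
    w = H[2][0] * u + H[2][1] * v + H[2][2]
    if abs(w) < 1e-12:
        raise ValueError("point maps to infinity under this homography")
    # Divide by the homogeneous coordinate to get ground-plane coordinates.
    return (x / w, y / w)


# Made-up homography for one camera and one detection from that camera.
H_cam = [[2.0, 0.0, 1.0],
         [0.0, 2.0, 0.0],
         [0.0, 0.0, 2.0]]
detection = (10, 20, 30, 60)
ground_xy = project_to_ground(H_cam, foot_point(detection))
```

Once every camera's detections are mapped into the same ground-plane coordinates this way, points from different views that land close together can be clustered as the same person.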

ExploreAI: A mobile app for discovering city landmarks
GitHub

ExploreAI is a mobile app designed to make city exploration interactive. It uses real-time location data and AI-driven descriptions to guide users through landmarks, unlocking information as they explore. I created this project to learn about app development and AI integration, using React Native and Expo for cross-platform support, PostgreSQL for data, Clerk for authentication, and OpenAI and Wikipedia APIs for dynamic content.

REDQ Implementation: Randomized Ensembled Double Q-Learning in PyTorch
GitHub Report

Clean PyTorch implementation of Randomized Ensembled Double Q-Learning, a model-free reinforcement learning algorithm that achieves high sample efficiency in continuous action space domains. It demonstrates faster learning than SAC on OpenAI Gym environments.
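The core idea behind REDQ's target can be sketched in a few lines. This is a scalar, plain-Python illustration rather than the repository's PyTorch code: the function name and parameters are hypothetical, and the defaults (discount 0.99, entropy coefficient 0.2, subset size 2) are assumptions chosen for illustration. The target takes the minimum over a random subset of the ensemble's target Q-values and subtracts a SAC-style entropy term.

```python
import random


def redq_target(reward, done, next_q_values, next_log_prob,
                gamma=0.99, alpha=0.2, subset_size=2, rng=random):
    """Bootstrapped target for one transition.

    next_q_values: target-network estimates Q_i(s', a') from every ensemble member.
    next_log_prob: log pi(a' | s') for the action a' sampled from the current policy.
    """
    if not 1 <= subset_size <= len(next_q_values):
        raise ValueError("subset_size must be between 1 and the ensemble size")
    # Pick a random subset of ensemble members and minimize over it only.
    subset = rng.sample(range(len(next_q_values)), subset_size)
    min_q = min(next_q_values[i] for i in subset)
    # Entropy-regularized value of the next state, as in SAC.
    soft_value = min_q - alpha * next_log_prob
    # No bootstrapping past a terminal state.
    return reward + gamma * (1.0 - float(done)) * soft_value
```

Every Q-function in the ensemble is then regressed toward the same target, typically with several gradient updates per environment step, which is where the sample efficiency comes from.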

Gravitational Simulator for performance-oriented programming in C++
GitHub

This project analyzes and compares the performance of Array of Structures (AoS) and Structure of Arrays (SoA) in a C++ gravitational simulator, focusing on efficiency improvements. Additionally, it explores performance gains from parallelizing the code with OpenMP for multi-threaded execution.

CNN-based MRI Classification for Alzheimer's Disease Detection
GitHub

This project uses CNNs to classify MRI scans and predict Alzheimer's disease severity across four stages: NonDemented, VeryMildDemented, MildDemented, and ModerateDemented. Grad-CAM is applied to visualize key brain areas influencing the model's classification decisions.

G-means: Implementation of the "Learning the k in k-means" paper
GitHub

G-means is an adaptive clustering algorithm that automatically determines the number of clusters by recursively applying a statistical hypothesis test that checks whether the points in each cluster follow a Gaussian distribution. It extends k-means by splitting any cluster that fails the test and re-testing the resulting sub-clusters, so the number of clusters is chosen from the data rather than tuned by hand.
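A minimal sketch of the split decision, in plain Python rather than the repository's code. It assumes the usual G-means recipe: project a cluster's points onto the line joining two candidate child centers, standardize the projections, and run an Anderson-Darling normality test. The function names are hypothetical, the example data is made up, and the default critical value of 1.8692 (commonly quoted for a significance level of 0.0001) is an assumption.

```python
import math
from statistics import NormalDist


def normal_cdf(z):
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def anderson_darling(values):
    """Anderson-Darling statistic against a normal with estimated mean and variance."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / (n - 1))
    if std == 0.0:
        raise ValueError("all projected values are identical")
    z = sorted((v - mean) / std for v in values)
    eps = 1e-12  # keep the logarithms finite for extreme values
    total = 0.0
    for i in range(n):
        lo = min(max(normal_cdf(z[i]), eps), 1.0 - eps)
        hi = min(max(normal_cdf(z[n - 1 - i]), eps), 1.0 - eps)
        total += (2 * i + 1) * (math.log(lo) + math.log(1.0 - hi))
    a2 = -n - total / n
    # Small-sample correction used when mean and variance are estimated.
    return a2 * (1.0 + 4.0 / n - 25.0 / (n * n))


def should_split(points, center_a, center_b, critical=1.8692):
    """True if the projected data looks non-Gaussian, i.e. the cluster should be split."""
    direction = [a - b for a, b in zip(center_a, center_b)]
    norm_sq = sum(d * d for d in direction)
    projected = [sum(p_i * d_i for p_i, d_i in zip(p, direction)) / norm_sq
                 for p in points]
    return anderson_darling(projected) > critical


# Made-up 1-D data: two tight groups versus evenly spaced Gaussian quantiles.
bimodal = [(-5 + 0.01 * k,) for k in range(100)] + [(5 + 0.01 * k,) for k in range(100)]
gaussian = [(NormalDist().inv_cdf((k + 0.5) / 200),) for k in range(200)]
split_bimodal = should_split(bimodal, (-4.5,), (5.5,))
split_gaussian = should_split(gaussian, (-1.0,), (1.0,))
```

In the full algorithm, the two candidate centers would come from running k-means with k=2 inside the cluster, and the test would be repeated on each accepted child until no cluster is split.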