Kelvin Sila
Data Scientist/Analyst

A dedicated data scientist skilled in Power BI, Excel/Google Sheets, SQL, Python, Tableau, TensorFlow, Keras, GitHub, PyTorch, Flask, Google Cloud, statistical data analysis, machine learning, and deep learning, among other skills critical to the data science and analytics field.

Moringa School AI Chatbot

In this project, we created a chatbot for Moringa School's website. We scraped data from all Moringa School sites using API scraper functions and BeautifulSoup. From the extracted corpus text, we created a JSON file with intents, questions, and responses. I gained skills in natural language processing (NLP), specifically deep learning models such as the sequential models used to train the chatbot to generate appropriate responses. We are still improving our model’s performance. Additionally, I developed skills in web scraping, parsing JSON files, and data cleaning for NLP.
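As a rough illustration of the intents file described above, here is a minimal sketch of how such a JSON structure might be parsed into training pairs. The field names (`tag`, `patterns`, `responses`), the sample intents, and the helper functions are assumptions made for this example, not the project's actual data or code.

```python
import json

# Hypothetical intents content: the tags, patterns, and responses below are
# invented for illustration and are not taken from the real project file.
INTENTS_JSON = """
{
  "intents": [
    {
      "tag": "greeting",
      "patterns": ["Hi", "Hello", "Good morning"],
      "responses": ["Hello! How can I help you today?"]
    },
    {
      "tag": "courses",
      "patterns": ["Which courses do you offer?", "What can I study?"],
      "responses": ["Please see the courses page for the current list."]
    }
  ]
}
"""


def build_training_pairs(raw_json):
    """Flatten the intents into (lowercased pattern, tag) pairs for a classifier."""
    data = json.loads(raw_json)
    pairs = []
    for intent in data["intents"]:
        for pattern in intent["patterns"]:
            pairs.append((pattern.lower(), intent["tag"]))
    return pairs


def responses_by_tag(raw_json):
    """Map each intent tag to its list of candidate responses."""
    data = json.loads(raw_json)
    return {intent["tag"]: intent["responses"] for intent in data["intents"]}


pairs = build_training_pairs(INTENTS_JSON)
responses = responses_by_tag(INTENTS_JSON)
print(pairs[0])
print(responses["courses"][0])
```

In a setup like this, the pairs would feed the text classifier, and the predicted tag would be used to look up a response.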

Chatbot Website: Tawi-Chatbot.

Power BI Projects

Power BI analytics

This series of projects uses Power BI to analyze the water crisis in Maji Ndogo, a mythical city facing challenges with clean water access. Using data on water sources, collections, and related issues, I gained insights into residents' daily lives and the role water plays in them. Click the links below for specific projects.

Part 1: Visualizing Maji Ndogo's Past
Part 2: Moulding Data Into Visual Stories of Maji Ndogo
Part 3: Communicating Findings in Maji Ndogo
Part 4: Communicating Findings in Maji Ndogo
Sales Analysis Project

SQL Data Querying Projects

SQL Projects

In this series of projects, I used the Maji Ndogo database and queried it using Structured Query Language (SQL). Through the analysis, I sharpened my SQL skills, including the SELECT, FROM, WHERE, GROUP BY, and ORDER BY clauses, among others. Click the links below for specific projects.

Part 1: Beginning Our Data-Driven Journey in Maji Ndogo
Part 2: Clustering data to unveil Maji Ndogo's water crisis
Part 3: Weaving the data threads of Maji Ndogo's narrative
Part 4: Clustering data to unveil Maji Ndogo's water crisis
Sales Analysis Project
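To illustrate the kind of query these SQL projects practice, here is a minimal sketch using Python's built-in `sqlite3` module. The table name `water_source`, its columns, and the sample rows are hypothetical and are not taken from the actual Maji Ndogo database.

```python
import sqlite3

# In-memory database with a hypothetical table; the schema and rows are
# invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE water_source (town TEXT, source_type TEXT, people_served INTEGER)"
)
conn.executemany(
    "INSERT INTO water_source VALUES (?, ?, ?)",
    [
        ("Town A", "well", 400),
        ("Town A", "tap", 250),
        ("Town B", "well", 300),
        ("Town B", "river", 150),
        ("Town C", "tap", 500),
    ],
)

# WHERE filters rows before grouping, GROUP BY aggregates per source type,
# and ORDER BY sorts the aggregated totals from largest to smallest.
query = """
SELECT source_type, SUM(people_served) AS total_people
FROM water_source
WHERE people_served >= 200
GROUP BY source_type
ORDER BY total_people DESC
"""
results = conn.execute(query).fetchall()
conn.close()

for source_type, total_people in results:
    print(source_type, total_people)
```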

Machine/Deep Learning Projects

Automated Identification of Plant Leaf Disease

In this project, we used Convolutional Neural Networks (CNN) to create a mobile app for diagnosing potato plant diseases through leaf image recognition. The CNN model accurately classified various potato leaf diseases and identified healthy vs. unhealthy leaves. I gained skills in image classification, data preprocessing, TensorFlow, deep learning, and model deployment on Google Cloud.

Mobile App: Advantech's Mobile App. Presentation: Final Presentation.

Diabetes Prediction

The goal of this machine learning project is to develop a diabetes prediction system using logistic regression. The system will help with early identification of individuals who may be at risk of developing diabetes, based on certain health-related features. It will be deployed as a Django web application, providing a user-friendly interface where individuals can enter their health data and receive a prediction of their likelihood of being diabetic.
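For readers unfamiliar with logistic regression, the sketch below trains a tiny model with batch gradient descent using only the standard library. It is not the project's code: the two features (imagined here as standardized glucose and BMI values), the toy data, and the function names are all assumptions made for illustration.

```python
import math


def sigmoid(z):
    """Map a real-valued score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(weights, bias, x):
    """Estimated probability of the positive class for one feature vector."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


def train_logistic_regression(X, y, lr=0.1, epochs=2000):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n_samples, n_features = len(X), len(X[0])
    weights, bias = [0.0] * n_features, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_features, 0.0
        for xi, yi in zip(X, y):
            error = predict_proba(weights, bias, xi) - yi
            for j in range(n_features):
                grad_w[j] += error * xi[j]
            grad_b += error
        weights = [w - lr * g / n_samples for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n_samples
    return weights, bias


# Toy, made-up data: each row is [standardized glucose, standardized BMI];
# label 1 means "diabetic" in this illustration.
X = [[-1.2, -0.8], [-0.7, -1.0], [-0.3, 0.1], [0.4, 0.2], [0.9, 1.1], [1.3, 0.7]]
y = [0, 0, 0, 1, 1, 1]

weights, bias = train_logistic_regression(X, y)
high_risk = predict_proba(weights, bias, [1.0, 1.0])
low_risk = predict_proba(weights, bias, [-1.0, -1.0])
print(round(high_risk, 3), round(low_risk, 3))
```

In practice a library implementation would usually replace the hand-written training loop; the loop is shown here only to make the mechanics visible.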

Presentation: Jupyter notebooks.

Supervised Machine Learning Predictive Modeling

In this project, we used supervised machine learning to predict customers' likelihood of buying houses in King County, Washington. We applied linear regression to identify key factors for buying or selling, including living room size and the number of bedrooms, bathrooms, and floors. I gained skills in data preparation, feature engineering, data visualization, and linear regression.
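As a simplified illustration of linear regression, the sketch below fits a one-feature ordinary least squares line relating living area to price. The project itself used several features. The numbers and the function name here are invented for the example and are not drawn from the project's dataset.

```python
def fit_simple_ols(xs, ys):
    """Closed-form least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up data: living area in square feet and sale price in dollars,
# constructed so that price = 150 * area + 50000 exactly.
living_area = [1000, 1500, 2000, 2500]
price = [200000, 275000, 350000, 425000]

slope, intercept = fit_simple_ols(living_area, price)
predicted = slope * 1800 + intercept
print(slope, intercept, predicted)
```

The slope can be read as the estimated price change per additional square foot, which is the kind of interpretation used to rank factors in a regression like this.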

Project Slides: Final Presentation.

Data Analysis for Microsoft's Movie Venture

In this project, I conducted exploratory data analysis to evaluate trends in movie production, release, and profitability since 2015, advising Microsoft on whether to enter the movie industry. The analysis recommended optimal movie genres and release months, adjusted for inflation. The Avengers had the highest return on investment. This project enhanced my skills in data analysis, visualization, and feature engineering.

Project Slides: Final Presentation.