Frame Level Speech Recognition
Application of feedforward neural networks to recognize phoneme states in audio recordings from the Wall Street Journal dataset.
Application of feedforward neural networks to recognize phoneme states in audio recordings from the Wall Street Journal dataset.
MyTorch is a python Deep Learning framework. Implemented using Numpy, the framework supports functionality for building and training deep learning networks.
We presents a multilingual ASR model for Kinyarwanda, Swahili, and Luganda, achieving a WER of 21.91 using a 3,900-hour curated dataset from the Common Voice project.
Download here
We model and predict Johannesburg Securities Exchange (JSE) stock trends using Markov Chains and benchmark them against popular machine learning techniques.
Download here
We propose using microgrids powered by renewable energy and enhanced with machine learning for energy trading to improve electricity supply in rural Sudan.
Download here
We use textual inversion to adapt the Stable Diffusion model for text-to-image generation, fine-tuning it with a dataset including LAION 5B and images from African artists to generate culturally specific artistic styles, achieving above-average performance in evaluations.
Download here