Rito Ghosh
Legal name: Ritobrata Ghosh
(he/him)
Google Scholar π | Visit π Blog | Visit LinkedIn for a more updated version | βοΈ/Twitter | πΈ GitHub | π Mastodon | βοΈ [gh_uname].ml@gmail.com | π CV
π₯Ό Research Interests: Computer Vision, Scientific Machine Learning, Artificial Life, Edge AI/TinyML, Computational Neuroscience, Efficient NLP.
ποΈ Key Skills: Python, Deep Learning, Machine Learning, Computer Vision, PyTorch, Mathematics, Programming, Team Work, Communication.
π¨βπ» Experience
π¦ Deep Learning Research Consultant - Akbar Brothers
ποΈ Mar 2021 - Jun 2023 | πΊοΈ Remote
- Worked in single-label classification with many labels.
- Cleaned, wrangled a large amount of data (>60 GB).
- Set up end-to-end Deep Learning training pipeline for single-label image classification.
- Experimented and worked with Deep Learning architectures such as DenseNet, ResNet, EfficientNet
- Used improvement techniques such as image data augmentation, oversampling and undersampling, gradual freezing, discriminative learning rates, ensemble methods, etc.
- Achieved accuracy of >91% in the classification task, where this has never been done before.
- Worked with tabular data of ~200k rows involving features to predict numerical data with XGBoost. Achieved ~98% accuracy.
- Worked closely with business stakeholders to understand and solve problems.
- Stack: Python, PyTorch, fast.ai, Scikit-Learn, XGBoost, Pandas, Python, Colab Pro.
Skills: Machine Learning, Deep Learning, Python (Programming Language), Data Science, Programming, Computer Vision, Linux, Jupyter Notebook, PyTorch, Written Communication, Convolutional Neural Networks (CNN)
π¦ Deep Learning Research Consultant - Okkular.io
ποΈ Sep 2021 - Oct 2021 | πΊοΈ Remote
- Wrote the patent filing for a novel architecture developed inside the company (filing process ongoing)
- Briefly worked on Active Learning techniques in Computer Vision to automate the training process and improve accuracy and maintenance.
- Stack: Python, PyTorch, fast.ai, AWS SageMaker, AWS S3, Pandas.
Skills: Deep Learning, Linux, AWS SageMaker, Team Work
π¦ Technical Reviewer - Packt
ποΈ Sep 2022 - Present | πΊοΈ Remote
- Reviewing a book on Advanced PyTorch. Chapters include: multimodal AI, Deep Reinforcement Learning, etc.
- Working with co-ordinators and other editors.
Skills: PyTorch, Computer Vision, NLP, Reinforcement Learning, Team Work
π¦ Section Leader (Teaching Assistant) of Code in Place Programme - Stanford University
ποΈ Apr 2021 - Jun 2021| πΊοΈ Remote
This online course was offered by Stanford University during the COVID-19 pandemic. It brought together 12,000 students and 1,100 volunteer teachers participating from around the world. The course is a 6-week introduction to Python programming using materials from the first half of Stanfordβs CS106A course.
As a volunteer section leader, I prepared and taught a weekly discussion section of 8-10 students to supplement professorsβ lectures.
Received direct training from Stanford Professors Mehran Sahami, Chris Piech, and Julie Zelenski. Received further training from Stanford-appointed teacher mentors.
Taught introductory Python and CS to about 12 students from India, Bangladesh, and the USA. Students were of various ages and came from diverse backgrounds including from Fine Arts backgrounds. For many students, it was the first introduction to programming and CS ever.
Taught five sessions of my own (lasting 1.5 h on average) and a backup session for another.
Taught programming, problem-solving, and CS concepts.
Skills: Python, Programming, Teaching, Team Work
π¨βπ« Education
π© Master of Science (Computer Science) [M.Sc. (CS)]
ποΈ Techno College Hooghly (under MAKAUT) | ποΈ Jul 2021 - Jul 2023 | πΊοΈ Kolkata
CGPA: 9.04/10 Notable courses: Advanced algorithms, Soft Computing, Artificial Intelligence
π© Bachelor of Science (Physics, Mathematics, Computer Science) [B.Sc.]
ποΈ University of Calcutta | ποΈ Jul 2017 - Apr 2021 | πΊοΈ Kolkata
Grade: First Class Notable courses: Mathematical Methods for Physics I, II, Quantum Mechanics, Statistical Mechanics, Special Theory of Relativity
π© Summer School (Computational Neuroscience)
ποΈ Neuromatch Academy | ποΈ Jun 2022 - Jul 2022 | πΊοΈ Remote
π¦οΈ Projects
π§ Functional connectivity differences between ECoG signals during motor movement vs. motor imagery
ποΈ Neuromatch Academy Summer School in Computational Neuroscience
ποΈ Jun 2022 - Jul 2022 | π Link β‘οΈ
- Human brain is responsible for both motor tasks, i.e. when actual movement of some body parts happen, and also for imagery tasks: when the person imagines a movement. This is a project that aims to find functional connectivity among brain areas in both imagined and real motor movements. This project also aims to find the similarity (or lack thereof) between real and imaginary movement in lower dimensional latent space of brain signals using Principal Component Analysis.
Skills: Python, PyTorch, Deep Learning, Computational Neuroscience, Team Work
π§ Learn JAX: From Linear Regression to Neural Networks
ποΈ Personal Project
ποΈ Dec 2021 - Jan 2022 | π Link β‘οΈ
I introduce the JAX library for solving basic problems in Machine Learning. I explain everything so that a beginner in this framework will find it helpful. Introduced the functional programming paradigm for Deep Learning.
Won the Google Open Source Software Expert Prize
Skills: JAX, Machine Learning, Deep Learning, Written Communication
π§ CLIP for Satellite Images
ποΈ Hugging Face x Google Cloud Hackathon
ποΈ Jul 2021 - Sep 2021 | π Link β‘οΈ
This project is an Open-Source Deep Learning architecture that generates captions for satellite imagery.
We used transfer-learning to fine-tune OpenAI CLIP for satellite data.
Training was done for multiple hours on a TPUv3-8 enabled virtual machine provided by Google Cloud.
We used JAX/Flax and PyTorch for this model.
GitHub: https://github.com/arampacha/CLIP-rsicd
Demo: https://huggingface.co/spaces/sujitpal/clip-rsicd-demo
I worked on researching and applying different augmentation techniques that led to the improvement of the model.This project is an Open-Source Deep Learning architecture that generates captions for satellite imagery. We used transfer-learning to fine-tune OpenAI CLIP for satellite data. Training was done for multiple hours on a TPUv3-8 enabled virtual machine provided by Google Cloud. We used JAX/Flax and PyTorch for this model.
Skills: Transfer Learning, Deep Learning, Computer Vision, Linux, Jupyter Notebook
π§ DALLΒ·E Mini
ποΈ Hugging Face x Google Cloud Hackathon
ποΈ Jul 2021 - Sep 2021 | π Link β‘οΈ
This project is an Open-Source Deep Learning architecture that generates images from a given text prompt. This implements the functionality of OpenAIβs DALL-E architecture, but with a 27x smaller model.
A BART autoregressive model was used to process and generate text, and a VQGAN to generate the images. Then we used a pre-trained CLIP model from OpenAI to rank the generated images.
Training was done for 70 hours on a TPUv3-8 enabled virtual machine provided by Google Cloud.
We used JAX/Flax and PyTorch for this model.
GitHub: https://github.com/borisdayma/dalle-mini
Demo: https://huggingface.co/spaces/flax-community/dalle-mini
Skills: Deep Learning, Linux, Google Cloud Platform, JAX, Flax, Team Work
π§ Folk.AI: Classifying Between Two Folk Dance Forms of India Using Deep Learning
ποΈ Personal Project during taking the fast.ai MOOC
ποΈ Jul 2020 - Aug 2020 | π Link β‘οΈ
Trained a Convolutional Neural Network with ResNet-34 leveraging transfer learning. Deployed the model as a web app for inference.
This app distinguishes between Kathakali and Chhau- two classical dance forms of India.Trained a Convolutional Neural Network with ResNet-34 leveraging transfer learning. Deployed the model as a web app for inference. GitHub- https://github.com/ghosh-r/folkAI This app distinguishes between Kathakali and Chhau- two classical dance forms of India.
Skills: Transfer Learning, Deep Learning, Computer Vision
π§ Investing in A New Eatery in California: A Data Driven Approach
ποΈ Personal Project for IBM Data Science Professional Certificate
ποΈ Jul 2020 - Jul 2020 | π Link β‘οΈ
Applied KMeans Clustering algorithm to cluster counties of California, to look for and recommend new investment opportunities for investors looking to fund new eateries in California City.
Evaluated feature correlation through the Pearson Coefficient, and evaluated model through elbow method.
Final Report Link: https://github.com/ghosh-r/California_Eatery/blob/master/final_deliverables/capstone_project_final_report.pdf
Skills: Machine Learning, Data Science, Python, Scikit-Learn, Written Communication
π§ Free Bengali Poetry (Dataset project)
ποΈ Personal Project
ποΈ Jul 2021 | π Link β‘οΈ
I scraped about 2,700 copyright-free Bengali poems from the web and made them available under a permissive license. The intention of the project is to provide a freely available, well-structured dataset for downstream tasks of pre-trained language models, such as poetry generation.
This dataset is ideal for training language models to generate prose with pre-trained language models such as GPT-2, GPT-J, etc.
DOI: 10.34740/KAGGLE/DSV/2400728
Skills: Linux, Python, Web Scraping
π§ Bengali-MNIST (Dataset Project)
ποΈ Personal Project
ποΈ Mar 2021 | π Link β‘οΈ
- Created a Dataset containing 70,000 images from an existing database so that it can be used for a seamless introduction to Vision, and can be used to solve commercial problems involving Bengali handwritten digits.
Skills: Python, Linux
π Honors and Awards
ποΈ Winner of Google Open Source Software Expert Prize
ποΈ Google/Kaggle
ποΈ Apr 2022 | π Link β‘οΈ
- In this world-wide competition, I was awarded the prize along with two others for demonstrating expertise in Googleβs Open Source software JAX and communicating my knowledge.
ποΈ 1st Place in JAX/Flax Deep Learning Community Event
ποΈ Hugging Face x Google Cloud
ποΈ Jul 2021 | π Link β‘οΈ
80+ Teams of about 800 people took part in this competition, and among ~50 final projects, our project was picked as the top one.
Our team consisted of a Software Engineer, two ML Engineers, three Ph.D. Students, and two undergraduate students.
We built an Open Source version of OpenAIβs Dall-E Mini, that generates images from text. We used JAX/Flax as our framework, trained on Cloud TPU-VMs with TPUv3-8s. We used a VQGAN + BART architecture, and CLIP to rank generated images.
I personally worked on sequence generation using BART and created part of the model front-end using Streamlit. Actively took part in all other aspects of the project.
On the Jury were Ross Wightman (Investor, DL Library maintainer), Asish Vaswani (of Transformers), Niki Parmar (Research Scientist, Google Brain), and Thomas Wolf (CSO of Hugging Face).
GitHub: https://github.com/borisdayma/dalle-mini
Report: https://wandb.ai/dalle-mini/dalle-mini/reports/DALL-E-miniβVmlldzo4NjIxODA
ποΈ 3rd Place in JAX/Flax Deep Learning Community Event Organized by Hugging Face and Google Cloud
ποΈ Hugging Face x Google Cloud
ποΈ Jul 2021 | π Link β‘οΈ
80+ Teams of about 800 people took part in this competition, and among ~50 final projects, our project was picked for third place.
We built an Open Source model for captioning satellite images by fine-tuning CLIP with satellite image data (RSICD dataset). We expect it to be used in natural disaster management, climate monitoring, and defense among other uses.
We used JAX/Flax as our framework, trained on Cloud TPU-VMs with one TPUv3-8.
I personally worked on data augmentation techniques for the images in the dataset for improving the accuracy score.
On the Jury were Ross Wightman (Investor, DL Library maintainer), Ashish Vaswani (of Transformers fame), Niki Parmar (Research Scientist, Google Brain), and Thomas Wolf (CSO, Hugging Face).
GitHub: https://lnkd.in/eeWQBiX
Report: https://lnkd.in/en74Mmk
ποΈ βBest Speaker of The Legislative Assemblyβ in Youth Parliament Competition, District Level
ποΈ Department of Parliamentary Affairs, Government of West Bengal
ποΈ Dec 2015 | π Link β‘οΈ
Was awarded the the βBest Speaker of Legislative Assemblyβ award in district level of Youth Parliament Competition in (then undivided) Burdwan District, West Bengal. Received certificate signed by the Honβble Minister-in-Charge of Departments of Parliamentary Affairs, School Education and Higher Education, Government of West Bengal and Shri Basudeb Banerjee, I.A.S, Additional Chief Secretary, Home & Parliamentary Affairs Department, Government of West Bengal.
Youth Parliament Competition is a Government funded competition to grow awarness about Parliamentary Democratic process, rules and decorum among school students.
My school stood first at the block level, and I was awarded the same award in the Block level and qualified to District level, and received the award.
ποΈ 3rd Place in District Level Student-Youth Science Fair
ποΈ Department of Youth Services, Government of West Bengal
ποΈ Sep 2015 | π Link β‘οΈ
Designed a model of a stradling bus, and discussed on its logistics and was awarded 3rd Place in District Level (then undivided) Burdwan District.
And participated in State Level of the Competition.
π Publications
π Complete Introductory Guide to Speech to Text with Transformers
ποΈ Analytics Vidhya
ποΈ Jul 2023 | π Link β‘οΈ
Wrote an introductory guide to using Transformers models for solving audio-related problems.
It is very practical and one can get started with Audio Machine Learning right away after reading this blog. This also teaches how to use Hugging Face to find models, datasets, and how to use them to solve oneβs own problems.
π Non-Linear Dynamics Through Linear Algebraic Lenses: Attempting to Learn the Trajectories of the Logistic Map with Artificial Neural Networks
ποΈ Masterβs Thesis published in Zenodo
ποΈ Apr 2023 | π Link β‘οΈ
To automatically learn the behavior of trajectories of a map in Non-Linear Dynamics- the Logistic Map, Deep Neural Networks have been trained. Different iterates of the Logistic Map have been generated and models have been fit to them to test the learning capabilities of Neural Networks under such scenario. This paper examines the capability of Neural Networks to learn the dynamics of a system that can be modeled with the Logistic Map.
keywords- Non-Linear Dynamics, Deep Learning, Artificial Neural Networks, Physics, Computational Mathematics, Logistic Map
π Best Podcasts for Machine Learning
ποΈ KDNuggets
ποΈ Apr 2021 | π Link β‘οΈ
π TAKING JULIA FOR A TEST DRIVE: My First Encounter With Julia
ποΈ Towards Data Science
ποΈ Sep 2020 | π Link β‘οΈ
π How Much Math Do You Need to Know to Get Started with Data Science? A Clear, Straightforward Answer
ποΈ Towards Data Science
ποΈ Jul 2020 | Link β‘οΈ
Volunteering Experiences
π± Data Contributor
ποΈ Mozilla | ποΈ Mar 2021 - Mar 2022
I recorded my voice and validates othersβ for creating the Mozilla Common Voice Dataset. I was involved in five languages- Bengali, English, Spanish, Hindi, and German.
π± Mentor
ποΈ Code School by Giles Mcmullen | ποΈ Jul 2021 - Present
- I help answer questions from beginners getting into programming and learning Data Science. I also guide beginners about programming languages, problem solving skills, tools, and other related things about programming and Data Science.
π± Volunteer
ποΈ Porichoy | ποΈ Jan 2015 - Jan 2017
- Served as a volunteer in Porichoy, a local group of volunteers aimed to serve the less fortunate after natural disasters and helping in education of children from economically challenged families. I mainly worked in rehabilitation after flood as a part of a team and organised cultural events.
π± Mentor of Youth Parliament Competition
ποΈ Dainhat High School | ποΈ Jul 2022 - Sep 2022
Trained a group of fifteen high school students in Youth Parliament Competition.
Taught them about government, state-level polity, and conducts of Vidhan Sabha (State Legisslative Assembly)
Helped train them in speech delivery, acting, etc.
Helped write the script of 45 minute run- many additions, full editing, and wrote all English dialogues.
In the first competition, held at the level of Municipality, they won the First place beating three other schools.
In the next level, they won the Third place competing with 29 other schools in the District Level.
π± Language and Culture Volunteer
ποΈ Project Bhasha | ποΈ Jan 2019 - Feb 2021
- Project Bhasha was aimed at creating a website and a community around it to teach Bengali to foreigners. We took a broader approach to teach culture as well as the language. My role was to write articles on Bengali art, culture, language, and literature. I also oversaw some aspects of the overall strategy, pedagogy, etc.
π€ΈββοΈ Hobbies
Reading books, swimming, playing the piano, narrating stories, etc.