Top Bootcamps for Data Professionals— An Analysis of 5000 Profiles
What are the top Bootcamps and Universities for Data Scientists?

Motivation
As a data enthusiast, you might want to take a data science Bootcamp to upgrade your skills and get a job in data science. However, when searching on Google, you get hundreds of options to choose from.
Instead of choosing a random one, wouldn’t it be nice if you can find which Bootcamps most data scientists took? In this article, we will use the profiles of data scientists to find what institutions they enrolled in.

Collect Data
Dataset of Data Scientists
The dataset of data scientists all over the world is collected from Diffbot, the world’s largest knowledge graph. We are specifically interested in a person’s gender, role, and institutions.

Bootcamps
To get a list of top Bootcamps, I use AutoScraper to automatically scrape all Bootcamps listed on Discover Data Science and SwitchUp.
Get top Bootcamps from Discover Data Science:
Get top Bootcamps from SwitchUp:
Combine the Bootcamps from two websites:
We then get all data scientists who enrolled in theseBootcamps using Diffbot.
Find the specific code to get and process data from Diffbot in this notebook. The processed data is uploaded to this repository.
Let’s use the processed data to visualize top data science Bootcamps and universities. Start with downloading all data to your current directory:
Top 25 Institutions for Data Professionals
Top 25 Institutions by Roles
What are the top 25 institutions for data scientists, machine learning engineers, and data engineers? Let’s figure that out using the data of the top 25 institutions grouped by positions.
Visualize the data:
From the plot above, we can see that the six most popular institutions for data professionals are:
- Standford University
2. University of California — UC Berkeley
3. Columbia University
4. Udacity
5. Georgia Tech
6. MIT
Students who graduated from these institutions seem to have an equal chance to become data scientists, machine learning engineers, or data engineers.
Since Udacity is not a university, we can say that many people learn data science through online courses.
Institutions by Genders
What is the ratio of male data scientists vs female data scientists at these top universities? Let’s find that out using the gender_by_unversities.pkl
data.
Visualize the data:
From the plot above, we can see that the number of males is more than double the number of females at most universities.
This is not surprising since the number of males is also more than double the number of females for roles such as data scientists, data engineers, and machine learning engineers (based on my analysis of all data professionals from all over the world).
Top Majors
Did most data scientists, data engineers, and machine learning engineers have a degree in data science? Let’s use the data from over 300k data scientists, data engineers, and machine learning engineers to figure that out.
From the plot above, we can see that most data professionals had degrees in:
- computer science
- business administration
- mathematics
- data science
- information technology
This shows that you don’t need to have a degree in data science to get into the data science field. You only need to have a degree in a quantitative field such as computer science, math, physics, etc.
Top Bootcamps
Top Bootcamps by Roles
Now you know that you don’t need to go back to school to get a job in data science. But what if you have little to no knowledge in data science and want to upgrade your skills?
One good way to learn data science knowledge without going back to college is to enroll in a Bootcamp. Let’s find out which Bootcamps most data professionals enrolled in.
Visualize the data:
From this plot, we can see that the most popular Bootcamps among data professionals are:
- Udacity
2. General Assembly
3. Springboard
4. Galvanize
5. The Data Incubator
6. Datacamp
Udacity is 6.5 times more popular than the second most popular Bootcamp. This makes sense since there are a lot of courses on Udacity.
Bootcamps by Genders
What is the ratio of male data scientists vs female data scientists enrolled in these Bootcamps? Let’s find that out using the gender_by_bootcamps.pkl
data:
Interesting. We see many more male data scientists enrolled in Udacity courses than female data scientists. This could be because:
- There are more male data scientists than female data scientists in general
- Females prefer to learn through other institutions than Udacity
Conclusion
Congratulations! You have just learned the educational background of data professionals. Summary of the analysis above:
- Many data scientists take online courses and Bootcamps to upgrade their data science skills
- There are many more male data scientists than female data scientists at the top data science institutions
- You don’t need to have a degree in data science to get a job in data science
- The most popular Bootcamp is Udacity
I hope the insights in this article will help you take appropriate actions towards your data science career.
Feel free to play and fork the source code of this article here:
I like to write about basic data science concepts and play with different algorithms and data science tools. You could connect with me on LinkedIn and Twitter.
Star this repo if you want to check out the codes for all of the articles I have written. Follow me on Medium to stay informed with my latest data science articles like these:
What skills and backgrounds do data scientists have in common?
More content at plainenglish.io. Sign up for our free weekly newsletter here.