• Latest
  • Trending
Want to Be a Data Scientist? Don’t Start With Machine Learning

Want to Be a Data Scientist? Don’t Start With Machine Learning

December 30, 2021
ATC Ghana supports Girls-In-ICT Program

ATC Ghana supports Girls-In-ICT Program

April 25, 2023
Vice President Dr. Bawumia inaugurates  ICT Hub

Vice President Dr. Bawumia inaugurates ICT Hub

April 2, 2023
Co-Creation Hub’s edtech accelerator puts $15M towards African startups

Co-Creation Hub’s edtech accelerator puts $15M towards African startups

February 20, 2023
Data Leak Hits Thousands of NHS Workers

Data Leak Hits Thousands of NHS Workers

February 20, 2023
EU Cybersecurity Agency Warns Against Chinese APTs

EU Cybersecurity Agency Warns Against Chinese APTs

February 20, 2023
How Your Storage System Will Still Be Viable in 5 Years’ Time?

How Your Storage System Will Still Be Viable in 5 Years’ Time?

February 20, 2023
The Broken Promises From Cybersecurity Vendors

Cloud Infrastructure Used By WIP26 For Espionage Attacks on Telcos

February 20, 2023
Instagram and Facebook to get paid-for verification

Instagram and Facebook to get paid-for verification

February 20, 2023
YouTube CEO Susan Wojcicki steps down after nine years

YouTube CEO Susan Wojcicki steps down after nine years

February 20, 2023
Inaugural AfCFTA Conference on Women and Youth in Trade

Inaugural AfCFTA Conference on Women and Youth in Trade

September 6, 2022
Instagram fined €405m over children’s data privacy

Instagram fined €405m over children’s data privacy

September 6, 2022
8 Most Common Causes of a Data Breach

5.7bn data entries found exposed on Chinese VPN

August 18, 2022
  • Consumer Watch
  • Kids Page
  • Directory
  • Events
  • Reviews
Friday, 23 May, 2025
  • Login
itechnewsonline.com
  • Home
  • Tech
  • Africa Tech
  • InfoSEC
  • Data Science
  • Data Storage
  • Business
  • Opinion
Subscription
Advertise
No Result
View All Result
itechnewsonline.com
No Result
View All Result

Want to Be a Data Scientist? Don’t Start With Machine Learning

The biggest misconception aspiring data scientists have

by ITECHNEWS
December 30, 2021
in Data Science, Leading Stories
0 0
0
Want to Be a Data Scientist? Don’t Start With Machine Learning

The first thing most people think about when they hear the term “data science” is usually “machine learning”.

This was the case for me. My interest in data science sparked because I was first exposed to the idea of “machine learning” which sounded really cool. So when I was looking for a place to start learning about data science, you can guess where I started (hint: it rhymes with bean churning).

This was my biggest mistake and this leads me to my main point:

If you want to be a data scientist, don’t start with machine learning.

Bear with me here. Obviously, to be a “complete” data scientist, you’ll have to eventually learn about machine learning concepts. But you’d be surprised at how far you can get without it.

So why shouldn’t you start with machine learning?

1. Machine learning is only one part of a data scientist (and a very small part too).

Image created by Author

Data science and machine learning are like a square and a rectangle. Machine learning is (a part of) data science but data science isn’t necessarily machine learning, similar to how a square is a rectangle but a rectangle isn’t necessarily a square.

In reality, I’d say that machine learning modeling only makes up around 5–10% of a data scientist’s job, where most of one’s time is spent elsewhere, which I’ll elaborate on later.

YOU MAY ALSO LIKE

ATC Ghana supports Girls-In-ICT Program

Vice President Dr. Bawumia inaugurates ICT Hub

TLDR: By focusing on machine learning first, you’ll be putting in a lot of time and energy, and getting little in return.

2. Fully understanding machine learning requires preliminary knowledge in several other subjects first.

At its core, machine learning is built on statistics, mathematics, and probability. The same way that you first learn about English grammar, figurative language, and so forth to write a good essay, you have to have these building blocks set in stone before you can learn machine learning.

To give some examples:

  • Linear regression, the first “machine learning algorithm” that most bootcamps teach first is really a statistical method.
  • Principal Component Analysis is only possible with the ideas of matrices and eigenvectors (linear algebra)
  • Naive Bayes is a machine learning model that is completely based on Bayes Theorem (probability).

And so, I’ll conclude with two points. One, learning the fundamentals will make learning more advanced topics easier. Two, by learning the fundamentals, you will already have learned several machine learning concepts.

3. Machine learning is not the answer to every data scientist’s problem.

Many data scientists struggle with this, even myself. Similar to my initial point, most data scientists think that “data science” and “machine learning” go hand in hand. And so, when faced with a problem, the very first solution that they consider is a machine learning model.

But not every “data science” problem requires a machine learning model.

In some cases, a simple analysis with Excel or Pandas is more than enough to solve the problem at hand.

In other cases, the problem will be completely unrelated to machine learning. You may be required to clean and manipulate data using scripts, build data pipelines, or create interactive dashboards, all of which do not require machine learning.

What should you do instead?

If you’ve read my article, “How I’d Learn Data Science If I Had to Start Over,” you may have noticed that I suggested learning Mathematics, Statistics, and programming fundamentals. And I still stand by this.

Like I said before, learning the fundamentals will make learning more advanced topics easier, and by learning the fundamentals, you will already have learned several machine learning concepts.

I know it may feel like you’re not progressing to be a “data scientist” if you’re learning statistics, math, or programming fundamentals, but learning these fundamentals will only accelerate your learnings in the future.

You have to learn to walk before you can run.

If you would like some tangible next steps to start with instead, here are a couple:

  1. Start with statistics. Of the three building blocks, I think statistics is the most important. And if you dread statistics, data science probably isn’t for you. I’d check out Georgia Tech’s course called Statistical Methods, or Khan Academy’s video series.
  2. Learn Python and SQL. If you’re more of an R kind of guy, go for it. I’ve personally never worked with R so I have no opinion on it. The better you are at Python and SQL, the easier your life will be when it comes to data collection, manipulation, and implementation. I would also be familiar with Python libraries like Pandas, NumPy, and Scikit-learn. I also recommend that you learn about binary trees, as it serves as the basis for many advanced machine learning algorithms like XGBoost.
  3. Learn linear algebra fundamentals. Linear algebra becomes extremely important when you work with anything related to matrices. This is common in recommendation systems and deep learning applications. If these sound like things that you’ll want to learn about in the future, don’t skip this step.
  4. Learn data manipulation. This makes up at least 50% of a data scientist’s job. More specifically, learn more about feature engineering, exploratory data analysis, and data preparation.
Source: Terence Shin, Data Scientist @ KOHO
Via: Data and Marketing Advisor
Tags: Data ScientistML
ShareTweetShare
Plugin Install : Subscribe Push Notification need OneSignal plugin to be installed.

Search

No Result
View All Result

Recent News

ATC Ghana supports Girls-In-ICT Program

ATC Ghana supports Girls-In-ICT Program

April 25, 2023
Vice President Dr. Bawumia inaugurates  ICT Hub

Vice President Dr. Bawumia inaugurates ICT Hub

April 2, 2023
Co-Creation Hub’s edtech accelerator puts $15M towards African startups

Co-Creation Hub’s edtech accelerator puts $15M towards African startups

February 20, 2023

About What We Do

itechnewsonline.com

We bring you the best Premium Tech News.

Recent News With Image

ATC Ghana supports Girls-In-ICT Program

ATC Ghana supports Girls-In-ICT Program

April 25, 2023
Vice President Dr. Bawumia inaugurates  ICT Hub

Vice President Dr. Bawumia inaugurates ICT Hub

April 2, 2023

Recent News

  • ATC Ghana supports Girls-In-ICT Program April 25, 2023
  • Vice President Dr. Bawumia inaugurates ICT Hub April 2, 2023
  • Co-Creation Hub’s edtech accelerator puts $15M towards African startups February 20, 2023
  • Data Leak Hits Thousands of NHS Workers February 20, 2023
  • Home
  • InfoSec
  • Opinion
  • Africa Tech
  • Data Storage

© 2021-2022 iTechNewsOnline.Com - Powered by BackUPDataSystems

No Result
View All Result
  • Home
  • Tech
  • Africa Tech
  • InfoSEC
  • Data Science
  • Data Storage
  • Business
  • Opinion

© 2021-2022 iTechNewsOnline.Com - Powered by BackUPDataSystems

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Go to mobile version