Here’s how I Learned Just Enough Programming for Data Science

Here’s how I Learned Just Enough Programming for Data Science

How to approach learning programming and best books I recommend. There’s no doubt that data science requires decent programming skills, but how much is enough?

There’s no doubt that data science requires decent programming skills, but how much is enough? Should you know just as much as an average software engineer? This article aims to answer this question, and much more.

Image for post

As a one-sentence summary — no, knowing to program on the level of a mid/senior backend developer is not required. Aim to know more than average statistician and you’ll be fine. There’s always time to learn more as you progress in your career.

The article is divided into three sections:

  1. How much programming is needed in data science?
  2. Which programming language to pick?
  3. Resources I recommend to get started

Please keep in mind — in the article, you’ll find affiliate links to the recommended resources to get started. That doesn’t mean anything to you, as the price is identical, but I’ll get a small commission if you decide to make a purchase. Also, I only show materials I’ve gone through myself and can guarantee 100% for the quality.

Without much ado, let’s get started with the first section.


How much programming is needed in data science?

Well, a lot — but that depends on the role and the company you work for. Small companies don’t necessarily have structured teams for both development and data science, so it’s required to be comfortable with both.

In a nutshell, you won’t be the best in programming nor in data science. That’s not necessarily a bad thing, as you’ll get a better grasp of the product/service the company offers.

Larger companies will treat you differently, due to a more formal structure. You’ll handle data science problems only (as a data scientist), and more often than not won’t see the big picture. You’re there to do the job — not to ask too many questions.

Keep in mind that this is just a rule of thumb — drawn from my experience and from many others.


Which programming language to pick?

It’s not an easy question, to be honest. Most websites mention Python and R as a go-to languages, but those aren’t the only options.

Some companies need a data science solution, but don’t have any data scientists onboard — software development companies centered around the web/mobile development.

While Python and R are great, I find more and more resources on solving machine learning tasks with Java, or even with Go(lang). Heck, I’ve even written a whole article on this topic:

Go for Data Science? Let’s try.

Can Google’s Golang handle data science? Let’s find out.

towardsdatascience.com

I’m not saying languages like Java and Go are great for prototyping, but they are still a viable option for a software developer that doesn’t know Python or just doesn’t want to use it. As I’m diving deeper into software development, or developing applications that use machine learning, I can get why someone wants to stay away from Python.

To summarize:

  • Learn Python/R if you only care about data science and machine learning
  • If you are a software developer and don’t want to switch languages, you can try Java and Go (among other languages)

Resources I recommend to get started

My guess is that you’ve chosen the Python route, and that’s great for several reasons:

  • The language is simple to learn — more beginner-friendly than Java/Go
  • It’s the most widely used language for data science
  • It’s a general-purpose language — not limited to statistical tasks

As an aspiring data scientist, Python will suit you just fine. There’s no need for you to explore other, more difficult languages as coding shouldn't be your primary concern.

But how to get started? I’ve got two amazing books for you which helped me to learn Python well, both pure programming-wise and for the data analysis tasks. Let’s start with the basics.

towards-data-science software-development data-science python programming data analysis

Bootstrap 5 Complete Course with Examples

Bootstrap 5 Tutorial - Bootstrap 5 Crash Course for Beginners

Nest.JS Tutorial for Beginners

Hello Vue 3: A First Look at Vue 3 and the Composition API

Building a simple Applications with Vue 3

Deno Crash Course: Explore Deno and Create a full REST API with Deno

How to Build a Real-time Chat App with Deno and WebSockets

Convert HTML to Markdown Online

HTML entity encoder decoder Online

Offshore Software Development - Best Practices

To make the most out of the benefits of offshore software development, you should understand the crucial factors that affect offshore development.

Basic Data Types in Python | Python Web Development For Beginners

In the programming world, Data types play an important role. Each Variable is stored in different data types and responsible for various functions. Python had two different objects, and They are mutable and immutable objects.

Python For Data Science | Python For Data Analysis

Python for Data Science, you will be working on an end-to-end case study to understand different stages in the data science life cycle. This will mostly deal with "data manipulation" with pandas and "data visualization" with seaborn. After this, an ML model will be built on the dataset to get predictions. You will learn about the basics of the sci-kit-learn library to implement the machine learning algorithm.

Hire Python Developers

Are you looking for experienced, reliable, and qualified Python developers? If yes, you have reached the right place. At **[HourlyDeveloper.io](https://hourlydeveloper.io/ "HourlyDeveloper.io")**, our full-stack Python development services...

Applied Data Science with Python Certification Training Course -IgmGuru

Master Applied Data Science with Python and get noticed by the top Hiring Companies with IgmGuru's Data Science with Python Certification Program. Enroll Now