Setting Up a Linux Data Science Environment on Windows

Phillip Peng
4 min readJan 5, 2024

a guaranteed-to-work guide to setup a Linux data science environment for Windows users.

Summary chart created by Mermaid JS

Linux is often preferred for data science projects because it supports a range of Python packages specifically designed for this OS, which may not work as well on other systems. Additionally, Linux is sometimes a requirement for advanced projects due to its compatibility with certain libraries, its efficient memory management, and the availability of specialized tools that perform optimally in this environment. Therefore, learning to use Linux can be crucial for effectively handling data science tasks.

Welcome to this guaranteed-to-work guide where we will guide Windows users through setting up a Linux Python environment for data science projects using the Windows Subsystem for Linux (WSL) and Git. This setup provides the best of both worlds: the robustness of a Linux environment and the convenience of Windows.

Part 1: Installing WSL and Ubuntu

Step 1: Enable WSL

  1. Open PowerShell as Administrator and run:
wsl --install

2. Restart your computer when prompted.

Step 2: Install Ubuntu

  1. Open Microsoft Store and search for…

--

--