Installation and Setup
Installing Anaconda:
- Visit https://www.anaconda.com/download/ in your browser.
- Select Windows, macOS, or Linux, depending on the OS you are working on.
- Next, click on the Download option for the latest version of the installer.
- Open the installer after download.
- Follow the steps in the installer and that's it! Your Anaconda distribution is ready.
PySpark is available on PyPI. To install it, or upgrade an existing installation, run the following command:
pip install pyspark --upgrade
Updating Jupyter and installing dependencies:
- Open Anaconda Prompt (on Windows, search for it in the Start menu; on macOS or Linux, open a terminal).
- Type the following commands to update Conda and Jupyter:
# Update conda
conda update conda
# Update Jupyter
conda update jupyter
# Install packages
conda install numpy
conda install pandas
conda install statsmodels
conda install matplotlib
conda install seaborn
- To open Jupyter Notebook from Anaconda Prompt, use the following command:
jupyter notebook
- To install or upgrade scikit-learn, use the following command:
pip install -U scikit-learn
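Once the commands above have run, it can help to confirm which packages are actually visible to Python. The following is a minimal sketch, not part of the original steps: the PACKAGES list and the installed_versions helper are names made up for this illustration, and the check assumes each package registered standard metadata when it was installed (which is normally the case for both pip and conda installs).

```python
from importlib import metadata

# Distribution names as used in the install commands above (illustrative list).
PACKAGES = [
    "numpy",
    "pandas",
    "statsmodels",
    "matplotlib",
    "seaborn",
    "scikit-learn",
    "pyspark",
]


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if it is missing."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


# Print a short report: one line per package.
for name, version in installed_versions(PACKAGES).items():
    print(f"{name}: {version if version is not None else 'NOT INSTALLED'}")
```

Run it from Anaconda Prompt or from a Jupyter Notebook cell. Any package reported as NOT INSTALLED can be installed again with the corresponding conda or pip command.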