Creating a DataFrame from CSV
In this recipe, we'll look at how to create a new DataFrame from a delimiter-separated values file.
Note
The code for this recipe can be found at https://github.com/arunma/ScalaDataAnalysisCookbook/blob/master/chapter1-spark-csv/src/main/scala/com/packt/scaladata/spark/csv/DataFrameCSV.scala.
How to do it...
This recipe involves four steps:
1. Add the spark-csv support to our project.
2. Create a Spark Config object that gives information on the environment that we are running Spark in.
3. Create a Spark context that serves as an entry point into Spark. Then, we proceed to create an SQLContext from the Spark context.
4. Load the CSV using the SQLContext.
CSV support isn't first-class in Spark, but it is available through an external library from Databricks. So, let's go ahead and add that to our build.sbt. After adding the spark-csv dependency, our complete build.sbt looks like this:

organization := "com.packt"

name := "chapter1-spark-csv"

scalaVersion := "2.10.4"

val sparkVersion...
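The original listing is truncated above, so the remainder shown here is a sketch of how such a build.sbt is typically completed. The specific version numbers and the spark-csv artifact coordinates are assumptions for illustration, not taken from the original text:

```scala
organization := "com.packt"

name := "chapter1-spark-csv"

scalaVersion := "2.10.4"

// Assumed Spark version for this sketch; the original text truncates here
val sparkVersion = "1.4.1"

libraryDependencies ++= Seq(
  // Core Spark and Spark SQL, cross-built for the Scala version above
  "org.apache.spark" %% "spark-core" % sparkVersion,
  "org.apache.spark" %% "spark-sql"  % sparkVersion,
  // The external Databricks CSV data source (assumed version)
  "com.databricks"   %% "spark-csv"  % "1.4.0"
)
```

The `%%` operator tells sbt to append the Scala binary version (here, 2.10) to the artifact name, which is why a single dependency line works across Scala versions.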
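Putting the four steps together, a minimal program might look like the following sketch. It assumes the Spark 1.x API (SparkConf, SparkContext, and SQLContext) and a local master; the object name and the "StudentData.csv" path are placeholders, not taken from the original text:

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.SQLContext

object DataFrameCSV extends App {

  // Step 2: configure the environment Spark runs in (local mode with 2 threads)
  val conf = new SparkConf().setAppName("csvDataFrame").setMaster("local[2]")

  // Step 3: the SparkContext is the entry point into Spark,
  // and the SQLContext is built from it
  val sc = new SparkContext(conf)
  val sqlContext = new SQLContext(sc)

  // Step 4: load the CSV through the Databricks data source.
  // "StudentData.csv" is a placeholder file path for this sketch.
  val students = sqlContext.read
    .format("com.databricks.spark.csv")
    .option("header", "true") // treat the first line as column names
    .load("StudentData.csv")

  students.printSchema()
  students.show()
}
```

Note that the `sqlContext.read.format(...)` style shown here was introduced in Spark 1.4; on Spark 1.3 the equivalent call is `sqlContext.load("com.databricks.spark.csv", ...)` with an options map.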