You're reading from Scala for Data Science Leverage the power of Scala with different tools to build scalable, robust data science applications

Product type Paperback

Published in Jan 2016

Publisher

ISBN-13 9781785281372

Length 416 pages

Edition 1st Edition

Languages

Scala

Concepts

Application Development

Author (1):

Bugnion

Table of Contents (22) Chapters

Scala for Data Science

Credits

About the Author

About the Reviewers

www.PacktPub.com

Preface

1. Scala and Data Science

2. Manipulating Data with Breeze

3. Plotting with breeze-viz

4. Parallel Collections and Futures

5. Scala and SQL through JDBC

6. Slick – A Functional Interface for SQL

7. Web APIs

8. Scala and MongoDB

9. Concurrency with Akka

10. Distributed Batch Processing with Spark

11. Spark SQL and DataFrames

12. Distributed Machine Learning with MLlib

13. Web APIs with Play

14. Visualization with D3 and the Play Framework

Pattern Matching and Extractors

Index

Reference

Learning Spark, by Holden Karau, Andy Konwinski, Patrick Wendell, and Matei Zaharia (O'Reilly), provides a much more complete introduction to Spark than this chapter can provide. I thoroughly recommend it.
If you are interested in learning more about information theory, I recommend David MacKay's book Information Theory, Inference, and Learning Algorithms.
Information Retrieval, by Manning, Raghavan, and Schütze, describes how to analyze textual data (including lemmatization and stemming). An online
On the Ling-Spam dataset, and how to analyze it: http://www.aueb.gr/users/ion/docs/ir_memory_based_antispam_filtering.pdf.
This blog post delves into the Spark Web UI in more detail: https://databricks.com/blog/2015/06/22/understanding-your-spark-application-through-visualization.html.
This blog post, by Sandy Ryza, is the first in a two-part series discussing Spark internals, and how to leverage them to improve performance: http://blog.cloudera.com/blog/2015/03/how-to-tune-your-apache...
