logo
Product categories

EbookNice.com

Most ebook files are in PDF format, so you can easily read them using various software such as Foxit Reader or directly on the Google Chrome browser.
Some ebook files are released by publishers in other formats such as .awz, .mobi, .epub, .fb2, etc. You may need to install specific software to read these formats on mobile/PC, such as Calibre.

Please read the tutorial at this link.  https://ebooknice.com/page/post?id=faq


We offer FREE conversion to the popular formats you request; however, this may take some time. Therefore, right after payment, please email us, and we will try to provide the service as quickly as possible.


For some exceptional file formats or broken links (if any), please refrain from opening any disputes. Instead, email us first, and we will try to assist within a maximum of 6 hours.

EbookNice Team

(Ebook) Data Algorithms with Spark: Recipes and Design Patterns for Scaling Up using PySpark (Early Release) by Mahmoud Parsian ISBN 9781492082316, 9781492082385, 1492082317, 1492082384

  • SKU: EBN-34605222
Zoomable Image
$ 32 $ 40 (-20%)

Status:

Available

4.7

36 reviews
Instant download (eBook) Data Algorithms with Spark: Recipes and Design Patterns for Scaling Up using PySpark (Early Release) after payment.
Authors:Mahmoud Parsian
Pages:500 pages.
Year:2021
Editon:1 / 2021-09-10 Fourth Early Release
Publisher:O'Reilly Media
Language:english
File Size:9.73 MB
Format:epub
ISBNS:9781492082316, 9781492082385, 1492082317, 1492082384
Categories: Ebooks

Product desciption

(Ebook) Data Algorithms with Spark: Recipes and Design Patterns for Scaling Up using PySpark (Early Release) by Mahmoud Parsian ISBN 9781492082316, 9781492082385, 1492082317, 1492082384

Apache Spark's speed, ease of use, sophisticated analytics, and multilanguage support makes practical knowledge of this cluster-computing framework a required skill for data engineers and data scientists. With this hands-on guide, anyone looking for an introduction to Spark will learn practical algorithms and examples using PySpark.

In each chapter, author Mahmoud Parsian shows you how to solve a data problem with a set of Spark transformations and algorithms. You'll learn how to tackle problems involving ETL, design patterns, machine learning algorithms, data partitioning, and genomics analysis. Each detailed recipe includes PySpark algorithms using the PySpark driver and shell script.

With this book, you will:

  • Learn how to select Spark transformations for optimized solutions
  • Explore powerful transformations and reductions including reduceByKey(), combineByKey(), and mapPartitions()
  • Understand data partitioning for optimized queries
  • Design machine learning algorithms including Naive Bayes, linear regression, and logistic regression
  • Build and apply a model using PySpark design patterns
  • Apply motif-finding algorithms to graph data
  • Analyze graph data by using the GraphFrames API
  • Apply PySpark algorithms to clinical and genomics data (such as DNA-Seq)

*Free conversion of into popular formats such as PDF, DOCX, DOC, AZW, EPUB, and MOBI after payment.

Related Products