Normal view MARC view ISBD view

Data algorithms with Spark : recipes and design patterns for scaling up using PySpark (Record no. 31326)

000 -LEADER
fixed length control field	a
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field	230220b xxu\|\|\|\|\| \|\|\|\| 00\| 0 eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number	9789355420787
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number	005.1
Item number	PAR
100 ## - MAIN ENTRY--PERSONAL NAME
Personal name	Parsian, Mahmoud
245 ## - TITLE STATEMENT
Title	Data algorithms with Spark : recipes and design patterns for scaling up using PySpark
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Name of publisher, distributor, etc	Shroff Publishers,
Date of publication, distribution, etc	2022
Place of publication, distribution, etc	Mumbai :
300 ## - PHYSICAL DESCRIPTION
Extent	xx, 412 p.;
Other physical details	ill.
Dimensions	24 cm
365 ## - TRADE PRICE
Price amount	1600.00
Price type code	INR
Unit of pricing	1.00
504 ## - BIBLIOGRAPHY, ETC. NOTE
Bibliography, etc	Includes bibliographical references and index.
520 ## - SUMMARY, ETC.
Summary, etc	Apache Spark's speed, ease of use, sophisticated analytics, and multilanguage support makes practical knowledge of this cluster-computing framework a required skill for data engineers and data scientists. With this hands-on guide, anyone looking for an introduction to Spark will learn practical algorithms and examples using PySpark. In each chapter, author Mahmoud Parsian shows you how to solve a data problem with a set of Spark transformations and algorithms. You'll learn how to tackle problems involving ETL, design patterns, machine learning algorithms, data partitioning, and genomics analysis. Each detailed recipe includes PySpark algorithms using the PySpark driver and shell script. With this book, you will: Learn how to select Spark transformations for optimized solutions Explore powerful transformations and reductions including reduceByKey(), combineByKey(), and mapPartitions() Understand data partitioning for optimized queries Build and apply a model using PySpark design patterns Apply motif-finding algorithms to graph data Analyze graph data by using the GraphFrames API Apply PySpark algorithms to clinical and genomics data Learn how to use and apply feature engineering in ML algorithms Understand and use practical and pragmatic data design patterns.
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element	Computer programming

Topical term or geographic name as entry element	Data Mining

Topical term or geographic name as entry element	Programmation

Topical term or geographic name as entry element	Spark

Topical term or geographic name as entry element	Open source tools

Topical term or geographic name as entry element	Associative law

Topical term or geographic name as entry element	Binning

Topical term or geographic name as entry element	FlatMap

Topical term or geographic name as entry element	Input-Filter-Output design pattern

Topical term or geographic name as entry element	Join design patterns

Topical term or geographic name as entry element	Monoids

Topical term or geographic name as entry element	Motif finding feature

Topical term or geographic name as entry element	One-hot encoding

Topical term or geographic name as entry element	PageRank algorithm

Topical term or geographic name as entry element	Pipelines

Topical term or geographic name as entry element	Transformation algorithms
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme
Item type	Books

Holdings
Withdrawn status	Lost status	Source of classification or shelving scheme	Damaged status	Not for loan	Permanent location	Current location	Date acquired	Cost, normal purchase price	Full call number	Barcode	Date last seen	Koha item type
					DAIICT	DAIICT	2023-02-20	1600.00	005.1 PAR	033445	2023-02-20	Books

Koha online

Data algorithms with Spark : recipes and design patterns for scaling up using PySpark (Record no. 31326)