EasterBlack-owned or founded brands at TargetGroceryClothing, Shoes & AccessoriesBabyHomeFurnitureKitchen & DiningOutdoor Living & GardenToysElectronicsVideo GamesMovies, Music & BooksSports & OutdoorsBeautyPersonal CareHealthPetsHousehold EssentialsArts, Crafts & SewingSchool & Office SuppliesParty SuppliesLuggageGift IdeasGift CardsClearanceTarget New ArrivalsTarget Finds#TargetStyleTop DealsTarget Circle DealsWeekly AdShop Order PickupShop Same Day DeliveryRegistryRedCardTarget CircleFind Stores

Sponsored

Learning Spark - 2nd Edition by Jules S Damji & Brooke Wenig & Tathagata Das & Denny Lee (Paperback)

Learning Spark - 2nd Edition by  Jules S Damji & Brooke Wenig & Tathagata Das & Denny Lee (Paperback) - 1 of 1
$43.99 sale price when purchased online
$79.99 list price
Target Online store #3991

About this item

Highlights

  • Data is bigger, arrives faster, and comes in a variety of formatsâ and it all needs to be processed at scale for analytics or machine learning.
  • About the Author: Jules S. Damji is a senior developer advocate at Databricks and an MLflow contributor.
  • 397 Pages
  • Computers + Internet, Data Processing

Description



About the Book



"Data is bigger, arrives faster, and comes in a variety of formats-- and it all needs to be processed at scale for analytics or machine learning. But how can you process such varied workloads efficiently? Enter Apache Spark. Updated to include Spark 3.0, this second edition shows data engineers and data scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine learning algorithms."--Page 4 of cover.



Book Synopsis



Data is bigger, arrives faster, and comes in a variety of formatsâ and it all needs to be processed at scale for analytics or machine learning. But how can you process such varied workloads efficiently? Enter Apache Spark.

Updated to include Spark 3.0, this second edition shows data engineers and data scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine learning algorithms. Through step-by-step walk-throughs, code snippets, and notebooks, youâ ll be able to:

  • Learn Python, SQL, Scala, or Java high-level Structured APIs
  • Understand Spark operations and SQL Engine
  • Inspect, tune, and debug Spark operations with Spark configurations and Spark UI
  • Connect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or Kafka
  • Perform analytics on batch and streaming data using Structured Streaming
  • Build reliable data pipelines with open source Delta Lake and Spark
  • Develop machine learning pipelines with MLlib and productionize models using MLflow



About the Author



Jules S. Damji is a senior developer advocate at Databricks and an MLflow contributor. He is a hands-on developer with over 20 years of experience and has worked as a software engineer at leading companies such as Sun Microsystems, Netscape, @Home, Loudcloud/Opsware, Verisign, ProQuest, and Hortonworks, building large scale distributed systems. He holds a B.Sc. and an M.Sc. in computer science and an MA in political advocacy and communication from Oregon State University, Cal State, and Johns Hopkins University, respectively.

Brooke Wenig is a machine learning practice lead at Databricks. She leads a team of data scientists who develop large-scale machine learning pipelines for customers, as well as teaching courses on distributed machine learning best practices. Previously, she was a principal data science consultant at Databricks. She holds an M.S. in computer science from UCLA with a focus on distributed machine learning.

Tathagata Das is a staff software engineer at Databricks, an Apache Spark committer, and a member of the Apache Spark Project Management Committee (PMC). He is one of the original developers of Apache Spark, the lead developer of Spark Streaming (DStreams), and is currently one of the core developers of Structured Streaming and Delta Lake. Tathagata holds an M.S. in computer science from UC Berkeley.

Denny Lee is a staff developer advocate at Databricks who has been working with Apache Spark since 0.6. He is a hands-on distributed systems and data sciences engineer with extensive experience developing internet-scale infrastructure, data platforms, and predictive analytics systems for both on-premises and cloud environments. He also has an M.S. in biomedical informatics from Oregon Health and Sciences University and has architected and implemented powerful data solutions for enterprise healthcare customers.

Dimensions (Overall): 9.2 Inches (H) x 7.0 Inches (W) x .9 Inches (D)
Weight: 1.4 Pounds
Suggested Age: 22 Years and Up
Number of Pages: 397
Genre: Computers + Internet
Sub-Genre: Data Processing
Publisher: O'Reilly Media
Format: Paperback
Author: Jules S Damji & Brooke Wenig & Tathagata Das & Denny Lee
Language: English
Street Date: August 25, 2020
TCIN: 83300205
UPC: 9781492050049
Item Number (DPCI): 247-56-7971
Origin: Made in the USA or Imported
If the item details above aren’t accurate or complete, we want to know about it.

Shipping details

Estimated ship dimensions: 0.9 inches length x 7 inches width x 9.2 inches height
Estimated ship weight: 1.4 pounds
We regret that this item cannot be shipped to PO Boxes.
This item cannot be shipped to the following locations: American Samoa (see also separate entry under AS), Guam (see also separate entry under GU), Northern Mariana Islands, Puerto Rico (see also separate entry under PR), United States Minor Outlying Islands, Virgin Islands, U.S., APO/FPO

Return details

This item can be returned to any Target store or Target.com.
This item must be returned within 90 days of the date it was purchased in store, shipped, delivered by a Shipt shopper, or made ready for pickup.
See the return policy for complete information.

Guests also viewed

Bash Idioms - by  Carl Albing & Jp Vossen (Paperback)

$32.49
MSRP $55.99
Buy 1 get 1 50% off books, movies, games & activity toys

The Book of PF, 3rd Edition - by  Peter N M Hansteen (Paperback)

$34.99
Buy 1 get 1 50% off books, movies, games & activity toys

Building Event-Driven Microservices - by  Adam Bellemare (Paperback)

$36.49
MSRP $65.99
Buy 1 get 1 50% off books, movies, games & activity toys

Efficient Go - by  Bartlomiej Plotka (Paperback)

$36.99
MSRP $65.99
Buy 1 get 1 50% off books, movies, games & activity toys

Discover more options

Data Algorithms with Spark - by  Mahmoud Parsian (Paperback)

$62.99
MSRP $79.99
Buy 1 get 1 50% off books, movies, games & activity toys

Scaling Machine Learning with Spark - by  Adi Polak (Paperback)

$71.87
MSRP $79.99
Buy 1 get 1 50% off books, movies, games & activity toys

Security as Code - by  Bk Sarthak Das & Virginia Chu (Paperback)

$30.99
MSRP $55.99
Buy 1 get 1 50% off books, movies, games & activity toys

The Spark - by  Jules Wake (Paperback)

$19.99
Buy 1 get 1 50% off books, movies, games & activity toys

Get top deals, latest trends, and more.

Privacy policy

Footer

About Us

About TargetCareersNews & BlogTarget BrandsBullseye ShopSustainability & GovernancePress CenterAdvertise with UsInvestorsAffiliates & PartnersSuppliersTargetPlus

Help

Target HelpReturnsTrack OrdersRecallsContact UsFeedbackAccessibilitySecurity & FraudTeam Member Services

Stores

Find a StoreClinicPharmacyOpticalMore In-Store Services

Services

Target Circle™Target Circle™ CardTarget Circle 360™Target AppRegistrySame Day DeliveryOrder PickupDrive UpFree 2-Day ShippingShipping & DeliveryMore Services
PinterestFacebookInstagramXYoutubeTiktokTermsCA Supply ChainPrivacyCA Privacy RightsYour Privacy ChoicesInterest Based AdsHealth Privacy Policy