New ArrivalsChristmasHoliday Hosting & EntertainingGift IdeasAI Gift FinderClothing, Shoes & AccessoriesToysElectronicsBeautyGift CardsHomeFurnitureCharacter ShopBabyKitchen & DiningGroceryHousehold EssentialsSchool & Office SuppliesVideo GamesMovies, Music & BooksSports & OutdoorsBackpacks & LuggagePersonal CareHealthPetsUlta Beauty at TargetTarget OpticalParty SuppliesClearanceTarget New Arrivals Target Finds #TargetStyleHanukkahStore EventsAsian-Owned Brands at TargetBlack-Owned or Founded Brands at TargetLatino-Owned Brands at TargetWomen-Owned Brands at TargetLGBTQIA+ ShopTop DealsTarget Circle DealsWeekly AdShop Order PickupShop Same Day DeliveryRegistryRedCardTarget CircleFind Stores
Advanced Analytics with Spark - 2nd Edition by  Sandy Ryza & Uri Laserson & Sean Owen & Josh Wills (Paperback) - 1 of 1

Advanced Analytics with Spark - 2nd Edition by Sandy Ryza & Uri Laserson & Sean Owen & Josh Wills (Paperback)

$59.99

In Stock

Eligible for registries and wish lists

Sponsored

About this item

Highlights

  • In the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark.
  • About the Author: Sandy Ryza develops algorithms for public transit at Remix.
  • 277 Pages
  • Computers + Internet, Data Modeling & Design

Description



About the Book



The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by presenting examples and a set of self-contained patterns for performing large-scale data analysis with Spark. You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques-classification, collaborative filtering, and anomaly detection among others-to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find these patterns useful for working on your own data applications.



Book Synopsis



In the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. Updated for Spark 2.1, this edition acts as an introduction to these techniques and other best practices in Spark programming.

You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques--including classification, clustering, collaborative filtering, and anomaly detection--to fields such as genomics, security, and finance.

If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find the book's patterns useful for working on your own data applications.

With this book, you will:

  • Familiarize yourself with the Spark programming model
  • Become comfortable within the Spark ecosystem
  • Learn general approaches in data science
  • Examine complete implementations that analyze large public data sets
  • Discover which machine learning tools make sense for particular problems
  • Acquire code that can be adapted to many uses



About the Author



Sandy Ryza develops algorithms for public transit at Remix. Prior, he was a senior data scientist at Cloudera and Clover Health. He is an Apache Spark committer, Apache Hadoop PMC member, and founder of the Time Series for Spark project. He holds the Brown University computer science department's 2012 Twining award for "Most Chill".

Uri Laserson is an Assistant Professor of Genetics at the Icahn School of Medicine at Mount Sinai, where he develops scalable technology for genomics and immunology using the Hadoop ecosystem.

Sean Owen is Director of Data Science at Cloudera. He is an ApacheSpark committer and PMC member, and was an Apache Mahout committer.

Josh Wills is the Head of Data Engineering at Slack, the founder of the Apache Crunch project, and wrote a tweet about data scientists once.

Dimensions (Overall): 9.1 Inches (H) x 7.0 Inches (W) x .5 Inches (D)
Weight: .9 Pounds
Suggested Age: 22 Years and Up
Number of Pages: 277
Genre: Computers + Internet
Sub-Genre: Data Modeling & Design
Publisher: O'Reilly Media
Format: Paperback
Author: Sandy Ryza & Uri Laserson & Sean Owen & Josh Wills
Featured book lists: Adopted Trade Books
Language: English
Street Date: August 1, 2017
TCIN: 1008292801
UPC: 9781491972953
Item Number (DPCI): 315-00-0336
Origin: Made in the USA or Imported
If the item details aren’t accurate or complete, we want to know about it.

Shipping details

Estimated ship dimensions: 0.5 inches length x 7 inches width x 9.1 inches height
Estimated ship weight: 0.9 pounds
We regret that this item cannot be shipped to PO Boxes.
This item cannot be shipped to the following locations: American Samoa (see also separate entry under AS), Guam (see also separate entry under GU), Northern Mariana Islands, Puerto Rico (see also separate entry under PR), United States Minor Outlying Islands, Virgin Islands, U.S., APO/FPO

Return details

This item can be returned to any Target store or Target.com.
This item must be returned within 30 days of the date it was purchased in store, shipped, delivered by a Shipt shopper, or made ready for pickup.
See the return policy for complete information.

Related Categories

Get top deals, latest trends, and more.

Privacy policy

Footer

About Us

About TargetCareersNews & BlogTarget BrandsBullseye ShopSustainability & GovernancePress CenterAdvertise with UsInvestorsAffiliates & PartnersSuppliersTargetPlus

Help

Target HelpReturnsTrack OrdersRecallsContact UsFeedbackAccessibilitySecurity & FraudTeam Member ServicesLegal & Privacy

Stores

Find a StoreClinicPharmacyTarget OpticalMore In-Store Services

Services

Target Circle™Target Circle™ CardTarget Circle 360™Target AppRegistrySame Day DeliveryOrder PickupDrive UpFree 2-Day ShippingShipping & DeliveryMore Services
PinterestFacebookInstagramXYoutubeTiktokTermsCA Supply ChainPrivacy PolicyCA Privacy RightsYour Privacy ChoicesInterest Based AdsHealth Privacy Policy