Entity resolution is a key analytic technique that enables you to identify multiple data records that refer to the same real-world entity.
Author(s): Michael Shearer
196 Pages
Computers + Internet,
Description
About the Book
"Entity resolution is a key analytic technique that enables you to identify multiple data records that refer to the same real-world entity. With this hands-on guide, product managers, data analysts, and data scientists will learn how to add value to data by cleansing, analyzing, and resolving datasets using open source Python libraries and cloud APIs. Author Michael Shearer shows you how to scale up your data matching processes and improve the accuracy of your reconciliations. You'll be able to remove duplicate entries within a single source and join disparate data sources together when common keys aren't available. Using real-world data examples, this book helps you gain practical understanding to accelerate the delivery of real business value. This book covers: challenges in deduplicating and joining datasets; extracting, cleansing, and preparing datasets for matching; text matching algorithms to identify equivalent entities; techniques for deduplicating and joining datasets at scale; matching datasets containing persons and organizations; optimizing and tuning data matching algorithms; entity resolution using cloud APIs; matching using privacy-enhancing technologies. With entity resolution, you'll build rich and comprehensive data assets that reveal relationships for marketing and risk management purposes, key to harnessing the full potential of machine learning and AI."--
Book Synopsis
Entity resolution is a key analytic technique that enables you to identify multiple data records that refer to the same real-world entity. With this hands-on guide, product managers, data analysts, and data scientists will learn how to add value to data by cleansing, analyzing, and resolving datasets using open source Python libraries and cloud APIs.
Author Michael Shearer shows you how to scale up your data matching processes and improve the accuracy of your reconciliations. You'll be able to remove duplicate entries within a single source and join disparate data sources together when common keys aren't available. Using real-world data examples, this book helps you gain practical understanding to accelerate the delivery of real business value.
With entity resolution, you'll build rich and comprehensive data assets that reveal relationships for marketing and risk management purposes, key to harnessing the full potential of ML and AI. This book covers:
Challenges in deduplicating and joining datasets
Extracting, cleansing, and preparing datasets for matching
Text matching algorithms to identify equivalent entities
Techniques for deduplicating and joining datasets at scale
Matching datasets containing persons and organizations
Evaluating data matches
Optimizing and tuning data matching algorithms
Entity resolution using cloud APIs
Matching using privacy-enhancing technologies
Dimensions (Overall): 9.19 Inches (H) x 7.0 Inches (W) x .42 Inches (D)
Weight: .71 Pounds
Suggested Age: 22 Years and Up
Number of Pages: 196
Genre: Computers + Internet
Publisher: O'Reilly Media
Format: Paperback
Author: Michael Shearer
Language: English
Street Date: March 12, 2024
TCIN: 90417828
UPC: 9781098148485
Item Number (DPCI): 247-02-8819
Origin: Made in the USA or Imported
If the item details aren’t accurate or complete, we want to know about it.
Shipping details
Estimated ship dimensions: 0.42 inches length x 7 inches width x 9.19 inches height
Estimated ship weight: 0.71 pounds
We regret that this item cannot be shipped to PO Boxes.
This item cannot be shipped to the following locations: American Samoa (see also separate entry under AS), Guam (see also separate entry under GU), Northern Mariana Islands, Puerto Rico (see also separate entry under PR), United States Minor Outlying Islands, Virgin Islands, U.S., APO/FPO
Return details
This item can be returned to any Target store or Target.com.
This item must be returned within 90 days of the date it was purchased in store, shipped, delivered by a Shipt shopper, or made ready for pickup.