Using OpenRefine
图书信息
| 作者 | Ruben Verborgh |
| 出版社 | Packt Publishing |
| ISBN | 9781783289097 |
| 出版时间 | 2013-09-10 |
| 字数 | 35.5万 |
| 分类 | 进口书,外文原版书,电脑,网络 |
读书简介
The book is styled on a Cookbook, containing recipes - combined with free datasets - which will turn readers into proficient OpenRefine users in the fastest possible way.This book is targeted at anyone who works on or handles a large amount of data. No prior knowledge of OpenRefine is required, as we start from the very beginning and gradually reveal more advanced features. You don't even need your own dataset, as we provide example data to try out the book's recipes.
目录
Using OpenRefine
Table of Contents
Using OpenRefine
Credits
Foreword
About the Authors
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers and more
Why Subscribe?
Free Access for Packt account holders
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example files
Errata
Piracy
Questions
1. Diving Into OpenRefine
Introducing OpenRefine
Recipe 1 – installing OpenRefine
Windows
Mac
Linux
Recipe 2 – creating a new project
File formats supported by OpenRefine
Recipe 3 – exploring your data
Recipe 4 – manipulating columns
Collapsing and expanding columns
Moving columns around
Renaming and removing columns
Recipe 5 – using the project history
Recipe 6 – exporting a project
Recipe 7 – going for more memory
Windows
Mac
Linux
Summary
2. Analyzing and Fixing Data
Recipe 1 – sorting data
Reordering rows
Recipe 2 – faceting data
Text facets
Numeric facets
Customized facets
Faceting by star or flag
Recipe 3 – detecting duplicates
Recipe 4 – applying a text filter
Recipe 5 – using simple cell transformations
Recipe 6 – removing matching rows
Summary
3. Advanced Data Operations
Recipe 1 – handling multi-valued cells
Recipe 2 – alternating between rows and records mode
Recipe 3 – clustering similar cells
Recipe 4 – transforming cell values
Recipe 5 – adding derived columns
Recipe 6 – splitting data across columns
Recipe 7 – transposing rows and columns
Summary
4. Linking Datasets
Recipe 1 – reconciling values with Freebase
Recipe 2 – installing extensions
Recipe 3 – adding a reconciliation service
Recipe 4 – reconciling with Linked Data
Recipe 5 – extracting named entities
Summary
A. Regular Expressions and GREL
Regular expressions for text patterns
Character classes
Quantifiers
Anchors
Choices
Groups
Overview
General Refine Expression Language (GREL)
Transforming data
Creating custom facets
Solving problems with GREL
Index
- 中国资本市场:重塑生态链(吴晓求 等)
- 新手学Dreamweaver CS6+Flash CS6+Photoshop CS6网页设计(实例版)(全彩)(含DVD光盘1张)(鼎翰文化)
- “新时代万有文库”公羊传(刘跃进)
- PHP入门很轻松(微课超值版)(云尚科技)
- 未解之谜(下)(百读)
- Once Upon a Christmas (Mills & Boon Love Inspired)(Pamela Tracy)
- 永无止尽的狂热:三岛由纪夫(杨照)
- 365夜亲子共读:写给男孩子的经典智慧故事全集(秦茵)
