Using OpenRefine
图书信息
| 作者 | Ruben Verborgh |
| 出版社 | Packt Publishing |
| ISBN | 9781783289097 |
| 出版时间 | 2013-09-10 |
| 字数 | 35.5万 |
| 分类 | 进口书,外文原版书,电脑,网络 |
读书简介
The book is styled on a Cookbook, containing recipes - combined with free datasets - which will turn readers into proficient OpenRefine users in the fastest possible way.This book is targeted at anyone who works on or handles a large amount of data. No prior knowledge of OpenRefine is required, as we start from the very beginning and gradually reveal more advanced features. You don't even need your own dataset, as we provide example data to try out the book's recipes.
目录
Using OpenRefine
Table of Contents
Using OpenRefine
Credits
Foreword
About the Authors
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers and more
Why Subscribe?
Free Access for Packt account holders
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example files
Errata
Piracy
Questions
1. Diving Into OpenRefine
Introducing OpenRefine
Recipe 1 – installing OpenRefine
Windows
Mac
Linux
Recipe 2 – creating a new project
File formats supported by OpenRefine
Recipe 3 – exploring your data
Recipe 4 – manipulating columns
Collapsing and expanding columns
Moving columns around
Renaming and removing columns
Recipe 5 – using the project history
Recipe 6 – exporting a project
Recipe 7 – going for more memory
Windows
Mac
Linux
Summary
2. Analyzing and Fixing Data
Recipe 1 – sorting data
Reordering rows
Recipe 2 – faceting data
Text facets
Numeric facets
Customized facets
Faceting by star or flag
Recipe 3 – detecting duplicates
Recipe 4 – applying a text filter
Recipe 5 – using simple cell transformations
Recipe 6 – removing matching rows
Summary
3. Advanced Data Operations
Recipe 1 – handling multi-valued cells
Recipe 2 – alternating between rows and records mode
Recipe 3 – clustering similar cells
Recipe 4 – transforming cell values
Recipe 5 – adding derived columns
Recipe 6 – splitting data across columns
Recipe 7 – transposing rows and columns
Summary
4. Linking Datasets
Recipe 1 – reconciling values with Freebase
Recipe 2 – installing extensions
Recipe 3 – adding a reconciliation service
Recipe 4 – reconciling with Linked Data
Recipe 5 – extracting named entities
Summary
A. Regular Expressions and GREL
Regular expressions for text patterns
Character classes
Quantifiers
Anchors
Choices
Groups
Overview
General Refine Expression Language (GREL)
Transforming data
Creating custom facets
Solving problems with GREL
Index
- 难惹(第2卷)(梦萌)
- 2019年全国导游人员资格考试辅导教材-全国导游基础知识(圣才电子书)
- 软件需求最佳实践——SERU过程框架原理与应用(典藏版)(徐锋)
- 00后整顿职场指南(赵雪)
- 有趣的语文:一个语文教师的“另类”行走(凌宗伟)
- 欧洲的转折(郭方)
- AutoCAD 2018中文版完全自学手册(龙马高新教育 策划 教传艳)
- 戒子的诗(戒子)
