SAP Data Services 4.x Cookbook
图书信息
| 作者 | Ivan Shomnikov |
| 出版社 | Packt Publishing |
| ISBN | 9781782176572 |
| 出版时间 | 2015-12-03 |
| 字数 | 272.0万 |
| 分类 | 进口书,外文原版书,电脑,网络 |
读书简介
Delve into the SAP Data Services environment to efficiently prepare, implement, and develop ETL processes About This Book Install and configure the SAP Data Services environment Develop ETL techniques in the Data Services environment Implement real-life examples of Data Services uses through step-by-step instructions to perform specific ETL development tasks Who This Book Is For This book is for IT technical engineers who want to get familiar with the EIM solutions provided by SAP for ETL development and data quality management. The book requires familiarity with basic programming concepts and basic knowledge of the SQL language. What You Will Learn Install, configure, and administer the SAP Data Services components Run through the ETL design basics Maximize the performance of your ETL with the advanced patterns in Data Services Extract methods from various databases and systems Get familiar with the transformation methods available in SAP Data Services Load methods into various databases and systems Code with the Data Services *ing language Validate and cleanse your data, applying the Data quality methods of the Information Steward In Detail Want to cost effectively deliver trusted information to all of your crucial business functionsSAP Data Services delivers one enterprise-class solution for data integration, data quality, data profiling, and text data processing. It boosts productivity with a single solution for data quality and data integration. SAP Data Services also enables you to move, improve, govern, and unlock big data. This book will lead you through the SAP Data Services environment to efficiently develop ETL processes. To begin with, you’ll learn to install, configure, and prepare the ETL development environment. You will get familiarized with the concepts of developing ETL processes with SAP Data Services. Starting from smallest unit of work- the data flow, the chapters will lead you to the highest organizational unit—the Data Services job, revealing the advanced techniques of ETL design.You will learn to import XML files by creating and implementing real-time jobs. It will then guide you through the ETL development patterns that enable the most effective performance when extracting, transforming, and loading data. You will also find out how to create validation functions and transforms.Finally, the book will show you the benefits of data quality management with the help of another SAP solution—Information Steward.Style and approach This book is an easy-to-follow guide with step-by-step instructions to perform specific ETL development tasks.
目录
SAP Data Services 4.x Cookbook
Table of Contents
SAP Data Services 4.x Cookbook
Credits
About the Author
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers, and more
Why subscribe?
Free access for Packt account holders
Instant updates on new Packt books
Preface
What this book covers
What you need for this book
Who this book is for
Sections
Getting ready
How to do it…
How it works…
There's more…
See also
Conventions
Reader feedback
Customer support
Downloading the example code
Downloading the color images of this book
Errata
Piracy
Questions
1. Introduction to ETL Development
Introduction
Preparing a database environment
Getting ready
How to do it…
How it works…
Creating a source system database
How to do it…
How it works…
There's more…
Defining and creating staging area structures
How to do it…
Flat files
RDBMS tables
How it works…
Creating a target data warehouse
Getting ready
How to do it…
How it works…
There's more…
2. Configuring the Data Services Environment
Introduction
Creating IPS and Data Services repositories
Getting ready…
How to do it…
How it works…
See also
Installing and configuring Information Platform Services
Getting ready…
How to do it…
How it works…
Installing and configuring Data Services
Getting ready…
How to do it…
How it works…
Configuring user access
Getting ready…
How to do it…
How it works…
Starting and stopping services
How to do it…
How it works…
See also
Administering tasks
How to do it…
How it works…
See also
Understanding the Designer tool
Getting ready…
How to do it…
How it works…
Executing ETL code in Data Services
Validating ETL code
Template tables
Query transform basics
The HelloWorld example
3. Data Services Basics – Data Types, Scripting Language, and Functions
Introduction
Creating variables and parameters
Getting ready
How to do it…
How it works…
There's more…
Creating a script
How to do it…
How it works…
Using string functions
How to do it…
Using string functions in the script
How it works…
There's more…
Using date functions
How to do it…
Generating current date and time
Extracting parts from dates
How it works…
There's more…
Using conversion functions
How to do it…
How it works…
There's more…
Using database functions
How to do it…
key_generation()
total_rows()
sql()
How it works…
Using aggregate functions
How to do it…
How it works…
Using math functions
How to do it…
How it works…
There's more…
Using miscellaneous functions
How to do it…
How it works…
Creating custom functions
How to do it…
How it works…
There's more…
4. Dataflow – Extract, Transform, and Load
Introduction
Creating a source data object
How to do it…
How it works…
There's more…
Creating a target data object
Getting ready
How to do it…
How it works…
There's more…
Loading data into a flat file
How to do it…
How it works…
There's more…
Loading data from a flat file
How to do it…
How it works…
There's more…
Loading data from table to table – lookups and joins
How to do it…
How it works…
Using the Map_Operation transform
How to do it…
How it works…
Using the Table_Comparison transform
Getting ready
How to do it…
How it works…
Exploring the Auto correct load option
Getting ready
How to do it…
How it works…
Splitting the flow of data with the Case transform
Getting ready
How to do it…
How it works…
Monitoring and analyzing dataflow execution
Getting ready
How to do it…
How it works…
There's more…
5. Workflow – Controlling Execution Order
Introduction
Creating a workflow object
How to do it…
How it works…
Nesting workflows to control the execution order
Getting ready
How to do it
How it works…
Using conditional and while loop objects to control the execution order
Getting ready
How to do it…
How it works…
There is more…
Using the bypassing feature
Getting ready…
How to do it…
How it works…
There is more…
Controlling failures – try-catch objects
How to do it…
How it works…
Use case example – populating dimension tables
Getting ready
How to do it…
How it works…
Mapping
Dependencies
Development
Execution order
Testing ETL
Preparing test data to populate DimSalesTerritory
Preparing test data to populate DimGeography
Using a continuous workflow
How to do it…
How it works…
There is more…
Peeking inside the repository – parent-child relationships between Data Services objects
Getting ready
How to do it…
How it works…
Get a list of object types and their codes in the Data Services repository
Display information about the DF_Transform_DimGeography dataflow
Display information about the SalesTerritory table object
See the contents of the script object
6. Job – Building the ETL Architecture
Introduction
Projects and jobs – organizing ETL
Getting ready
How to do it…
How it works…
Hierarchical object view
History execution log files
Executing/scheduling jobs from the Management Console
Using object replication
How to do it…
How it works…
Migrating ETL code through the central repository
Getting ready
How to do it…
How it works…
Adding objects to and from the Central Object Library
Comparing objects between the Local and Central repositories
There is more…
Migrating ETL code with export/import
Getting ready
How to do it…
Import/Export using ATL files
Direct export to another local repository
How it works…
Debugging job execution
Getting ready…
How to do it…
How it works…
Monitoring job execution
Getting ready
How to do it…
How it works…
Building an external ETL audit and audit reporting
Getting ready…
How to do it…
How it works…
Using built-in Data Services ETL audit and reporting functionality
Getting ready
How to do it…
How it works…
Auto Documentation in Data Services
How to do it…
How it works…
7. Validating and Cleansing Data
Introduction
Creating validation functions
Getting ready
How to do it…
How it works…
Using validation functions with the Validation transform
Getting ready
How to do it…
How it works…
Reporting data validation results
Getting ready
How to do it…
How it works…
Using regular expression support to validate data
Getting ready
How to do it…
How it works…
Enabling dataflow audit
Getting ready
How to do it…
How it works…
There's more…
Data Quality transforms – cleansing your data
Getting ready
How to do it…
How it works…
There's more…
8. Optimizing ETL Performance
Introduction
Optimizing dataflow execution – push-down techniques
Getting ready
How to do it…
How it works…
Optimizing dataflow execution – the SQL transform
How to do it…
How it works…
Optimizing dataflow execution – the Data_Transfer transform
Getting ready
How to do it…
How it works…
Why we used a second Data_Transfer transform object
When to use Data_Transfer transform
There's more…
Optimizing dataflow readers – lookup methods
Getting ready
How to do it…
Lookup with the Query transform join
Lookup with the lookup_ext() function
Lookup with the sql() function
How it works…
Query transform joins
lookup_ext()
sql()
Performance review
Optimizing dataflow loaders – bulk-loading methods
How to do it…
How it works…
When to enable bulk loading?
Optimizing dataflow execution – performance options
Getting ready
How to do it…
Dataflow performance options
Source table performance options
Query transform performance options
lookup_ext() performance options
Target table performance options
9. Advanced Design Techniques
Introduction
Change Data Capture techniques
Getting ready
No history SCD (Type 1)
Limited history SCD (Type 3)
Unlimited history SCD (Type 2)
How to do it…
How it works…
Source-based ETL CDC
Target-based ETL CDC
Native CDC
Automatic job recovery in Data Services
Getting ready
How to do it…
How it works…
There's more…
Simplifying ETL execution with system configurations
Getting ready
How to do it…
How it works…
Transforming data with the Pivot transform
Getting ready
How to do it…
How it works…
10. Developing Real-time Jobs
Introduction
Working with nested structures
Getting ready
How to do it…
How it works…
There is more…
The XML_Map transform
Getting ready
How to do it…
How it works…
The Hierarchy_Flattening transform
Getting ready
How to do it…
Horizontal hierarchy flattening
Vertical hierarchy flattening
How it works…
Querying result tables
Configuring Access Server
Getting ready
How to do it…
How it works…
Creating real-time jobs
Getting ready
Installing SoapUI
How to do it…
How it works…
11. Working with SAP Applications
Introduction
Loading data into SAP ERP
Getting ready
How to do it…
How it works…
IDoc
Monitoring IDoc load on the SAP side
Post-load validation of loaded data
There is more…
12. Introduction to Information Steward
Introduction
Exploring Data Insight capabilities
Getting ready
How to do it…
Creating a connection object
Profiling the data
Viewing profiling results
Creating a validation rule
Creating a scorecard
How it works…
Profiling
Rules
Scorecards
There is more…
Performing Metadata Management tasks
Getting ready
How to do it…
How it works…
Working with the Metapedia functionality
How to do it…
How it works…
Creating a custom cleansing package with Cleansing Package Builder
Getting ready
How to do it…
How it works…
There is more…
Index
- 饿兔子跳(孙家宇)
- 爱情下一秒(沈星妤)
- PHP入门很轻松(微课超值版)(云尚科技)
- Once Upon a Christmas (Mills & Boon Love Inspired)(Pamela Tracy)
- 软装设计师手册(简明敏)
- AutoCAD 2018中文版完全自学手册(龙马高新教育 策划 教传艳)
- 赢在思维——初中化学拉分题专项集训300题(9年级+中考)(彭嘉全)
- 365夜亲子共读:写给男孩子的经典智慧故事全集(秦茵)
