2 About SAP Data Services
SAP Data Services delivers a single enterprise-class solution for data integration, data quality, data profiling and
text data processing that allows you to integrate, transform, improve and deliver trusted data to critical business
processes. It provides one development UI, metadata repository, data connectivity layer, run-time environment
and management console – enabling IT organizations to lower total cost of ownership and accelerate time to
value. With SAP Data Services, IT organizations can maximize operational efficiency with a single solution to
improve data quality and gain access to heterogeneous sources and applications.
2.1 SAP Data Integrator
SAP Data Integrator allows you to integrate data from multiple, differing sources.
● Easy-to-configure transforms for typically complex tasks such as slowly changing dimensions and hierarchy
flattening
● Everything you need to build large jobs including error handling, dependency handling, and restart-ability
● Extensive operational statistics
● Rich connectivity to many sources and targets - most using the vendor’s native format for maximum
performance
● Easy-to-use parallelization and performance optimization options
● Features that simplify daily operations and project handover, such as a web-based management console,
auto-documentation, and impact and lineage information
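To make the "hierarchy flattening" task mentioned above more concrete, the following is a minimal Python sketch of the general idea: turning parent-child pairs into one row per ancestor-descendant pair with its depth. It is not the product's transform or API. The function name `flatten_hierarchy` and the sample region data are hypothetical, and the sketch assumes the hierarchy contains no cycles.

```python
def flatten_hierarchy(edges):
    """Return sorted (ancestor, descendant, depth) rows for a parent-child hierarchy.

    `edges` is an iterable of (parent, child) pairs. Assumes no cycles.
    """
    # Group children under each parent.
    children = {}
    for parent, child in edges:
        children.setdefault(parent, []).append(child)

    rows = []

    def walk(ancestor, node, depth):
        # Record every descendant of `ancestor` reachable through `node`.
        for child in children.get(node, []):
            rows.append((ancestor, child, depth + 1))
            walk(ancestor, child, depth + 1)

    # Start a walk from every node that has at least one child.
    for node in children:
        walk(node, node, 0)
    return sorted(rows)


# Hypothetical sample data: a small region hierarchy.
edges = [("World", "EMEA"), ("EMEA", "Germany"), ("EMEA", "France")]
flat = flatten_hierarchy(edges)
```

With this sample input, `flat` holds five rows, including `("World", "Germany", 2)`, which lets a query find all descendants of a node without recursion.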
2.2 SAP Text Data Processing
SAP Text Data Processing enables you to analyze text in detail.
● Analyzes text and automatically identifies and extracts entities including people, dates, places, organizations
and so on, in multiple languages
● Looks for patterns, activities, events, and relationships among entities and enables their extraction
● Goes beyond conventional character matching tools for information retrieval, which can only seek exact
matches for specific strings; understands the semantics of words
● Supports extraction in 31 different languages
● Supports not only text, HTML, and XML but binary document formats such as PDF and Microsoft Word
● Allows you to specify your own list of entities in a custom dictionary, which enables you to store entities and
manage name variations; known entity names can be standardized using a dictionary
● Lets you write rules to customize extraction output; predefined rules are also provided to support sentiment
analysis, enterprises, and the public sector
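The custom dictionary idea described above, storing entities with their name variations and standardizing known names, can be illustrated with a small Python sketch. This is only a conceptual illustration under stated assumptions, not the product's dictionary format or extraction engine: the dictionary contents, `build_lookup`, and `extract_entities` are all hypothetical, and matching here is simple case-insensitive lookup rather than semantic analysis.

```python
import re

# Hypothetical custom dictionary: standard form -> known name variations.
CUSTOM_DICTIONARY = {
    "International Business Machines": ["IBM", "Big Blue"],
    "SAP SE": ["SAP AG", "SAP"],
}


def build_lookup(dictionary):
    """Map every lowercased variant (and the standard form itself) to the standard form."""
    lookup = {}
    for standard, variants in dictionary.items():
        for name in variants + [standard]:
            lookup[name.lower()] = standard
    return lookup


def extract_entities(text, dictionary):
    """Return (matched text, standard form) pairs in order of appearance."""
    lookup = build_lookup(dictionary)
    # Try longer variants first so "SAP AG" is preferred over "SAP".
    variants = sorted(lookup, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(v) for v in variants) + r")\b",
        re.IGNORECASE,
    )
    return [(m.group(0), lookup[m.group(0).lower()]) for m in pattern.finditer(text)]


entities = extract_entities("IBM and Big Blue met SAP AG.", CUSTOM_DICTIONARY)
```

Here both "IBM" and "Big Blue" resolve to the same standard form, which is the kind of name-variation management the dictionary bullet refers to.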