Pervasive DataRush Profiler Module
The Pervasive DataRush Profiler module helps you profile your data based on a set of quality metrics that you control. Forget time-consuming, error-prone manual checking for data assurance.
There’s a lot riding on your data. Make sure it’s accurate.
DataRush Product Architecture
Capabilities
- Intuitive API capable of specifying a set of pre-defined and user-defined metrics to execute on a data source
- Splits input data into clean and dirty data streams according to the configured metrics
- Configurable outputs including an object model, embedded database, XML, or PDF
- Extensive set of quality metrics such as field comparison, is blank, is null, is value contained in lookup and a regex matcher
- Statistical metrics such as min, max, mode, median, standard deviation and variance
- Data discovery metrics such as equal range binning with outlier handling, most frequent values, distinct values, data ranges and quantiles
- Extend with user-defined metrics written in an easy to use scripting language