Data quality management using business process modeling -> Monitor Keywords
Fresh Patents
Monitor Patents Patent Organizer How to File a Provisional Patent Browse Inventors Browse Industry Browse Agents Browse Locations
site info Site News  |  monitor Monitor Keywords  |  monitor archive Monitor Archive  |  organizer Organizer  |  account info Account Info  |  
08/23/07 - USPTO Class 705 |  98 views | #20070198312 | Prev - Next | About this Page  705 rss/xml feed  monitor keywords

Data quality management using business process modeling

USPTO Application #: 20070198312
Title: Data quality management using business process modeling
Abstract: A business process modeling framework is used for data quality analysis. The modeling framework represents the sources of transactions entering the information processing system, the various tasks within the process that manipulate or transform these transactions, and the data repositories in which the transactions are stored or aggregated. A subset of these tasks is associated as the potential error introduction sources, and the rate and magnitude of various error classes at each such task are probabilistically modeled. This model can be used to predict how changes in transactions volumes and business processes impact data quality at the aggregate level in the data repositories. The model can also account for the presence of error correcting controls and assess how the placement and effectiveness of these controls alter the propagation and aggregation of errors. Optimization techniques are used for the placement of error correcting controls that meet target quality requirements while minimizing the cost of operating these controls. This analysis also contributes to the development of business “dashboards” that allow decision-makers to monitor and react to key performance indicators (KPIs) based on aggregation of the transactions being processed. Data quality estimation in real time provides the accuracy of these KPIs (in terms of the probability that a KPI is above or below a given value), which may condition the action undertaken by the decision-maker.
(end of abstract)
Agent: Whitham, Curtis, & Christofferson, P.C. Suite 340 - Reston, VA, US
Inventors: Sugato Bagchi, Xue Bai, Jayant Ramarao Kalagnanam
USPTO Applicaton #: 20070198312 - Class: 705007000 (USPTO)

Related Patent Categories: Data Processing: Financial, Business Practice, Management, Or Cost/price Determination, Automated Electrical Financial Or Business Practice Or Management Arrangement, Operations Research
The Patent Description & Claims data below is from USPTO Patent Application 20070198312.
Brief Patent Description - Full Patent Description - Patent Application Claims  monitor keywords

BACKGROUND OF THE INVENTION

[0001] 1. Field of the Invention

[0002] The present application generally relates to modeling and quantitative analysis techniques for managing the quality of data and, more particularly, to extending a business process model with constructs to identify the sources data whose quality is of interest, the data transformative tasks where error may be introduced, the error detection and correction controls in the process, and the data repositories whose quality is to be assessed.

[0003] 2. Background Description

[0004] As companies increasingly adopt information systems that cover a range of functional areas, they have electronic access to vast amounts of transactional data. Increasingly companies are looking develop dashboards where a variety of key performance indicators that are composed from the transactional data are displayed to assist to business decisions. The quality of data contained in these enterprise information systems has important consequences, both from the internal perspective of making business decisions based on the data as well as the legal obligation to provide accurate reporting to external agencies and stakeholders. As a result, companies spend considerable time and money to assess and improve the quality of data in the transactions that flow through its information systems and are stored in its repositories.

[0005] A considerable body of literature exists on the issue of data quality assessment from the perspective of auditing a given information processing system. The prior work on data quality management comes from the fields of financial accounting and auditing and information systems.

[0006] Data quality and control assessment has been studied in accounting literature since the early 1970s. Most of the studies have approached reliability assessment with the accounting system viewed as a "black box" that transforms data into aggregations of account balances contained in various ledgers (see, for example, W. R. Knechel, "The use of Quantitative Models in the Review and Evaluation of Internal Control: A Survey and Review", Journal of Accounting Literature, (Vol. 2), Spring 1983:205-219). This approach works well from the perspective of an auditor who is interested in assessing the reliability with which the black box performs the data transformations. We review this literature to make note of the key concepts, definitions, and analyses that we adopt and extend in order to develop data quality modeling and analysis techniques at the detailed level of the transformational tasks and processes that are contained within the accounting system.

[0007] B. E. Cushing in "A Mathematic Approach of the Analysis and Design of Internal Control Systems" in The Accounting Review 1974, pp. 24-41, developed a mathematic formulation for measuring the reliability for an accounting system. He used the probability that the system makes no errors of any kind in its outputs as the system reliability measure. He also derived a cost measurement by taking into consideration of the cost of executing error correction controls and the risk of undetected errors in the system. It is useful in the sense of evaluating the reliability assessment of a given system. However, Cushing's control model takes the system structure as given; it does not address any problem from the system design perspective. We apply the same basic concepts of reliability and cost measurement to the problems of evaluating system reliability for a detailed process model and to design the optimal set of corrective controls with the objective of cost minimization.

[0008] S. S. Hamlen in "A Chance-Constrained Mix Integer Programming Model for Internal Control Systems", The Accounting Review 1980, pp. 578-593, proposed a mixed integer programming model for designing an internal control system. Her model minimizes the cost of controls subject to a given percentage of quality improvement desired in the output from the system. In order to formulate a linear program, the model imposes instrumental polynomial terms with their respective constraints which have the drawback of growing exponentially with the number of terms. The accounting system is modeled as a set of controls that can correct a set of error types (which could be errors in various ledgers). We extend Hamlen's approach to a more detailed model that identifies error sources within the business process of the accounting system and controls that may be selectively applied to these error sources. Our model also allows us to assess the effect of applying a control to an error source on the resulting probability of errors at all the ledgers that are linked to that error source. This leads to greater flexibility in selecting controls to apply with the potential of better solutions. We also show how our optimization problem formulation, though more detailed than Hamlen's, can be reduced to a non-exponential series of knapsack problems without having to convert a non-linear system into a linear one.

[0009] Other research in accounting literature focused on probabilistic modeling and quantitative assessment of accounting information system reliability. These studies have focused at the accounting system level modeling of reliability assessment using probabilistic or deterministic methods. They treat the transactions streams and transformative processes within the accounting information systems as a black box. Recent studies have begun to develop more detailed models for the assessment of accounting system reliability.

[0010] R. B. Lea, S. J. Adams, and R. F. Boykin in "Modeling of the audit risk assessment process at the assertion level within an account balance", Auditing: A Journal of Practice & Theory 1992 (Vol. 11, Supplement): 152-179, discussed the audit risk assessment models at different levels of detail within accounting systems. They model how risks of error at the level of the various transaction streams are related to the risk of error at the account balance level to which they contribute. They note that the level of tolerable error at the transaction stream level cannot be assumed to be the same as that for the account balance level. Their risk model covers both inherent risk (in the absence of internal controls) and control risk. We follow their motivation to decompose an account balance to its constituent transaction streams but extend their purely additive model to include (a) the volume of transactions in the various streams and (b) the probabilistic network structure of these transaction streams, identifying the various sources of errors (as represented by a process model). This allows us to overcome the assumption made by their model that the errors in the various transaction streams are independent.

[0011] R. Nado, M. Chams, J. Delisio, and W. Hamscher in "Comet: An Application of Model-Based Reasoning to Accounting Systems", Proceedings of the Eighth Innovative Applications of Artificial Intelligence Conference AAAI Press (1996) pp. 1482-1490, developed a process model based reasoning system, which they called "Comet", for analyzing the effectiveness of controls. This is one of the earliest attempts to decompose the accounting system structure into the level of tasks that process transactions and implement internal controls. They modeled accounting systems as a hierarchically structured graph with nodes representing the transaction processing activities and collection points. The potential for failure in each activity is propagated to the collection points that are the accounts being audited. Controls are modeled in terms of the probability that they will not cover the failures. This model can be used to select the key set of controls that reduce the risk of failure below a threshold. However, the paper does not clarify the quantitative model (if any) that is used. It models only the probability of failures but ignores the magnitude of error in these failures. It also implicitly assumes identical and fixed costs for all controls. Our model adopts the basic process modeling concepts introduced in this paper and extends them to develop the quantitative framework described hereinafter. This enables the performance of rigorous quantitative analysis including Monte Carlo simulation of inherent and control risk and optimization of control usage based on risk and cost.

[0012] Research on data quality in the information systems literature has focused on identifying the important characteristics that define the quality of data (see, for example, Y. Wand and R. Y. Wang, "Anchoring data quality dimensions in ontological foundations", Communications of the ACM (39:11) (1996), pp. 86-95, and R. Y. Wang, "A Product Prospective on Total Data Quality Management", Communications of the ACM, (41:2) (1998), pp. 58-65). Recently, the management of data quality and the quality of associated data management processes has been identified as a critical issue (see D. Ballou, R. Wang, H. Pazer, and G. Tayi, "Modeling Information Manufacturing Systems to Determine Information Product Quality", Management Science (44:4), April, 1998, pp. 462-484). However, most of the papers describe the criteria for the information systems design to improve or achieve good data quality (DQ) or information quality (IQ). To our knowledge, none of the papers have tackled data quality management from the point of view quantitative reliability assessment and optimization, nor did they bring the costs of quality and quality improvement into the DQ or IQ assessment consideration. We consider these issues to be critical from the practical perspective of design and management of enterprise information systems.

[0013] Wand and Wang, supra, are amongst the first who studied the data quality in the context of information systems design. They suggested rigorous definitions of data quality dimensions by anchoring them in ontological foundations and showed that such dimensions can provide guidance to systems designers on data quality issues. They developed a set of Ontological Concepts, and defined Design Deficiencies and Data Quality Dimensions. Then they presented the analysis of Dimensions and the Implications to Information Systems Design. Wang, supra, and Ballou et al., supra, developed the Total Data Quality Management methodology (TDQM). TDQM consists of the concepts and the principles of information quality (IQ) and the information product (IP), and procedures of information management system (IMS) for defining, measuring, analyzing, and improving information products.

[0014] L. L. Pipino, Y. W. Lee, and R. Y. Wang, in "Data Quality Assessment", Communications of the ACM, (45:4), (2002), pp. 211-218, introduced three functional forms of data quality: simple ratio, min or max operators, and weighted average. Based on these functional forms, they developed the illustrative metrics for important data quality dimensions. Finally, they presented an approach that combines the subjective and objective assessments of data quality, and demonstrated how the approach can be used effectively in practice.

[0015] H. Xu in "Managing accounting information quality: an Australian study", Managing Accounting Information Quality, (2000), pp. 628-634, developed and tested a model that identifies the critical success factors (CSF) influencing data quality in accounting information systems. He first proposed a list of factors influencing the data quality of AIS from the literature, and then conducted pilot case studies, using the findings from the pilot study together with the literature to identify possible critical success factors for data quality of accounting information systems. He did case studies of accounting information quality in Australian organizations in practice to test and customize the initial research model and compared similarities and differences between proposed critical success factors with real-world critical success factors.

[0016] E. M. Pierce in "Assessing Data Quality with Control Matrices", Communications of the ACM, (47:2), (2004), pp. 82-86, developed a technique for information quality management based on the practice from auditing field: an information product control matrix, to evaluate the reliability of an information product. Pierce defined the components of the matrix, and presented a way to link the data problems to the quality controls that should detect and correct these data problems during the information manufacturing process.

[0017] D. Strong, Y. W. Lee, and R. Wang in "Data Quality in Context", Communications of the ACM, (40:5), (1997), pp. 58-65, propose a data-consumer perspective for data assessments as opposed to the traditional intrinsic DQ assessment. They presented a set of DQ dimensions that consists of not only the Intrinsic DQ, but Accessibility DQ, Contextual DQ and Representational DQ. The latter three concern about the user-task context. They argued that data quality assessment should incorporate the task context of users and the processes by which users' access and manipulate data to meet their task requirements.

[0018] Adopted from Strong et al.'s idea, C. Cappiello, C. Francalanci, and B. Pernici in "Data quality assessment from the user's perspective", International Workshop on Information Quality in Information Systems., 2004, proposed a data quality assessment model that takes into consideration user requirements in the assessment phase. In their mathematical formulation, parameters and matrices to capture the user and user class's preference and requirement are introduced. Their model showed how data quality assessment should take into account how user requirements vary with the accessed service.

SUMMARY OF THE INVENTION

[0019] Our invention addresses the issue of data quality management from the perspectives of the owner or the consumer of the information processing system and predicting and managing the quality of its data when faced with anticipated changes in the business environment in which the system operates. Such changes could include: [0020] Changes in the relative volume of transactions arriving from different input sources. For example, a small but fast-growing business unit alters the mix of sales transactions over time and therefore impacts the overall quality of sales data. [0021] Changes in the business processes and policies that transform the data in the transactions. For example, automated systems replace manual tasks or sections of a process are outsourced. [0022] Changes in the business controls that attempt to detect and fix errors in the transaction. For example, the thresholds that trigger a control are altered or controls are added or removed as part of process re-engineering.

[0023] This invention provides the modeling and analysis for predicting how these changes impact data quality. Then, on the basis of this predictive ability, optimization techniques are used for the placement of error correcting controls that meet target quality requirements while minimizing the cost of operating these controls. This analysis also contributes to the development of business "dashboards" that allow decision-makers to monitor and react to key performance indicators (KPIs) based on aggregation of the transactions being processed. Data quality estimation in real time provides the accuracy of these KPIs (in terms of the probability that a KPI is above or below a given value), which may condition the action undertaken by the decision-maker.

[0024] Our approach to modeling data quality takes advantage of the increasing emphasis in many businesses on the formal modeling of business processes and their underlying information processing systems. Although the initial objective of process modeling is usually for resource planning, and services and workflow design purposes, data quality estimation can be an important secondary outcome.

[0025] A business process model can be used to represent the sources of transactions entering the information processing system and the various tasks within the process that manipulate or transform these transactions. We associate a subset of these tasks as the potential error introduction sources and probabilistically model the rate and magnitude of various error classes at each such task. We also define the information repositories such as accounting ledgers and other databases where the transactions are eventually stored and whose quality needs to be assessed. A network of links (often with probabilistic branches) connects the transaction sources, error sources, and the information repositories.

Continue reading...
Full patent description for Data quality management using business process modeling

Brief Patent Description - Full Patent Description - Patent Application Claims
Click on the above for other options relating to this Data quality management using business process modeling patent application.
###
monitor keywords

How KEYWORD MONITOR works... a FREE service from FreshPatents
1. Sign up (takes 30 seconds). 2. Fill in the keywords to be monitored.
3. Each week you receive an email with patent applications related to your keywords.  
Start now! - Receive info on patent apps like Data quality management using business process modeling or other areas of interest.
###


Previous Patent Application:
Virtual credit with transferability
Next Patent Application:
Method and system for managing material movement and inventory
Industry Class:
Data processing: financial, business practice, management, or cost/price determination

###

FreshPatents.com Support
Thank you for viewing the Data quality management using business process modeling patent info.
IP-related news and info


Results in 0.12066 seconds


Other interesting Feshpatents.com categories:
Software:  Finance AI Databases Development Document Navigation Error