Data Quality Checker Agent
The Data Quality Checker Agent validates data integrity, identifies quality issues, and ensures data meets specified standards. It performs comprehensive checks across multiple dimensions of data quality.
Quick Start
Quality Dimensions
Completeness
Identifies missing values and null data
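As an illustration of a completeness check, the sketch below computes the share of non-missing values per field. It is a minimal example, not the agent's own implementation: the helper names `is_missing` and `completeness_by_field` are hypothetical, and treating empty or whitespace-only strings as missing is an assumption.

```python
from typing import Any


def is_missing(value: Any) -> bool:
    """Assumption: None and empty/whitespace-only strings count as missing."""
    return value is None or (isinstance(value, str) and value.strip() == "")


def completeness_by_field(records: list[dict[str, Any]], fields: list[str]) -> dict[str, float]:
    """Return the fraction of records holding a non-missing value for each field."""
    total = len(records)
    result = {}
    for field in fields:
        present = sum(1 for r in records if not is_missing(r.get(field)))
        # An empty dataset has nothing missing, so report it as fully complete.
        result[field] = present / total if total else 1.0
    return result


records = [
    {"id": 1, "email": "a@example.com"},
    {"id": 2, "email": None},
    {"id": 3, "email": "  "},
    {"id": 4},  # the email key is absent entirely
]
print(completeness_by_field(records, ["id", "email"]))  # {'id': 1.0, 'email': 0.25}
```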
Validity
Validates data against format and domain rules
Consistency
Checks for logical consistency across records
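One way to picture a cross-record consistency check is to look for keys that map to conflicting values, for example a customer listed under two different countries. The sketch below assumes that interpretation; `find_conflicts` and the sample field names are hypothetical.

```python
from collections import defaultdict


def find_conflicts(records, key, field):
    """Return {key_value: sorted distinct values} for keys that map to more than one value."""
    seen = defaultdict(set)
    for record in records:
        seen[record[key]].add(record[field])
    return {k: sorted(values) for k, values in seen.items() if len(values) > 1}


orders = [
    {"customer_id": "C1", "country": "DE"},
    {"customer_id": "C1", "country": "FR"},  # conflicts with the row above
    {"customer_id": "C2", "country": "US"},
    {"customer_id": "C2", "country": "US"},
]
print(find_conflicts(orders, "customer_id", "country"))  # {'C1': ['DE', 'FR']}
```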
Accuracy
Verifies data against reference sources
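Accuracy checks need a trusted reference to compare against. The sketch below assumes the reference is available as a simple key-to-value lookup and skips records that cannot be verified; the function name and the sample data are hypothetical.

```python
def accuracy_against_reference(records, reference, key, field):
    """Compare `field` in each record with the reference value for the same key.

    Returns (accuracy, mismatches). Records whose key is absent from the
    reference are skipped, because they cannot be verified either way.
    accuracy is None when no record could be checked.
    """
    checked = 0
    mismatches = []
    for record in records:
        k = record[key]
        if k not in reference:
            continue
        checked += 1
        if record[field] != reference[k]:
            mismatches.append((k, record[field], reference[k]))
    accuracy = (checked - len(mismatches)) / checked if checked else None
    return accuracy, mismatches


reference_postcodes = {"P1": "10115", "P2": "75001"}
people = [
    {"person_id": "P1", "postcode": "10115"},
    {"person_id": "P2", "postcode": "75002"},  # differs from the reference
    {"person_id": "P3", "postcode": "00000"},  # not in the reference, skipped
]
print(accuracy_against_reference(people, reference_postcodes, "person_id", "postcode"))
# (0.5, [('P2', '75002', '75001')])
```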
Validation Rules
Built-in Rules
- Email Format: Valid email address patterns
- Phone Numbers: International phone number formats
- Dates: Valid date ranges and formats
- Numeric Ranges: Min/max value constraints
- Text Patterns: Regex pattern matching
- Referential Integrity: Foreign key constraints
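The rule types listed above could be approximated with small standard-library predicates like the ones below. This is a sketch only: the function names are hypothetical, the email pattern is deliberately simplified, and the phone pattern assumes E.164-style numbers (a leading `+` followed by 7 to 15 digits). The agent's actual built-in rules may be stricter or differ in detail.

```python
import re
from datetime import date, datetime

# Simplified patterns chosen for illustration, not full standards compliance.
EMAIL_RE = re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+")
PHONE_RE = re.compile(r"\+[1-9]\d{6,14}")


def valid_email(value: str) -> bool:
    return EMAIL_RE.fullmatch(value) is not None


def valid_phone(value: str) -> bool:
    return PHONE_RE.fullmatch(value) is not None


def valid_date(value: str, earliest: date, latest: date, fmt: str = "%Y-%m-%d") -> bool:
    """Check both the format and that the date falls inside the allowed range."""
    try:
        parsed = datetime.strptime(value, fmt).date()
    except ValueError:
        return False
    return earliest <= parsed <= latest


def in_range(value: float, minimum: float, maximum: float) -> bool:
    return minimum <= value <= maximum


def orphaned_keys(child_rows, fk, parent_ids):
    """Referential integrity: foreign-key values with no matching parent id."""
    return [row[fk] for row in child_rows if row[fk] not in parent_ids]


print(valid_email("user@example.com"))   # True
print(valid_phone("+14155550123"))       # True
print(valid_date("2024-02-30", date(2000, 1, 1), date(2030, 1, 1)))  # False
print(orphaned_keys([{"customer_id": 1}, {"customer_id": 9}], "customer_id", {1, 2}))  # [9]
```

Text pattern rules follow the same shape as `valid_email`: compile the pattern once and test each value with `fullmatch`.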
Custom Rules
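The source does not show how custom rules are declared, so the following is only a hypothetical shape: a rule is a named predicate over a record, stored together with its rationale. `Rule`, `apply_rules`, and the sample rule are illustrative names, not the agent's API.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass(frozen=True)
class Rule:
    name: str
    check: Callable[[dict[str, Any]], bool]  # returns True when the record passes
    rationale: str                            # why the rule exists


def apply_rules(records, rules):
    """Return a list of (record_index, rule_name) for every failed check."""
    failures = []
    for index, record in enumerate(records):
        for rule in rules:
            if not rule.check(record):
                failures.append((index, rule.name))
    return failures


discount_rule = Rule(
    name="discount_not_above_price",
    check=lambda r: r["discount"] <= r["price"],
    rationale="A discount larger than the price would produce a negative total.",
)

line_items = [
    {"price": 100.0, "discount": 10.0},
    {"price": 20.0, "discount": 25.0},  # violates the rule
]
print(apply_rules(line_items, [discount_rule]))  # [(1, 'discount_not_above_price')]
```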
Quality Report
- Summary Report
- Detailed Issues
- Recommendations
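To make the three report parts concrete, the sketch below assembles a summary, the detailed issue list, and simple recommendations from a list of issues. The report layout, the key names, and the `build_report` function are assumptions for illustration; the agent's real report format is not specified in the source.

```python
from collections import Counter


def build_report(total_records, issues):
    """Build a report dict from issues shaped like {'row': int, 'field': str, 'rule': str}."""
    by_rule = Counter(issue["rule"] for issue in issues)
    failing_rows = {issue["row"] for issue in issues}
    summary = {
        "total_records": total_records,
        "records_with_issues": len(failing_rows),
        "pass_rate": (total_records - len(failing_rows)) / total_records if total_records else 1.0,
        "issues_by_rule": dict(by_rule),
    }
    # Recommendations are ordered so the most frequently failing rule comes first.
    recommendations = [
        f"Review rule '{rule}': {count} failing value(s)"
        for rule, count in by_rule.most_common()
    ]
    return {"summary": summary, "detailed_issues": issues, "recommendations": recommendations}


issues = [
    {"row": 0, "field": "email", "rule": "email_format"},
    {"row": 0, "field": "age", "rule": "numeric_range"},
    {"row": 2, "field": "email", "rule": "email_format"},
]
report = build_report(4, issues)
print(report["summary"]["pass_rate"])   # 0.5
print(report["recommendations"][0])     # Review rule 'email_format': 2 failing value(s)
```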
Configuration
Basic Quality Check
Custom Rule Set
Threshold-Based Validation
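Threshold-based validation can be read as comparing a score per quality dimension against a configured minimum. The sketch below assumes scores between 0 and 1 and treats a dimension with no score as 0.0, so that an unmeasured dimension fails rather than passes silently. The function name and the threshold values are hypothetical.

```python
def evaluate_thresholds(scores, thresholds):
    """Return (passed, breaches), where breaches maps dimension -> (score, minimum)."""
    breaches = {}
    for dimension, minimum in thresholds.items():
        score = scores.get(dimension, 0.0)  # assumption: unmeasured counts as failing
        if score < minimum:
            breaches[dimension] = (score, minimum)
    return (not breaches), breaches


scores = {"completeness": 0.98, "validity": 0.91}
thresholds = {"completeness": 0.95, "validity": 0.95, "accuracy": 0.90}

passed, breaches = evaluate_thresholds(scores, thresholds)
print(passed)    # False
print(breaches)  # {'validity': (0.91, 0.95), 'accuracy': (0.0, 0.9)}
```

In an automated quality gate, the `passed` flag could decide the exit status of the pipeline step, with `breaches` written to the build log.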
Integration Features
- CI/CD Integration: Automated quality gates
- Real-time Monitoring: Continuous quality assessment
- Alert System: Notifications for quality degradation
- Reporting Dashboard: Visual quality metrics
- Data Lineage: Track quality across data pipeline
Best Practices
- Rule Design
  - Start with basic rules, then add complexity
  - Use business context for rule definitions
  - Balance strictness with practicality
  - Document rule rationale and exceptions
- Quality Monitoring
- Issue Resolution