File Analyzer Agent
The File Analyzer Agent provides comprehensive analysis of file contents, structure, and metadata. It automatically detects data types, validates quality, and extracts meaningful insights from various file formats.
Quick Start
Features
- Multi-Format Support: Reads CSV, Excel, JSON, Parquet, and the other formats listed under Supported File Types
- Data Profiling: Statistical analysis and profiling of file contents
- Schema Detection: Automatic schema inference and validation
- Quality Assessment: Data quality metrics and issue detection
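To make schema detection and quality assessment concrete, here is a minimal sketch using only the Python standard library. It is not the agent's implementation: the function names `infer_type` and `profile_csv`, the three type labels, and the `missing_ratio` metric are illustrative assumptions.

```python
import csv
import io


def infer_type(values):
    """Return the narrowest type label that fits every non-empty value."""
    non_empty = [v for v in values if v.strip() != ""]
    if not non_empty:
        return "empty"
    # Try the strictest cast first; fall through to "string" if none fits.
    for label, cast in (("integer", int), ("float", float)):
        try:
            for v in non_empty:
                cast(v)
        except ValueError:
            continue
        else:
            return label
    return "string"


def profile_csv(text):
    """Profile each column of CSV text: inferred type and share of missing cells."""
    reader = csv.DictReader(io.StringIO(text))
    rows = list(reader)
    profile = {}
    if not rows:
        return profile
    for column in reader.fieldnames or []:
        # DictReader fills short rows with None, so normalise to "".
        values = [row[column] or "" for row in rows]
        missing = sum(1 for v in values if v.strip() == "")
        profile[column] = {
            "type": infer_type(values),
            "missing_ratio": missing / len(values),
        }
    return profile
```

Calling `profile_csv` on a small CSV string returns one entry per header column, which is enough to spot a numeric column that contains stray text or a column that is mostly empty.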
Supported File Types
- Delimited Files: CSV, TSV, pipe-delimited
- Spreadsheets: Excel (.xlsx, .xls), Google Sheets
- Structured Data: JSON, JSONL, XML, YAML
- Columnar Formats: Parquet, ORC, Avro
- Databases: SQLite, database dumps
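As a rough illustration of how the format families above could be told apart, the sketch below maps file extensions to a family name. The mapping table, the `detect_family` name, and the choice of extensions are assumptions for the example. The agent may well inspect file contents instead, and Google Sheets and pipe-delimited files have no dedicated extension, so they are not covered here.

```python
from pathlib import Path

# Hypothetical extension-to-family table based on the list above.
FORMAT_FAMILIES = {
    ".csv": "delimited",
    ".tsv": "delimited",
    ".xlsx": "spreadsheet",
    ".xls": "spreadsheet",
    ".json": "structured",
    ".jsonl": "structured",
    ".xml": "structured",
    ".yaml": "structured",
    ".yml": "structured",
    ".parquet": "columnar",
    ".orc": "columnar",
    ".avro": "columnar",
    ".sqlite": "database",
}


def detect_family(path):
    """Return the format family for a path, or "unknown" if unrecognised."""
    suffix = Path(path).suffix.lower()  # case-insensitive match on the last suffix
    return FORMAT_FAMILIES.get(suffix, "unknown")
```

Only the final suffix is considered, so a compressed file such as `events.jsonl.gz` would be reported as unknown unless it is decompressed first.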
Analysis Output
- File Metadata
- Column Analysis
- Data Quality
Configuration Options
Basic File Analysis
Excel Sheet Selection
Custom Analysis Options
Use Cases
Data Discovery
- Understand new datasets quickly
- Identify data types and patterns
- Assess data quality before processing
Migration Planning
- Analyze source data structure
- Identify potential migration issues
- Plan data transformation strategies
Quality Monitoring
- Regular data quality assessments
- Track data drift over time
- Automated quality reporting
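One way to picture drift tracking is to summarize a column on each run and compare the summary with a stored baseline. The sketch below assumes a numeric column with `None` for missing cells. The names `summarize` and `detect_drift` and both thresholds are hypothetical, not agent settings.

```python
from statistics import mean


def summarize(values):
    """Summarize a numeric column; None marks a missing cell."""
    present = [v for v in values if v is not None]
    return {
        "missing_ratio": 1 - len(present) / len(values) if values else 0.0,
        "mean": mean(present) if present else None,
    }


def detect_drift(baseline, current, max_missing_delta=0.05, max_mean_shift=0.10):
    """Return the names of metrics that moved past their (assumed) thresholds."""
    drifted = []
    # Absolute increase in the share of missing cells.
    if current["missing_ratio"] - baseline["missing_ratio"] > max_missing_delta:
        drifted.append("missing_ratio")
    # Relative shift in the mean, skipped when either side is undefined or zero.
    old, new = baseline["mean"], current["mean"]
    if old is not None and new is not None and old != 0:
        if abs(new - old) / abs(old) > max_mean_shift:
            drifted.append("mean")
    return drifted
```

Storing the output of `summarize` after each scheduled run gives a simple history, and an empty list from `detect_drift` means the column stayed within tolerance.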
Performance Features
- Streaming Analysis: Handles large files efficiently
- Incremental Processing: Only analyzes changed portions
- Memory Optimization: Smart sampling for large datasets
- Parallel Processing: Concurrent analysis of multiple files
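Streaming analysis and sampling both come down to processing rows one at a time without holding the whole file in memory. The sketch below shows two standard techniques that fit that description: Welford's online algorithm for mean and variance, and reservoir sampling (Algorithm R) for a fixed-size uniform sample. Whether the agent uses these exact techniques is an assumption, and the class and function names are illustrative.

```python
import random


class RunningStats:
    """Welford's online algorithm: mean and variance in one pass, O(1) memory."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean

    def update(self, x):
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self._m2 += delta * (x - self.mean)

    @property
    def variance(self):
        """Sample variance (n - 1 denominator); 0.0 with fewer than two values."""
        return self._m2 / (self.count - 1) if self.count > 1 else 0.0


def reservoir_sample(stream, k, rng=None):
    """Keep a uniform random sample of at most k items from a stream of unknown length."""
    rng = rng or random.Random()
    sample = []
    for i, item in enumerate(stream):
        if i < k:
            sample.append(item)  # fill the reservoir first
        else:
            j = rng.randint(0, i)  # inclusive on both ends
            if j < k:
                sample[j] = item  # replace with probability k / (i + 1)
    return sample
```

Both helpers accept any iterable, so they can be fed directly from a file reader that yields one parsed value per row.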
Best Practices
- File Preparation
  - Use consistent file naming conventions
  - Ensure proper encoding (UTF-8 recommended)
  - Include headers in structured files
  - Document file sources and update schedules
- Analysis Optimization
- Quality Management