flightline mimic
Generate synthetic variations from an existing sample file. This is the fastest way to create test data when you have an example but can’t share the original due to PII concerns.Usage
The Problem It Solves
Developers at companies handling sensitive data face a blocking problem:“I need to test my LLM feature against realistic credit reports, tax documents, and loan applications. But I can’t download production data due to PII and security policies.”The
mimic command analyzes the structure of your sample file and generates synthetic variations that:
- Match the exact schema and field types
- Follow inferred business rules
- Contain zero PII, making them safe to share and commit
Example
How It Works
- Schema Analysis: Flightline analyzes your sample to detect field types, PII fields, and nested structures
- Rule Inference: Business rules are inferred from the data (e.g., “if employed, income > 0”)
- Generation: Synthetic data is generated following all constraints
Use Cases
- Local Development: Can’t access production data? Mimic one sample and generate hundreds of variations locally.
- CI/CD Testing: Create a synthetic test suite that covers edge cases without exposing customer data.
- Sharing Examples: Need to share a realistic file with a contractor or in documentation? Mimic creates shareable versions.
Learn More: The Fabricator
Deep dive into how synthetic data generation works.
