$49/year vs. $20,000+ enterprise platforms
34/34 PostgreSQL types (ltree, PostGIS, ranges, hstore) + 28/28 MySQL types (JSON, spatial, ENUM/SET, partitioning)
62 total exotic types. 84 production-ready tables across 6 industries. Zero foreign key violations.
Built for healthcare, fintech, and SaaS teams who refuse to overpay.
No Credit Card • 1,000 Rows Per Table
Free forever for local development
Or download binary for your OS
Works perfectly with your modern stack
Native CLI integration for Docker, CI/CD, and Seed Scripts
100% exotic type support for both PostgreSQL and MySQL/MariaDB
Available for both PostgreSQL and MySQL/MariaDB
Trusted by developers at
"Finally, realistic healthcare data without HIPAA violations. The ICD-10 and LOINC generators are perfect."
"Saved us 40+ hours per sprint. CI/CD auto-approve mode is a game-changer for our testing pipeline."
"The constraint-safe generation is incredible. No more foreign key violations. Just works."
Compliance-Ready Data Generation
Aphelion generates 100% synthetic data to help you maintain compliance. No real patient data, financial records, or personal information is used or required. Learn more →
Built by engineers tired of SQL seeds. Perfect for MVP velocity and Enterprise scale.
Never worry about foreign key violations. Topological dependency graph ensures perfect referential integrity.
Intelligent data generation for partitioned tables. Respects date ranges and lists constraints automatically.
Automatically computes values for generated/computed columns based on defined expressions.
HIPAA/PCI-DSS compliant. Hash, redact, partial masking. Auto-detect SSN, email, phone, credit cards.
Zipfian, power-law, weighted distributions. Data skew, correlation, temporal patterns match reality.
IMSI, IMEI (Luhn-valid), MSISDN, ICCID, CDRs, billing invoices, network topology (2G-5G).
Fraud detection (6 types), geolocation, device fingerprints, velocity metrics, PCI-DSS tokenization.
Markdown, code snippets, regex patterns, formatted text for social platforms and forums.
MRN (5 formats), ICD-10, RxNorm, SNOMED, LOINC, NDC codes. Comprehensive OMOP CDM & OpenMRS support.
Dates make sense across tables. Bookings before flights, appointments after registration.
Deep trees (5-11 levels), ltree support, cycle detection, HierarchyID paths, JSONB trees.
Shard keys, tenant isolation, realistic data skew. 60% primary, 25% satellite, 15% remote.
CHECK constraints, domains, composite keys, multi-column uniqueness, enum-like types.
XML columns, JSONB trees, ltree paths, HierarchyID, custom domains, PostgreSQL extensions.
Pre-built generators for healthcare, finance, e-commerce, and more.
HIPAA-compliant, HL7 FHIR, clinical terminologies
Rich content, code snippets, reputation systems
PCI-DSS, SOX compliance, fraud detection
CI/CD integration, rapid iteration, zero config
IMSI, IMEI, CDRs, billing, network topology
P&C policies, claims, actuarial data
Case management, contracts, compliance audits
No configuration needed to start. We introspect your database, detect types, and map them to realistic Faker generators automatically.
From empty DB to seeded test environment in 3 commands.
# 1. Install globally
# 2. Introspect your existing database (or use our templates)
> Created schema.json with 42 detected tables
# 3. Generate 10k rows of constraint-safe data
> Generating... Done! (1.4s)
We fill the gap between hacking together scripts and expensive enterprise platforms.
| Feature |
Aphelion
|
Faker.js / Seeds | Custom SQL Scripts |
|---|---|---|---|
| Relational Integrity (FKs) | Automated | Manual ID tracking | Complex CTEs needed |
| Circular Dependencies | Handled | Impossible | ⚠️ Very hard to write |
| Maintenance | Zero Auto-introspects schema | High Break on schema change | High Rewrite query on change |
| Feature |
Aphelion
|
Enterprise AI
Platforms (Gretel, MOSTLY AI, Tonic) |
|---|---|---|
| Primary Focus | Relational Structure Perfect DB seeding & Foreign Keys | Statistical Similarity ML Model Training & Privacy |
| Developer Experience | CLI Native Runs locally, works in CI | Web UI / SaaS Upload data to cloud |
| Postgres Depth | Native Support ltree, hierarchyid, jsonb, ranges | Generic SQL Often treats everything as tables |
| Price | Free / $49 mo | $20k+ / year |
Start free on your local machine. Scale when your team grows.
Perfect for local development and testing.
💰 Save $19,951/year vs. Tonic.ai
For teams automating CI/CD pipelines.
Secure payment via Stripe
🔒 You get realistic data without inheriting production risk.
We never copy, store, hash, or transform real data — we observe structure and generate new data from scratch. All PII is automatically detected and replaced with safe synthetic values.
Scale Transparency: Tested and proven with up to 1.5M rows (100K patients in healthcare demos). Production-ready for datasets up to 250K patients (~3.75M rows) with current configuration. For larger datasets, we offer streaming implementation and direct database loading options. View technical details.
Everything you need to know about Aphelion
Faker.js generates random data but doesn't understand database constraints. Aphelion introspects your schema to ensure zero foreign key violations, handles circular dependencies, and generates realistic healthcare/finance codes (ICD-10, LOINC, etc.) that Faker.js doesn't support.
Yes! Aphelion uses weighted distributions, temporal patterns, and industry-specific generators. Healthcare data includes real ICD-10 codes, LOINC lab tests, and MRN formats. Financial data includes realistic transaction patterns and account hierarchies. It's designed to mirror production data without the compliance risk.
No. Aphelion is for testing, development, and staging environments only. The data is synthetic and realistic, but not suitable for production use. It's designed to replace production data in non-production environments to maintain HIPAA/PCI-DSS compliance.
Currently, Aphelion supports PostgreSQL (including complex features like ltree, JSONB, arrays, and enums). Support for MySQL, SQL Server, and Oracle is on the roadmap.
Use the --seed flag to generate
identical data every time. Same seed = same data. Perfect for reproducible testing, CI/CD pipelines, and debugging. Different
team members can generate the exact same dataset.
No coding required! Aphelion introspects your database schema automatically. Just point it at your database, and it generates a JSON configuration with smart defaults. You can customize if needed, but it works out of the box.
Pro ($49/mo) includes: unlimited rows (tested up to 1.5M), CI/CD auto-approve mode (no manual confirmations), priority email support, and advanced custom generators. Perfect for teams with automated testing pipelines.
Aphelion generates ~10,000 rows/second on modern hardware. A 100K row dataset typically takes 10-15 seconds. The constraint-safe algorithm adds minimal overhead while ensuring perfect referential integrity.
Still have questions?
Contact Us →Updates, tutorials, and announcements.
Solving the hardest PostgreSQL data generation challenges: intelligent partition support and automatic generated column handling. Zero config required.
Comprehensive support for healthcare standards. Millions of realistic patient records.
Read More →Simulate policies, complex claims, and risk models for Actuarial testing.
Read More →Generate massive catalogs and realistic user shopping sessions.
Read More →