Research-grade SEC data — without the $24K terminal.
Point-in-time and survivorship-free by default, with full provenance so every fact traces to the filing it came from. Reproducible by design — no silent revisions changing the dataset under your published paper.
- Point-in-time, as-first-reported data — kill look-ahead bias before it inflates your results.
- Full survivorship-free universe including delisted, bankrupt, and merged entities.
- Every fact traceable to its source filing for citation and peer review.
- Pre-normalized XBRL so you don't lose months cleaning raw tags.
Built for
Researchers
- Point-in-time accurate
- Survivorship-bias-free
- Every number cited to its filing
Works where you do
The pain points we remove
Rigorous research needs clean, point-in-time, reproducible data, but the standard sources are expensive, gated, and quietly mutable. Valuein is built to be the opposite.
The cost wall
A Bloomberg terminal runs roughly $24K or more per user per year; Compustat and CRSP come through WRDS, gated to whoever holds a university subscription. Independent researchers are locked out.
Look-ahead bias baked into vendor data
Vendor fundamentals that carry restated figures, as common Compustat products can, expose numbers that were not yet public on the date you are simulating. Using the wrong vintage silently inflates results in studies of fundamentals and returns.
Survivorship bias
Testing on current constituents overstates returns because the underperformers have already dropped out. You need the delisted and bankrupt names in the sample.
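A toy calculation makes the size of the effect concrete. The five tickers and their returns below are made up for illustration and do not come from any real dataset; the only point is that dropping the delisted names changes the sign of the average.

```python
# Made-up one-year returns for a five-stock universe; two names delisted.
universe = [
    {"ticker": "AAA", "ret": 0.12, "delisted": False},
    {"ticker": "BBB", "ret": 0.08, "delisted": False},
    {"ticker": "CCC", "ret": 0.04, "delisted": False},
    {"ticker": "DDD", "ret": -0.60, "delisted": True},
    {"ticker": "EEE", "ret": -1.00, "delisted": True},
]


def mean_return(rows):
    """Equal-weighted average return across the given rows."""
    return sum(r["ret"] for r in rows) / len(rows)


# What you measure if you only see today's constituents.
survivors = [r for r in universe if not r["delisted"]]
survivors_only = mean_return(survivors)   # about +8%

# What an investor holding the whole universe actually earned.
full_universe = mean_return(universe)     # about -27%

print(f"survivors only: {survivors_only:+.1%}")
print(f"full universe:  {full_universe:+.1%}")
```

The gap between the two numbers is the survivorship bias in this toy sample.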
Reproducibility broken by silent revisions
When a vendor revises its time series after the fact, the dataset under your published paper changes and replication breaks.
EDGAR is free but not usable
Scraping EDGAR at scale is not trivial, and raw XBRL needs heavy processing. DIY normalization eats months you'd rather spend on the research.
Built around your actual cadence
From the daily grind to the month-end crunch — Valuein fits the rhythm of the work, not the other way around.
- Write and run analysis code
- Clean and normalize raw data
- Debug coverage gaps and tag mismatches
- Construct datasets and factor panels
- Run regressions and backtests
- Validate against look-ahead and survivorship traps
- Refresh panels with new filings
- Version data for reproducibility
- Document provenance for submission and peer review
What you can do with Valuein
Each job you need done, mapped to the exact capability that delivers it.
Affordable research-grade access
Free sample + S&P500 tiers, then Pro at $49/mo — no $24K terminal, no university WRDS gate.
Point-in-time, as-first-reported data
Every fact carries an accepted_at timestamp, and the SDK enforces point-in-time (PIT) reads through as_of. Together they kill look-ahead bias.
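The idea behind an as_of read can be sketched in a few lines of plain Python. This is not the Valuein SDK: the record layout, the as_of_view helper, and the sample values are assumptions made for illustration. Only the accepted_at field name and the as_of concept come from the description above.

```python
from datetime import datetime

# Hypothetical fact records: one original figure and one later restatement.
facts = [
    {"cik": 1, "concept": "revenue", "period": "2020Q4", "value": 100.0,
     "accepted_at": datetime(2021, 2, 1)},    # first reported
    {"cik": 1, "concept": "revenue", "period": "2020Q4", "value": 95.0,
     "accepted_at": datetime(2021, 8, 15)},   # restated later
]


def as_of_view(facts, as_of):
    """Return, per (cik, concept, period), the latest fact accepted by as_of."""
    latest = {}
    # Walk facts oldest first so a later acceptance overwrites an earlier one.
    for fact in sorted(facts, key=lambda f: f["accepted_at"]):
        if fact["accepted_at"] <= as_of:
            key = (fact["cik"], fact["concept"], fact["period"])
            latest[key] = fact
    return latest


key = (1, "revenue", "2020Q4")
march = as_of_view(facts, datetime(2021, 3, 31))
september = as_of_view(facts, datetime(2021, 9, 30))

print(march[key]["value"])      # the figure as first reported
print(september[key]["value"])  # the restated figure
```

A backtest dated March 2021 sees only the original figure; the restatement becomes visible only once its acceptance timestamp has passed.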
Full survivorship-free universe
The complete SEC population keyed on CIK — active plus inactive — back to 1993.
Reproducible, provenance-tracked facts
verify_fact_lineage traces each number to its filing, and a versioned Parquet schema keeps each vintage immutable.
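One way to prove on your side that a vintage has not changed is to record a checksum of the data file next to your results. The sketch below uses only the Python standard library and does not call the Valuein SDK (the signature of verify_fact_lineage is not shown here, so it is not used). The file_fingerprint helper and the placeholder file are hypothetical.

```python
import hashlib
import os
import tempfile


def file_fingerprint(path, chunk_size=1 << 20):
    """SHA-256 of a file's bytes, read in chunks to keep memory use flat."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Stand-in for a downloaded vintage; a real one would be a Parquet file.
payload = b"placeholder bytes for one data vintage"
with tempfile.NamedTemporaryFile(delete=False, suffix=".parquet") as tmp:
    tmp.write(payload)
    vintage_path = tmp.name

recorded = file_fingerprint(vintage_path)   # store this with your results

# Later, before re-running: confirm the file is byte-for-byte the same.
unchanged = file_fingerprint(vintage_path) == recorded
print(recorded, unchanged)

os.remove(vintage_path)   # clean up the placeholder file
```

Publishing the recorded hash alongside a paper lets a replicator confirm they are working from the same bytes.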
Pre-normalized XBRL
~11,966 raw tags mapped to ~286 canonical concepts — comparable out of the box.
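In miniature, tag normalization is a many-to-one lookup. The raw names below are standard us-gaap element names, but the canonical concept names, the mapping itself, and the normalize helper are hypothetical and are not Valuein's actual schema.

```python
# Illustrative mapping only: several raw XBRL tags collapse to one concept.
RAW_TO_CANONICAL = {
    "Revenues": "revenue",
    "SalesRevenueNet": "revenue",
    "RevenueFromContractWithCustomerExcludingAssessedTax": "revenue",
    "NetIncomeLoss": "net_income",
    "Assets": "total_assets",
}


def normalize(raw_facts):
    """Map (raw_tag, value) pairs to canonical concepts; list unmapped tags."""
    normalized, unmapped = {}, []
    for tag, value in raw_facts:
        concept = RAW_TO_CANONICAL.get(tag)
        if concept is None:
            unmapped.append(tag)   # surface coverage gaps instead of hiding them
        else:
            normalized[concept] = value
    return normalized, unmapped


# Two filers reporting the same concept under different raw tags.
company_a = [("Revenues", 500.0), ("NetIncomeLoss", 40.0)]
company_b = [("SalesRevenueNet", 320.0), ("CustomTag123", 1.0)]

norm_a, missing_a = normalize(company_a)
norm_b, missing_b = normalize(company_b)
print(norm_a, missing_a)
print(norm_b, missing_b)
```

After normalization both companies expose a revenue field, so a cross-sectional comparison no longer depends on which raw tag each filer chose.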
One token. Every channel.
A single Stripe-issued token unlocks every surface at your tier — use Valuein from your AI client, your code, or the browser.
Research-grade SEC fundamentals without a $24,000 terminal or a WRDS login.
Point-in-time and survivorship-free by default — kill look-ahead bias before it inflates your Sharpe.
Reproducible by design: every fact traces back to the filing it came from.
Frequently asked
Can I cite Valuein data in a paper, and is it reproducible?
Yes. Every fact resolves to its source filing via verify_fact_lineage, and the Parquet schema is versioned so a given vintage is immutable — you can re-run the exact dataset that backed your results.
Do you offer academic or student access?
The sample and S&P500 tiers are free (the S&P500 tier covers full history, 1993 to present, for the index). Pro at $49/mo opens the full universe of 19,000+ companies at a fraction of the cost of a WRDS seat. Reach out for classroom or research-group needs.
How do you handle look-ahead and survivorship bias?
Point-in-time acceptance timestamps prevent look-ahead bias, and the universe includes delisted, bankrupt, and merged entities, so it is survivorship-free. Those are the two biases most likely to invalidate an empirical finance result.
What's the difference from raw SEC EDGAR?
EDGAR is free but raw — inconsistent XBRL tags, no standardization, painful to scrape at scale. We normalize ~11,966 raw tags into ~286 canonical concepts and serve them point-in-time as columnar Parquet.
105M+ standardized SEC facts across 19,000+ companies, 1993–present. Free to start — no credit card.