Workspace beta is live — BYO-LLM chat wired to 57 SEC tools. Try it free →
ValueinValuein
For academics, PhD students & independent researchers

Research-grade SEC data — without the $24K terminal.

Point-in-time and survivorship-free by default, with full provenance so every fact traces to the filing it came from. Reproducible by design — no silent revisions changing the dataset under your published paper.

  • Point-in-time, as-first-reported data — kill look-ahead bias before it inflates your results.
  • Full survivorship-free universe including delisted, bankrupt, and merged entities.
  • Every fact traceable to its source filing for citation and peer review.
  • Pre-normalized XBRL so you don't lose months cleaning raw tags.

Built for

Researchers

  • Point-in-time accurate
  • Survivorship-bias-free
  • Every number cited to its filing

Works where you do

Python SDKBulk Data APIMCP Server
Recommended plan
Pro

Point-in-time accurate · Survivorship-bias-free · Every number cited to its filing

$24K+
terminal cost, avoided
1993→now
reproducible history
100%
traceable to filing

The pain points we remove

Rigorous research needs clean, point-in-time, reproducible data — but the standard sources are expensive, gated, and quietly mutable. Valuein is built for the opposite.

1

The cost wall

Bloomberg is ~$24K+/user/yr; Compustat and CRSP come through WRDS, gated to whoever holds a university subscription. Independent researchers are locked out.

2

Look-ahead bias baked into vendor data

Look-ahead bias is present in common Compustat products — using the wrong vintage silently inflates results in studies of fundamentals and returns.

3

Survivorship bias

Testing on current constituents overstates returns because the underperformers dropped out. You need the delisted and bankrupt names present.

4

Reproducibility broken by silent revisions

When a vendor readjusts its time series after the fact, the dataset under your published paper changes — and replication breaks.

5

EDGAR is free but not usable

EDGAR is free but not trivial to scrape, and raw XBRL needs heavy processing. DIY normalization eats months you'd rather spend on the research.

Built around your actual cadence

From the daily grind to the month-end crunch — Valuein fits the rhythm of the work, not the other way around.

Every day
  • Write and run analysis code
  • Clean and normalize raw data
  • Debug coverage gaps and tag mismatches
Every week
  • Construct datasets and factor panels
  • Run regressions and backtests
  • Validate against look-ahead and survivorship traps
Month / quarter-end
  • Refresh panels with new filings
  • Version data for reproducibility
  • Document provenance for submission and peer review

What you can do with Valuein

Each job you need done, mapped to the exact capability that delivers it.

Affordable research-grade access

Free sample + S&P500 tiers, then Pro at $49/mo — no $24K terminal, no university WRDS gate.

Datasets · all tiers

Point-in-time, as-first-reported data

accepted_at on every fact and as_of PIT enforcement in the SDK kill look-ahead bias.

Python SDK PIT

Full survivorship-free universe

The complete SEC population keyed on CIK — active plus inactive — back to 1993.

Survivorship-free universe

Reproducible, provenance-tracked facts

verify_fact_lineage traces each number to its filing; versioned Parquet schema means immutable vintages.

verify_fact_lineage · versioned schema

Pre-normalized XBRL

~11,966 raw tags mapped to ~286 canonical concepts — comparable out of the box.

Standardized concepts

Research-grade SEC fundamentals without a $24,000 terminal or a WRDS login.

Point-in-time and survivorship-free by default — kill look-ahead bias before it inflates your Sharpe.

Reproducible by design: every fact traces back to the filing it came from.

Frequently asked

Can I cite Valuein data in a paper, and is it reproducible?

Yes. Every fact resolves to its source filing via verify_fact_lineage, and the Parquet schema is versioned so a given vintage is immutable — you can re-run the exact dataset that backed your results.

Do you offer academic or student access?

The sample and S&P500 tiers are free (the S&P500 tier is full history, 1993-present, for the index). Pro at $49/mo opens the full 19,000+ universe — a fraction of a WRDS seat. Reach out for classroom or research-group needs.

How do you handle look-ahead and survivorship bias?

Point-in-time acceptance timestamps prevent look-ahead, and the universe includes delisted/bankrupt/merged entities so it's survivorship-free — the two biases most likely to invalidate an empirical finance result.

What's the difference from raw SEC EDGAR?

EDGAR is free but raw — inconsistent XBRL tags, no standardization, painful to scrape at scale. We normalize ~11,966 raw tags into ~286 canonical concepts and serve them point-in-time as columnar Parquet.

Research-grade SEC fundamentals without a $24,000 terminal or a WRDS login.

105M+ standardized SEC facts across 19,000+ companies, 1993–present. Free to start — no credit card.