Methodology

Point-in-Time Accuracy

Financial backtests fail when data is available too early — you use information that wasn't publicly known at the time. Valuein timestamps every fact with accepted_at: the exact moment the SEC accepted the filing. Filter by it and your backtest is safe.

report_date

Fiscal period end

filing_date

Date filed with SEC

accepted_at

SEC acceptance timestamp

Why Most Databases Introduce Look-Ahead Bias

Most financial databases store data as it exists today, not as it was known historically. A fiscal year 2021 annual report filed in March 2022 is often backdated to December 31, 2021 — making it appear as if that data was available before it was. Backtests built on this data are invalid.

Timeline for a 2022 10-K Filing

Dec 31, 2022

report_date

Fiscal period ends. Results for the full year are computed internally. The public knows nothing yet.

Feb 15, 2023

filing_date

Company submits the 10-K to SEC EDGAR. Still not indexed in full — processing occurs over hours.

Feb 15, 2023 17:42 UTC

accepted_at

SEC accepts and timestamps the filing. This is the earliest moment any investor could have seen this data. Filter by this.

Now run a dated query against a whole filing history. Everything accepted on or before the as_of date is returned; the later restatement stays walled off — invisible, exactly as it would have been on that day.

query: get_company_fundamentals(as_of="2021-06-30")What a query dated 2021-06-30 returns — and what it is forbidden to see.

as_of · 2021-06-30

10-K FY2019

2020-02-26

10-Q Q1

2020-05-01

10-Q Q2

2020-07-30

10-K FY2020

2021-02-24

as originally reported

10-K/A FY2020

2021-11-09

restatement · filed later

10-Q Q3 2021

2021-12-03

visible at query datefiled later · walled offthe 10-K/A restatement is invisible to this query — no look-ahead

The hidden trap

If a data vendor stores this filing against fiscal_year = 2022 with no timestamp, a backtest that says "use 2022 annual data as of Jan 1, 2023" will include it — but the filing wasn't accepted until Feb 15, 2023. Your simulated portfolio used information that didn't exist yet. This introduces look-ahead bias and inflates backtest performance.

The Three Key Fields

accepted_at

fact

TIMESTAMPTZ

The exact UTC timestamp the SEC accepted the filing that disclosed this fact. Each fact row inherits accepted_at from its parent filing on indexing, so the filter works identically on either table. This is your PIT anchor — use it exclusively for backtest-safe queries. It represents the earliest moment any investor could have read this data.

Always filter: WHERE accepted_at <= your_date

filing_date

filing

DATE

The date the SEC received the filing. Very close to accepted_at but lacks the exact time component. Suitable for rough date-range filtering but accepted_at is more precise for PIT analysis.

Safe for range filtering, less precise than accepted_at

report_date

filing

DATE

The fiscal period end date (e.g. December 31 for a calendar-year company). This is NOT a PIT field — using it as a filter introduces look-ahead bias because the data wasn't known until the filing date weeks or months later.

For display purposes only — never use as a PIT filter

These tables carry accepted_at; the SDK filters every PIT view to accepted_at <= as_of.

Core financialsDerived

Relationships (text)

filing references entity via entity_id → cik (many → 1)
fact references entity via entity_id → cik (many → 1)
fact references filing via accession_id (many → 1)
ratio references entity via entity_id → cik (many → 1)

Wrong vs. Right Queries

The difference between a biased and a valid backtest often comes down to a single WHERE clause.

Wrong — look-ahead bias

sql

-- WRONG: look-ahead bias introduced-- This returns data as if you knew it on Jan 1 2022,-- but 10-K filings for fiscal year 2021 weren't published-- until Feb–March 2022. You're using future information.SELECT  r.symbol,  fa.numeric_value / 1e9 AS revenue_billionsFROM references rJOIN filing f  ON f.entity_id = r.cikJOIN fact fa   ON fa.accession_id = f.accession_idWHERE fa.standard_concept = 'Revenues'  AND f.fiscal_year = 2021          -- WRONG: fiscal year is NOT when data was known  AND f.form_type   = '10-K'ORDER BY revenue_billions DESC;

Right — PIT-safe

sql

-- RIGHT: point-in-time safe using accepted_at-- Only returns data that was publicly available on 2022-01-01.-- If a company filed its 2020 10-K late (e.g. Feb 2022),-- it will NOT appear in this query -- correct behavior.SELECT  r.symbol,  fa.numeric_value / 1e9 AS revenue_billions,  fa.accepted_at                            -- visible timestampFROM references rJOIN filing f  ON f.entity_id = r.cikJOIN fact fa   ON fa.accession_id = f.accession_idWHERE fa.standard_concept = 'Revenues'  AND f.form_type         = '10-K'  AND fa.accepted_at     <= '2022-01-01'   -- RIGHT: PIT filterORDER BY revenue_billions DESC;

Survivorship Bias

Look-ahead bias is temporal — using today's data in the past. Survivorship bias is structural — only analyzing companies that still exist today. Both inflate backtest returns and both are invisible unless your dataset is specifically built to prevent them.

Delisted companies

Valuein tracks all entities including those that were delisted, acquired, or went bankrupt. The Pro and Institutional tiers include 19,000+ entities — active and inactive.

Historical index membership

The index_membership table records exact effective_date / removal_date for each company in each index, with [) interval semantics. A 2010 S&P500 backtest uses the 2010 constituents, not today's.

PIT universe construction

Use get_pit_universe(as_of_date) to reconstruct the exact investable universe on any historical date — free of additions that happened after.

The dead are still in the indexDelisted, bankrupt, merged, taken private — kept, not survivorship-pruned.

ENRNbankrupt · 2001
LEHbankrupt · 2008
WAMUseized · 2008
BBBYbankrupt · 2023
SIVBfailed · 2023
FTXcollapsed · 2022

get_pit_universe(date) → the index as it stood, not as it survived

Survivorship-bias-free universe constructionsql

-- Build a survivorship-bias-free universe for March 2020-- This returns exactly who was in the S&P500 on that date ---- before COVID additions/removals, before failures, before mergers.SELECT  cik,  ticker,  name,  sectorFROM get_pit_universe(  as_of_date => '2020-03-01',  index       => 'SP500'); -- WRONG alternative (survivorship bias):-- Using the current S&P500 list for 2020 data excludes-- companies that were dropped and includes companies that-- didn't exist in the index yet.

PIT in the Python SDK

Every SDK method that returns time-series data accepts an as_of_date parameter. Pass it to transparently filter by accepted_at.

pit_backtest.pypython

from valuein_sdk import ValueinClient, ValueinError try:    with ValueinClient() as client:         # PIT-safe: only data known as of the backtest date        df = client.run_query("""            SELECT r.symbol, fa.fiscal_year,                   fa.numeric_value / 1e9 AS revenue_bn,                   fa.accepted_at            FROM fact fa            JOIN references r ON fa.entity_id = r.cik            WHERE r.symbol              = 'AAPL'              AND fa.standard_concept   = 'TotalRevenue'              AND fa.fiscal_period      = 'FY'              AND fa.accepted_at      <= '2023-01-01'   -- PIT filter            ORDER BY fa.fiscal_year DESC            LIMIT 10        """)         # All rows have accepted_at <= 2023-01-01        print(df[["fiscal_year", "revenue_bn", "accepted_at"]]) except ValueinError as e:    print(f"Error: {e}")

Frequently Asked Questions

Ready to build a PIT-safe backtest?

Start with the free sample tier — all PIT fields are included. Upgrade for full S&P500 history or the complete 19,000+ ticker universe.

PIT Universe Tool Python SDK Guide