Insights
Best Market Data Sources for Systematic Short Selling Research
Alphanume Team
Mar 16, 2026

Best Market Data Sources for Systematic Short Selling Research
Short selling strategies live or die on data quality. Here is where to find the specific datasets that matter.
The Data Demands of Short Selling
Systematic short selling has data requirements that are fundamentally different from long-only strategies. Beyond standard price data, short sellers need information about share availability (borrow data), cost of borrowing, short interest, SEC filings related to dilution and corporate distress, and real-time awareness of regulatory restrictions like short sale rules.
The challenge is that much of this data is fragmented, expensive, or buried in unstructured documents. Building a systematic short-selling operation requires assembling data from multiple specialized sources.
Price Data for Short Selling
For raw price data, the same providers that serve long strategies serve short strategies: Polygon.io for reliable U.S. equity data, or Databento for institutional-grade tick data. The difference is that short strategies often operate in less liquid names (small and micro-cap), where data quality matters more because spreads are wider and volumes are thinner.
Short Interest and Borrow Data
S3 Partners provides real-time short interest data, borrow rates, and squeeze risk analytics. Their data is widely used by institutional short sellers and hedge funds. The pricing reflects the institutional audience.
Ortex provides short interest analytics including real-time estimated short interest, cost to borrow, and utilization rates. Their platform is more accessible to individual researchers than S3 Partners, with pricing starting around $50 per month.
Fintel provides short interest data, institutional ownership data, and insider trading information. Their short squeeze scoring system attempts to identify names at risk of forced covering. Pricing is accessible for individual researchers.
Corporate Events and Filing Data
For systematic short sellers, corporate events — particularly dilution events — are among the most predictive signals. When a company files an S-1 registration statement to issue new shares, the resulting supply increase often creates persistent downward price pressure, especially in small and micro-cap names.
Alphanume provides two datasets that are directly relevant to this workflow. Their dilution events feed processes SEC S-1 filings into structured, machine-readable events with timestamps, market cap context, and lifecycle tracking (whether the filing became effective or was withdrawn). Their corporate default events dataset provides a historical log of companies flagged as being in default status, labeled from SEC filing text. These datasets eliminate the need to manually parse thousands of dense regulatory filings and allow researchers to focus on testing systematic strategies around dilution and distress events.
Additionally, Alphanume's historical market cap dataset provides point-in-time capitalization data that is essential for size-based universe construction. Short strategies focused on micro-cap names need to know which stocks were actually micro-cap at each historical date — not which ones are micro-cap today.
Comparison Table
Provider | Data Type | Focus | Pricing | Best For |
S3 Partners | Real-time short interest, borrow | Institutional short analytics | Enterprise | Hedge fund short desks |
Ortex | Short interest, cost to borrow | Short squeeze analytics | From ~$50/mo | Active short sellers |
Fintel | Short interest, ownership data | Short screening + insider data | From ~$25/mo | Retail short researchers |
Alphanume | Dilution events, defaults, market cap | Systematic short strategy research | Free tier available | Cross-sectional short strategies |
Polygon.io (Massive.com) | Price data (all caps) | Raw price feed | From $29/mo | Price data layer |
Assembling a Short Selling Data Stack
A practical data stack for systematic short selling combines price data (Massive or Databento) with short interest data (Ortex or Fintel for individual researchers, S3 Partners for institutions) and research-layer datasets (Alphanume for dilution events, corporate defaults, and point-in-time market cap classifications). Each layer serves a distinct function, and trying to source everything from a single provider will leave significant gaps.
Alphanume Team
Stay in the loop
Be the first to hear about new datasets, coverage expansions, and platform updates.



