AAU Student Projects is unavailable between June 15th 1.30pm and 17th 1.30pm due to planned system maintenance. The projects cannot be downloaded during this period.
AAU Student Projects - visit Aalborg University's student projects portal
A master's thesis from Aalborg University
Book cover


Bid-Ask Spread Forecasting: Modelling and Forecasting Bid–Ask Spreads with Integer-Valued Trawl Processes

Translated title

Bid-Ask Spread Forecasting: Modellering og prognosticering af bid--ask-spreads med heltalsværdige trawlprocesser

Author

Term

4. term

Publication year

2026

Submitted on

Pages

59

Abstract

This thesis studies how to model and forecast the bid–ask spread—the difference between the best buy and sell prices—measured every 5 seconds for four liquid stocks. The spread takes non‑negative integer values and changes slowly over time. It also varies more than a simple Poisson model allows (overdispersion). For these reasons, I use integer‑valued trawl processes, a flexible class of time‑series models for counts that capture long‑lived dependence. I compare six specifications that combine two ways to describe the variability in the data (Poisson or negative binomial) with three ways to describe how past effects decay over time (exponential, inverse‑Gaussian, or Gamma trawl). The models are estimated by pairwise composite likelihood, which fits the model using information from pairs of observations, and evaluated by in‑sample fit and by forecast performance in an expanding‑window design that re‑estimates the model as new data arrive. The results show that the negative binomial improves the marginal fit by allowing overdispersion, while the Gamma trawl generally matches the empirical autocorrelation structure best (how current values relate to past ones). Forecast performance depends strongly on the horizon: Poisson‑INGARCH, a common count‑time‑series model, is most competitive at the 5‑second horizon; negative‑binomial IVT models perform best around 1 minute; and at longer horizons the strong persistence makes further gains hard to achieve. Overall, IVT models offer an interpretable and useful framework for spread dynamics, but richer specifications require careful numerical diagnostics and do not unambiguously dominate simpler benchmarks.

Afhandlingen handler om at modellere og forudsige bid–ask-spreadet – forskellen mellem bedste købs- og salgspris – målt hvert 5. sekund for fire likvide aktier. Spreadet er et heltal, kan ikke være negativt og ændrer sig langsomt over tid. Det svinger også mere end en simpel Poisson-model tillader (overdispersion). Derfor bruger jeg heltalsværdige trawlprocesser, en fleksibel klasse af modeller, der kan beskrive tælletal over tid og deres hukommelse. Jeg sammenligner seks varianter, der kombinerer to måder at beskrive variationen i observationerne på (Poisson eller negativ binomial) med tre måder at beskrive, hvordan effekter aftager over tid (eksponentiel, invers‑Gaussisk eller Gamma trawl). Modellerne estimeres med parvis sammensat likelihood, en metode der udnytter information i par af observationer, og vurderes både på pasform til de data, de er trænet på, og på forudsigelser i et ekspanderende vinduesdesign, hvor modellen løbende genestimeres, når nye data tilføjes. Resultaterne viser, at den negative binomial forbedrer den marginale pasform ved at tillade overdispersion, mens Gamma‑trawlen typisk rammer den observerede autokorrelationsstruktur bedst (hvordan nutidige værdier hænger sammen med fortidige). Forudsigelsesstyrken afhænger stærkt af horisonten: Poisson‑INGARCH, en udbredt model for tælletidsserier, er mest konkurrencedygtig ved 5‑sekunders horisonten; negative binomiale IVT‑modeller klarer sig bedst omkring 1 minut; og ved længere horisonter gør den udtalte persistens det svært at opnå forbedringer. Samlet set giver IVT‑modeller en fortolkelig og nyttig ramme for spread‑dynamik, men mere komplekse specifikationer kræver omhyggelig numerisk diagnostik og overgår ikke entydigt simplere benchmarks.

[This apstract has been rewritten with the help of AI based on the project's original abstract]