AAU Student Projects - visit Aalborg University's student projects portal
A master's thesis from Aalborg University
Book cover


Leveraging LLMs to Generate Business Plans

Authors

;

Term

4. semester

Publication year

2025

Submitted on

Pages

74

Abstract

In a competitive market, startups need clear business plans to attract funding, but many lack the time and money to create them. This study explores whether large language models (LLMs)—AI systems that generate text—can help draft business plans and how they compare to traditional methods. We built a multi-agent system using the CrewAI framework, where six specialized AI agents each wrote a different section of a plan. For writing we used Google’s Gemini 2.5 Pro; for an initial check we used Meta’s Llama 3.3 70B. We tested the system with two types of inputs: fictional company briefs prepared by experts, and synthetic data derived from existing plans using GPT-4.1. We assessed results with both human experts and an LLM-as-a-Judge approach, using five criteria: relevance, completeness, correctness, consistency, and clarity. The system produced business plans that were well structured and on-topic across sections, and it offered clear benefits in speed, cost, and accessibility for startups. However, the outputs are best treated as drafts: they can include invented or inaccurate details (hallucinations), occasional mismatches between sections, and a lack of deeper business insight that experienced humans provide. Overall, AI-generated business plans work well as starting points that still require human review and refinement. They complement—but do not replace—traditional planning methods.

I et konkurrencepræget marked har startups brug for klare forretningsplaner for at tiltrække finansiering, men mange mangler tid og penge til at udarbejde dem. Dette studie undersøger, om Large Language Models (LLMs) – AI-systemer der genererer tekst – kan hjælpe med at skrive forretningsplaner, og hvordan de klarer sig i forhold til traditionelle metoder. Vi byggede et multi-agent-system med CrewAI-rammeværket, hvor seks specialiserede AI-agenter hver skrev en sektion af planen. Til selve skrivningen brugte vi Googles Gemini 2.5 Pro; til en indledende evaluering brugte vi Metas Llama 3.3 70B. Systemet blev testet på to typer input: fiktive virksomhedsoplæg udarbejdet af eksperter og syntetiske data afledt af eksisterende planer ved hjælp af GPT-4.1. Resultaterne blev vurderet både af menneskelige eksperter og med en LLM-as-a-Judge-tilgang, ud fra fem kriterier: relevans, fuldstændighed, korrekthed, konsistens og klarhed. Systemet fremstillede forretningsplaner, der var velstrukturerede og relevante på tværs af sektioner, og gav tydelige fordele i hastighed, omkostninger og tilgængelighed for startups. Men outputtet bør ses som udkast: der kan forekomme opfundne eller unøjagtige oplysninger (hallucinationer), lejlighedsvise uoverensstemmelser mellem sektioner og mangel på dybere forretningsindsigt, som erfarne mennesker tilfører. Samlet set fungerer AI-genererede forretningsplaner bedst som udgangspunkt, der stadig kræver menneskelig gennemgang og forfinelse. De supplerer – men erstatter ikke – traditionelle planlægningsmetoder.

[This apstract has been rewritten with the help of AI based on the project's original abstract]