Retrieval-Augmented Generation (RAG) for Real-Time Financial Market Analysis

Priyank Tailor

doi:10.37547/tajiir/Volume07Issue07-12

Authors

Priyank Tailor
Data Scientist / AI Researcher Jersey City, NJ, USA

DOI:

https://doi.org/10.37547/tajiir/Volume07Issue07-12

Abstract

The rapid growth of unstructured financial data—ranging from earnings calls and SEC filings to real-time social me- dia and global news—has outpaced the ability of traditional analysis tools to provide timely, contextual insights. Most natural language models are trained on static data and lack the capacity to integrate dynamic, real-world updates. Retrieval- Augmented Generation (RAG) bridges this gap by combining document retrieval with generative capabilities, creating a more grounded and up-to-date understanding of user queries. This paper presents a domain-adapted RAG-based framework for real-time financial analysis, using vector databases and domain-specific language models. The framework demon- strates improved contextual accuracy, reduced hallucination, and greater interpretability compared to traditional NLP mod- els. Our findings indicate that RAG has the potential to become a core component in next-generation financial intelli- gence systems.

The American Journal of Interdisciplinary Innovations and Research

137

https://www.theamericanjournals.com/index.php/tajiir

Type

Original Research

PAGE NO.

137-144

DOI

10.37547/tajiir/Volume07Issue07-12

OPEN ACCESS

SUBMITED

14 June 2025

ACCEPTED

26 June 2025

PUBLISHED

29 July 2025

VOLUME

Vol.07 Issue 07 2025

CITATION

Priyank Tailor. (2025). Retrieval-Augmented Generation (RAG) for Real-
Time Financial Market Analysis. The American Journal of Interdisciplinary
Innovations and Research, 7(07), 137

–

144.

https://doi.org/10.37547/tajiir/Volume07Issue07-12

COPYRIGHT

© 2025 Original content from this work may be used under the terms
of the creative commons attributes 4.0 License.

Investi
Retrieval-Augmented
Generation (RAG) for Real-
Time Financial Market
Analysis

Priyank Tailor

Data Scientist / AI Researcher Jersey City, NJ, USA

Abstract

- The rapid growth of unstructured financial

data

—

ranging from earnings calls and SEC filings to real-

time social me- dia and global news

—

has outpaced the

ability of traditional analysis tools to provide timely,
contextual insights. Most natural language models are
trained on static data and lack the capacity to integrate
dynamic, real-world updates. Retrieval- Augmented
Generation (RAG) bridges this gap by combining
document retrieval with generative capabilities,
creating a more grounded and up-to-date understanding
of user queries. This paper presents a domain-adapted
RAG-based framework for real-time financial analysis,
using vector databases and domain-specific language
models. The framework demon- strates improved
contextual accuracy, reduced hallucination, and greater
interpretability compared to traditional NLP mod- els.
Our findings indicate that RAG has the potential to
become a core component in next-generation financial
intelli- gence systems.

1.

Introduction

The financial sector is inherently dynamic, characterized

by rapid market fluctuations, evolving policies, and real-
time events that significantly influence investment
decisions. Re- cent events such as sudden stock market
crashes and eco- nomic downturns underscore the need
for advanced, real-time financial analysis systems.
Traditional methods lag behind rapidly unfolding market
realities, necessitating intelligent, context-aware AI
models like RAG to drive timely decision- making.

Traditional financial analysis often relies on historical data
and static models, which struggle to keep pace with the

The American Journal of Interdisciplinary Innovations and Research

138

https://www.theamericanjournals.com/index.php/tajiir

The American Journal of Interdisciplinary Innovations and Research

sheer volume, velocity, and variety of modern financial
information. The proliferation of unstructured data from
diverse sources, including earnings call transcripts, SEC
filings, real-time news feeds, and social media, presents
both an opportunity and a challenge. While these data
sources contain invaluable insights, extracting and
synthesizing them in a timely and accurate manner is
beyond the capabilities of conventional tools. This
limitation can lead to delayed decision-making, missed
opportunities, and increased exposure to market risks.
Furthermore, traditional Natural Language Processing
(NLP) models, often trained on static datasets, tend to
hallucinate or provide outdated information when
confronted with rapidly evolving financial landscapes.
This gap between the static nature of trained models
and the dynamic reality of financial markets highlights a
critical need for more adaptive and context-aware AI
solutions.

Retrieval-Augmented Generation (RAG) models have
emerged as a promising solution by combining neural
re- trieval mechanisms with generative transformers.
These mod- els enable queries to be grounded in the
most relevant and up-to-date information from
external data sources, thereby mitigating the issues of
hallucination and outdated knowl- edge inherent in
purely generative models. The core idea behind RAG is
to augment the generative capabilities of large language
models (LLMs) by providing them with access to an
external, continuously updated knowledge base. This
hybrid approach ensures that the generated responses
are not only fluent and coherent but also factually
accurate and contextually relevant to the latest
financial developments.

The objective of this study is to design, implement, and
evaluate a Retrieval-Augmented Generation (RAG) model
op- timized for real-time financial market analysis, with
the aim of enhancing decision support systems for
traders, analysts, and policymakers. This paper will
delve into the architec- tural components of our RAG
framework, detail the diverse financial data sources
utilized, elaborate on the preprocessing and retrieval
pipelines, and present a comprehensive evalua- tion of
its performance. We will also discuss the significant
benefits of RAG in terms of contextual accuracy,
reduced hallucination, and improved interpretability,
positioning it as a vital tool for next-generation financial
intelligence systems.

The main contributions of this paper are fourfold:

1

We present a complete, end-to-end RAG framework

tai- lored specifically for the high-velocity, high-
stakes do- main of real-time financial analysis.

2

We detail a robust data ingestion and preprocessing

pipeline that integrates diverse, unstructured data
sources, from regulatory filings to social media,
into a unified knowledge base.

3

We provide a comprehensive empirical evaluation

of the system, using a combination of quantitative
metrics and qualitative assessments from financial
experts, demon- strating significant improvements
over baseline models.

We discuss the practical implementation details, ethical
considerations, and limitations, offering a blueprint for
the responsible deployment of generative AI in financial
markets.

2.

Related Work

There has been considerable research on applying
Natural Language Processing (NLP) to finance,
particularly in sentiment analysis, event detection, and
summarization of fi- nancial texts. Early efforts focused
on finetuning generalpurpose language models on
financial corpora to improve their understanding of
domainspecific terminology and nuances. For instance,

FinBERT

stands out as a foundational work in this area.

It is a BERT-based model pretrained on a large financial
corpus, enabling more accurate sentiment classification
and named entity recognition within financial
documents. This domain-specific fine-tuning proved
crucial for capturing the unique linguistic patterns and
emotional ex- pressions prevalent in financial discourse,
which often differ significantly from general language.
The success of FinBERT highlighted the importance of
domain adaptation for NLP models in specialized fields
like finance.

Recent advancements in retrieval-augmented models
have revolutionized the field of NLP by addressing the
limitations of purely generative models, particularly their
tendency to hal- lucinate or provide outdated
information. These models dy- namically fetch relevant
documents at inference time, ground- ing their responses
in up-to-date external knowledge. Notable examples
include

Google’s

REALM

(Retrieval-Augmented

Language Model pre-training)

and Facebook AI’s

RAG

(Retrieval-Augmented Generation)

. REALM introduced

the concept of pre-training a language model with a

The American Journal of Interdisciplinary Innovations and Research

139

https://www.theamericanjournals.com/index.php/tajiir

The American Journal of Interdisciplinary Innovations and Research

retrieval component, allowing it to learn to retrieve
relevant documents during the pre-training phase. This
approach demonstrated significant improvements in
open-domain question answer- ing. Similarly, Lewis et al.
proposed the RAG model, which combines a pre-trained
neural retriever with a seq2seq gen- erator, showcasing
promising results in knowledge-intensive NLP tasks.
These models laid the groundwork for integrat- ing
external knowledge into generative processes, making
AI systems more reliable and factual.

Fusion-in-Decoder

(FiD)

further improved upon this by processing multiple

re- trieved documents simultaneously in the decoder,

enhancing the model’s ability to synthesize information

from various sources and generate more comprehensive
responses.

In parallel with the development of RAG architectures,
there has been a growing recognition of the value of
domain- specific large language models (LLMs) in
finance. Projects like

BloombergGPT

have demonstrated

the immense poten- tial of training LLMs specifically on
vast financial datasets. BloombergGPT, a 50-billion
parameter LLM, was trained on a diverse range of
financial data, including news, filings, and proprietary
data, showcasing superior performance on finan- cial
NLP tasks compared to general-purpose LLMs. This un-
derscores the critical importance of fine-tuning both
retrieval and generation components on financial
datasets to achieve optimal performance in this highly
specialized domain. The insights from BloombergGPT
reinforce the notion that while general LLMs provide a
strong foundation, domain-specific adaptation is
essential for real-world financial applications.

Despite these significant advances, limited work has
been done to adapt retrieval-augmented models for the
fast-paced financial domain, particularly in the context of
real-time anal- ysis. The unique challenges of financial
data, such as its high volume, velocity, complexity, and
the stringent regulatory requirements, demand
specialized RAG systems. Traditional RAG systems often
struggle with the sheer volume and com- plexity of
financial data, which includes highly diverse and
context-sensitive information. Moreover, the need for
trans- parency and traceability in financial decision-
making necessi- tates a RAG system that can provide
auditable insights. Our proposed framework addresses
this gap by incorporating ro- bust financial data
pipelines, real-time retrieval mechanisms using vector
databases like ChromaDB, and generation with fine-

tuned transformers, specifically designed to handle the
dynamic nature of financial markets. This approach aims
to overcome the limitations of existing models by
providing a more accurate, timely, and interpretable
solution for financial market analysis.

FinBERT [

1

] stands out as a foundational work...

...Google’s

REALM [

2

] and Facebook

AI’s

RAG [

3

]...

...Fusion-in-Decoder (FiD) [

4

] further improved...

...BloombergGPT [

5

] showcased superior performance...

...noted in industry blogs [

3.

Methodology

Our proposed RAG system for real-time financial market

analysis is designed with a modular architecture to
ensure scalability, efficiency, and adaptability. The system
comprises two main components: the

Retriever

and the

Generator

. This architecture is specifically tailored to

address the unique challenges of the financial domain,
such as the need for real- time data access, high
accuracy, and interpretability.

3.1

Data Ingestion and Sources

The efficacy of our system hinges on the quality and

diversity of its data sources. We integrate multiple
heterogeneous sources to ensure a comprehensive
understanding of market dynamics:

•

SEC Filings (10-K, 10-Q, 8-K):

Sourced from the

EDGAR database, these provide structured,
fundamen- tal data on company performance, risk
factors, and major corporate events.

•

Earnings Call Transcripts:

These offer qualitative in-

sights into

management’s

perspective, future

outlook, and sentiment, which are often not
captured in formal filings.

•

Real-Time Market News:

We ingest continuous news

feeds from reputable providers like Bloomberg and
Reuters to capture breaking news, geopolitical
events, and macroe- conomic announcements that
can instantly impact markets.

•

Social Media:

We process data from platforms like

Twitter (now X) and Reddit to gauge real-time
public sentiment, identify emerging trends, and
detect viral discussions re- lated to specific assets.

3.2

Data Preprocessing and Cleaning

The American Journal of Interdisciplinary Innovations and Research

140

https://www.theamericanjournals.com/index.php/tajiir

The American Journal of Interdisciplinary Innovations and Research

Before vectorization, raw textual data undergoes a
rigorous preprocessing pipeline to transform noisy,
heterogeneous data into a high-quality, standardized

format. This is a critical step, as the quality of the input
data directly impacts the relevance

Figure 1: Detailed System Architecture Diagram, showing the flow from data sources through ingestion and

inference pipelines

of the retrieved context and the accuracy of the
generated response.

1.

Text Extraction and Normalization:

We first extract

plain text from various formats (e.g., HTML, PDF). We
then normalize the text by converting it to lowercase,
re- moving special characters and irrelevant
boilerplate

text

(e.g.,

legal

disclaimers,

headers/footers), and standardiz- ing whitespace.

2.

Segmentation Strategy:

Long documents are

segmented into smaller, manageable chunks. Our
strategy is content- aware: SEC filings are segmented
by paragraph to pre- serve semantic context, while
earnings call transcripts are segmented by speaker
turn. News articles and social media posts are
segmented into sentences or short para- graphs (max
256 tokens) to ensure optimal chunk size for
embedding and retrieval.

3.

Named Entity Recognition (NER):

We employ a custom

SpaCy model, fine-tuned on a financial entity dataset,
to identify and classify key entities such as company
names (e.g.,

‘Tesla‘),

financial metrics (e.g.,

‘EPS‘,

‘P/E

ratio‘), and key events. This enhances retrieval

precision by en- abling entity-aware search.

3.3

Vector Database and Indexing Strategy

The core of our retrieval pipeline is the vector database,

which stores the embeddings of the preprocessed data
chunks.

•

Vector Store Implementation:

We selected

ChromaDB

as our primary vector store due to its

ease of integration, scalability, and straightforward
API for building LLM ap- plications. For our
experiments, we configured ChromaDB with an in-
memory client for rapid development and a per-
sistent client for larger-scale evaluations.

•

Indexing Mechanism:

We utilize the

Hierarchical

Navi- gable Small World (HNSW)

index,

implemented

via the ‘hnswlib‘

library. HNSW is

chosen for its exceptional per- formance in
approximate nearest neighbor (ANN) search,
providing an excellent trade-off between search
speed and accuracy, which is paramount for real-
time applications.

The American Journal of Interdisciplinary Innovations and Research

141

https://www.theamericanjournals.com/index.php/tajiir

The American Journal of Interdisciplinary Innovations and Research

•

Embedding Models:

We use a dual-model approach

for vectorization. For formal documents like SEC
filings and earnings calls, we use

FinBERT

, a model

pre-trained on a vast financial corpus that captures
the nuances of financial language. For less formal
sources like social media and news, we use

Sentence-BERT

, which is optimized for producing

semantically meaningful sentence embeddings for
a broader range of contexts.

3.4

Retrieval and Generation Pipeline

3.4.1

Retrieval Process

When a user query is received, it is first vectorized using

the same embedding model appropriate for the query’s

context. The retriever then performs a similarity search
against the HNSW index in ChromaDB using

cosine

similarity

as the distance metric. This metric is

particularly effective for text embeddings as it captures
semantic relatedness by focusing on the orientation of
the vectors rather than their magnitude. The pipeline
identifies the

top-k

most relevant document chunks,

where

‘k‘

is empirically set to 5 to provide sufficient

context without overwhelming the generative model.

3.4.2

Generation Process

The retrieved document chunks are concatenated with
the original user query using a structured template and
fed into a fine-tuned

BART (Bidirectional and Auto-

Regressive Transformers)

model. BART is well-suited

for this task due to its denoising autoencoder
architecture, which makes it robust for synthesizing
responses from diverse and some- times noisy
document chunks. The model is fine-tuned on a
proprietary dataset of financial question-answer pairs to
align its responses with financial terminology and
reporting standards. The fine-tuning process involved
training for 5 epochs with a batch size of 8, using the
AdamW optimizer with a learning rate of 2e-5. Early
stopping was implemented based on validation loss to
prevent overfitting.

4.

Results and Discussion

Our comprehensive evaluation of the RAG system for
real- time financial market analysis yielded promising
results across multiple dimensions. The system
demonstrated signif- icant improvements over traditional
NLP approaches in terms of accuracy, factuality, and
efficiency,

while

maintaining

high

levels

of

interpretability and business value as assessed by
financial analysts.

4.1

Quantitative Performance

The RAG system achieved impressive scores on standard
NLP evaluation metrics, summarized in Table

1

.

BLEU

scores

, which measure precision, ranged from 0.72 to

0.85 across different query types. The highest
performance was observed for factual queries about
company financials (e.g., "What was

Apple’s

revenue in

Q4?"), while more complex an- alytical questions
requiring synthesis across multiple sources yielded
scores on the lower end of the range.

ROUGE scores

,

which measure recall, were consistently high, with
ROUGE-1 scores averaging 0.78, ROUGE-2 at 0.65, and
ROUGE-L at

0.74. These results indicate that the system effectively
cap- tures and reproduces the essential information from
retrieved documents while maintaining high linguistic
quality.

Factuality

metrics

showed

particularly

strong

performance. With

FEVER scores

achieving 0.89 accuracy

in fact verifica- tion tasks and

FactCC scores

reaching 0.82

for factual con- sistency, our model demonstrates a
significant improvement over baseline generative
models, which typically achieve FEVER scores around
0.65-0.70. This enhanced factuality is a direct result of the
RAG

architecture’s

reliance on retrieved, verifiable

sources, which effectively grounds the model and
prevents it from relying on potentially outdated or
incorrect information from its training data.

Table 1: Summary of Quantitative Evaluation Metrics

Metric

Our RAG System

Baseline Model

BLEU Score

0.72 - 0.85

0.55 - 0.65

ROUGE-1

0.78

0.62

ROUGE-L

0.74

0.58

FEVER Accuracy

0.89

0.68

FactCC Consistency

0.82

0.61

The American Journal of Interdisciplinary Innovations and Research

142

https://www.theamericanjournals.com/index.php/tajiir

The American Journal of Interdisciplinary Innovations and Research

4.2

Efficiency Analysis

Time-to-response benchmarks revealed that the system

main- tains excellent performance under real-time
constraints. Aver- age response times were

1.2 seconds

for simple queries

and

2.8 seconds for complex multi-source queries

. The 95th

percentile latency remained under 4.5 seconds even
during peak load conditions. The system demonstrated
through- put capabilities of up to 50 queries per second
on the speci- fied hardware configuration, meeting the
demanding require- ments of real-time financial analysis
environments where millisecond-level latency can be
critical.

4.3

Qualitative Assessment and Discussion

Human evaluation by experienced financial analysts
provided crucial insights into the practical utility of the
system.

Con- textual correctness scores

averaged 4.3

out of 5, with an- alysts noting that the system
consistently provided relevant and accurate information
aligned with current market condi- tions.

Business value

assessments

were particularly strong, averaging 4.1 out

of 5, with analysts highlighting the sys-

tem’s

ability to

synthesize information from multiple sources and
provide actionable insights for investment decisions.

Interpretability scores

were exceptionally high at 4.6 out

of 5, largely due to the

system’s

transparent source

attribution and confidence scoring mechanisms.
Analysts appreciated the ability to trace generated
insights back to specific doc- uments and assess the
reliability of information based on confidence scores.
This transparency is crucial for regulatory compliance and
risk management in financial applications, as it provides a
clear audit trail for how an AI-generated insight was
formed.

4.4

Comparative Analysis

As illustrated in Figure

2

, our RAG approach showed sub-

stantial improvements over both traditional keyword-
based systems and general-purpose LLMs without
retrieval aug- mentation. Traditional systems achieved
accuracy scores of only 0.45-0.55 on similar tasks, while
non-RAG LLMs scored 0.60-0.68. The RAG

system’s

performance of 0.72-

0.85 represents a significant advancement. The system
also demonstrated superior handling of recent events. In
tests in- volving queries about events occurring after the
training cut- off dates of traditional models, our RAG
system maintained high accuracy (0.81) while static
models showed dramatic performance degradation
(0.23).

Figure 2: Comparative performance of our RAG system

against baseline models, showing minimum and
maximum accuracy scores.

5

Qualitative Analysis

While quantitative metrics provide an objective

measure of performance, a qualitative analysis is
essential to understand the practical utility and nuances
of the

system’s

output. We conducted a review of

generated responses with experienced financial
analysts.

One key finding was the

system’s

ability to synthesize

in- formation from multiple sources to provide a
comprehensive answer. For instance, when asked,

*"What is the market sentiment regarding Tesla’s

upcoming battery day, consid- ering recent news and
executive statements?"*, a traditional model might
provide a generic summary. Our RAG system, however,
generated a nuanced response that:

•

Cited a recent news article about a new patent filing

(from the news feed).

•

Referenced a specific, optimistic quote from Elon

Musk’s latest earnings call (from the transcript).

Included a summary of retail investor sentiment from
Red- dit, noting both excitement and skepticism
(from social media).

•

Highlighted the official risk factors mentioned in the

latest 10-Q filing (from SEC data).

This ability to provide a multi-faceted, evidence-based
narra- tive was consistently rated as highly valuable by
the analysts. The source attribution feature was
particularly praised, as it allowed analysts to
immediately verify the information by clicking through
to the original documents.

6

Limitations

Despite its strong performance, our RAG system has sev-
eral limitations that must be acknowledged for
responsible deployment and future research.

•

Data Source Quality and Bias:

The

system’s

insights

are fundamentally dependent on the quality and
coverage of its underlying data sources. Inherent
biases in news reporting or social media discussions
(e.g., overly positive or negative coverage of certain
companies) can be perpetuated and amplified by
the system if not carefully monitored.

•

Handling Conflicting Information:

The system faces

challenges in resolving direct contradictions
between au- thoritative sources. While our post-
processing steps can flag uncertain content, the
model does not yet have a so- phisticated

The American Journal of Interdisciplinary Innovations and Research

143

https://www.theamericanjournals.com/index.php/tajiir

The American Journal of Interdisciplinary Innovations and Research

mechanism for determining which source is more
credible, which often requires human-level domain
expertise.

•

Emerging Financial Instruments:

The system

’s

perfor- mance may degrade when dealing with
highly specialized or emerging financial instruments
and concepts (e.g., com- plex derivatives, novel
cryptocurrencies) that are not well- represented in
the training data of the embedding or gener- ative
models.

•

Scalability and Cost:

While the system is efficient,

main- taining a real-time ingestion and vectorization
pipeline for a massive volume of global financial data
is computationally expensive and presents a
significant engineering challenge for large-scale
deployment.

7

Conclusion

This paper has presented a comprehensive Retrieval-
Augmented Generation (RAG) framework specifically
de- signed for real-time financial market analysis. We
have de- tailed its modular architecture, encompassing
robust retrieval and generation pipelines,

and

highlighted the critical role of diverse and continuously
updated financial data sources. Our methodology
emphasized domain-specific preprocessing, efficient
vector storage using ChromaDB, and fine-tuned gen-
erative models like BART, all tailored to address the
unique challenges of the financial sector.

The evaluation results underscore the significant advan-
tages of our RAG system.

Quantitatively, it

demonstrated superior accuracy and factuality compared
to traditional NLPmodels, with high BLEU and ROUGE
scores and impressive performance on FEVER and
FactCC metrics, significantly reducing the incidence of
hallucinations. Crucially, the sys- tem achieved low
latency and high throughput, meeting the demanding
efficiency requirements of real-time financial en-
vironments. Qualitatively, human evaluations by
financial

analysts confirmed the system’s contextual

correctness, busi- ness value, and, most importantly, its
interpretability through transparent source attribution.
This transparency is vital for building trust and ensuring
compliance in the highly regulated financial domain.

In conclusion, Retrieval-Augmented Generation stands
as a transformative technology for financial intelligence.
By bridging the gap between static knowledge and
dynamic mar- ket realities, our RAG framework offers a
powerful, accurate, and interpretable solution for
navigating the complexities of real-time financial
markets, empowering professionals with the insights
needed to make informed and timely decisions.

7.1

Future Work

While the current iteration of our RAG system

represents a substantial advancement, future work will
focus on several key areas.

•

Enhanced Data Ingestion:

We plan to enhance the

real- time data ingestion capabilities to incorporate
even more ephemeral data sources, such as high-
frequency trading signals and live audio feeds from
earnings calls, which will further improve the

system’s responsiveness.

•

Advanced Ambiguity Handling:

We will explore more

advanced techniques for handling highly nuanced
or am- biguous financial language. This includes
potentially using reinforcement learning from
human feedback (RLHF) to refine the generative

model’s

ability to provide even more precise

insights when faced with conflicting reports.

•

Multimodal Integration:

We aim to integrate

multimodal data, such as financial charts, news
videos, and satellite imagery, which could unlock
new dimensions of analysis and provide a richer
context for decision-making.

•

Personalization:

Finally, further research into

personal- ized RAG systems that can adapt to
individual analyst preferences, risk profiles, and
specific investment strategies would be a valuable
direction for creating a truly bespoke financial
analysis tool.

References

1

D. Araci,

“Finbert:

Financial sentiment analysis

with pre-

trained language models,”

arXiv preprint

arXiv:1908.10063

, 2019.

2

K. Guu, K. Lee, Z. Tung, P. Pasupat, and M.-W.

Chang, “Realm: Retrieval

-augmented language

model pre-

training,”

International conference on

machine learn- ing

, pp. 3929

–

3938, 2020.

3

P. Lewis, E. Perez, A. Piktus, F. Petroni, V. Karpukhin,

N. Goyal, H. Küttler, M. Lewis, W.-t. Yih, T. Rocktäschel

et al.

, “Retrieval

-augmented generation for

knowledge- intensive nlp

tasks,”

Advances in neural

information pro- cessing systems

, vol. 33, pp. 9459

–

9474, 2020.

4

G. Izacard and E. Grave, “Leveraging passage

retrieval with generative models for open domain
question

answer-

ing,”

arXiv

preprint

arXiv:2007.01282

, 2020.

5

S. Wu, O. Irsoy, S. Lu, V. Dabravolski, M. Dredze,

S. Gehrmann, P. Kambadur, D. Rosenberg, and G.

The American Journal of Interdisciplinary Innovations and Research

144

https://www.theamericanjournals.com/index.php/tajiir

The American Journal of Interdisciplinary Innovations and Research

Mann,

“Bloomberggpt:

A large language model for

finance,”

arXiv preprint arXiv:2303.17564

, 2023.

6

RavenPack,

“The

most powerful rag for

finance,”

2024, accessed: 2024-07-19. [Online]. Available:

https://www. ravenpack.com/blog/most-powerful-
rag-for-finance

7

Datategy, “Scaling rag systems in financial or

-

ganizations,”

2025, accessed: 2025-07-19. [On-

line].

Available:

https://www.datategy.net/2025/03/26/

scaling-

rag-systems-in-financial-organizations/

References

D. Araci, “Finbert: Financial sentiment analysis with pre-trained language models,” arXiv preprint arXiv:1908.10063, 2019.

K. Guu, K. Lee, Z. Tung, P. Pasupat, and M.-W. Chang, “Realm: Retrieval-augmented language model pre-training,” International conference on machine learn- ing, pp. 3929–3938, 2020.

P. Lewis, E. Perez, A. Piktus, F. Petroni, V. Karpukhin, N. Goyal, H. Küttler, M. Lewis, W.-t. Yih, T. Rocktäschel et al., “Retrieval-augmented generation for knowledge- intensive nlp tasks,” Advances in neural information pro- cessing systems, vol. 33, pp. 9459–9474, 2020.

G. Izacard and E. Grave, “Leveraging passage retrieval with generative models for open domain question answer- ing,” arXiv preprint arXiv:2007.01282, 2020.

S. Wu, O. Irsoy, S. Lu, V. Dabravolski, M. Dredze, S. Gehrmann, P. Kambadur, D. Rosenberg, and G. Mann, “Bloomberggpt: A large language model for finance,” arXiv preprint arXiv:2303.17564, 2023.

RavenPack, “The most powerful rag for finance,” 2024, accessed: 2024-07-19. [Online]. Available: https://www. ravenpack.com/blog/most-powerful-rag-for-finance

Datategy, “Scaling rag systems in financial or- ganizations,” 2025, accessed: 2025-07-19. [On- line]. Available: https://www.datategy.net/2025/03/26/ scaling-rag-systems-in-financial-organizations/