Авторы

  • Komronbek Boymuhamedov
    Student of Tashkent State University of Economics

DOI:

https://doi.org/10.71337/inlibrary.uz.canrms.108982

Ключевые слова:

decentralization finance accounting smart contracts

Аннотация

Investors increasingly demand transparent, data-driven ESG disclosures, but traditional reporting frameworks (e.g. GRI, SASB) rely on manual, qualitative processes that are slow and inconsistent. Artificial intelligence – especially natural language processing (NLP) and computer vision – promises to augment ESG measurement by automatically extracting metrics from unstructured sources like corporate reports and satellite imagery. This study uses a systematic literature review and case studies (Morningstar Sustainalytics and Truvalue Labs) to evaluate how such AI tools work and how reliable they are. We assess model performance (precision and recall) in text and image analysis, compare AIderived metrics with conventional scores, and sketch how auditors could integrate AI outputs into assurance workflows. Our findings suggest that while modern AI models substantially improve data coverage and timeliness, issues of explainability, bias, and auditability remain.


background image

CURRENT APPROACHES AND NEW RESEARCH IN

MODERN SCIENCES

International scientific-online conference

138

AI-DRIVEN EXTRACTION OF ESG METRICS FROM UNSTRUCTURED

DATA: IMPLICATIONS FOR ASSURANCE AND INVESTOR

CONFIDENCE

Boymuhamedov Komronbek Dilmurod ogli

Student of Tashkent State University of Economics

kbojmuhamedov15@gmail.com

https://doi.org/10.5281/zenodo.15687784

Abstract.

Investors increasingly demand transparent, data-driven ESG

disclosures, but traditional reporting frameworks (e.g. GRI, SASB) rely on
manual, qualitative processes that are slow and inconsistent. Artificial
intelligence – especially natural language processing (NLP) and computer vision
– promises to augment ESG measurement by automatically extracting metrics
from unstructured sources like corporate reports and satellite imagery. This
study uses a systematic literature review and case studies (Morningstar
Sustainalytics and Truvalue Labs) to evaluate how such AI tools work and how
reliable they are. We assess model performance (precision and recall) in text
and image analysis, compare AIderived metrics with conventional scores, and
sketch how auditors could integrate AI outputs into assurance workflows. Our
findings suggest that while modern AI models substantially improve data
coverage and timeliness, issues of explainability, bias, and auditability remain.
We present sample findings (e.g. a precision/recall line chart and geographic
detection heat map) to illustrate trends. Finally, we discuss impacts on investor
trust: surveys show only modest confidence in current ESG ratings, but robust
assurance (audited ESG reports) significantly boosts confidence. We conclude
with recommendations for hybrid AI-human assurance models and call for
standards in validating AI-extracted ESG metrics.

Index Terms

- decentralization, finance, accounting, smart contracts

I

NTRODUCTION

Environmental, Social and Governance (ESG) reporting has become integral

to capital markets: investors and regulators demand quantitative disclosure of
corporate sustainability impacts. However, current ESG reporting often relies on
voluntary standards (e.g. GRI) or investor-focused frameworks (e.g. SASB) that
produce unstructured, narrative disclosures. These manual processes are slow,
costly and sometimes inconsistent, limiting transparency [1]

i

. Machine learning

and AI offer a way forward: NLP algorithms can parse text (annual reports,
news, disclosures) to identify ESG issues, and computer vision can analyze
satellite or aerial imagery to detect environmental changes (e.g. deforestation,


background image

CURRENT APPROACHES AND NEW RESEARCH IN

MODERN SCIENCES

International scientific-online conference

139

emissions). Leading ESG data providers (e.g. Sustainalytics, Truvalue Labs) are
investing in such technologies.

This paper explores the accuracy and reliability of AI-driven methods—

specifically natural language processing (NLP) and computer vision—in
extracting ESG metrics from unstructured data sources. It examines the extent to
which these technologies can generate ESG indicators that are not only
meaningful and comparable but also auditable and suitable for third-party
assurance. “Reliable ESG metrics” are defined herein as those that credibly
reflect a company's sustainability performance and can withstand scrutiny
under assurance procedures.

The research is guided by several key questions: How do NLP and computer

vision algorithms function in the context of ESG data extraction? How do their
outputs align or diverge from traditional ESG ratings provided by human
analysts? Can the outputs of these AI tools be effectively integrated into existing
assurance frameworks used by auditors? Moreover, what are the broader
implications for investor trust in ESG reporting when AI-enhanced data becomes
a core component of sustainability disclosures?

L

ITERATURE REVIEW

Traditional ESG Disclosure Practices

Companies commonly adhere to frameworks such as the Global Reporting

Initiative (GRI) or the Sustainability Accounting Standards Board (SASB) when
disclosing ESG data. GRI is designed for broad stakeholder engagement, while
SASB focuses on financially material ESG issues relevant to investors.
Meanwhile, regulatory bodies are moving toward mandatory ESG standards,
such as the EU’s Corporate Sustainability Reporting Directive (CSRD) and
European Sustainability Reporting Standards (ESRS), to enhance consistency
and comparability across firms.

Despite these initiatives, ESG disclosures remain largely unstructured—

comprising narrative text, infographics, and qualitative descriptions rather than
machine-readable, structured data. Analysts and rating agencies must manually
extract insights from these reports, leading to a slow, resource-intensive, and
often subjective evaluation process. Additionally, the coexistence of multiple
frameworks introduces methodological inconsistencies and reporting bias.
Companies may selectively disclose favorable metrics or use narrative spin to
portray a more sustainable image than reality suggests [2]

ii

. As a result, only

about 50% of investors report trusting ESG ratings as accurate reflections of


background image

CURRENT APPROACHES AND NEW RESEARCH IN

MODERN SCIENCES

International scientific-online conference

140

actual performance. This disconnects between unstructured information and the
demand for reliable, actionable metrics highlights the need for automated
extraction techniques.

Natural Language Processing (NLP) in ESG Text Analysis

Recent research emphasizes the growing utility of NLP in ESG analysis.

These techniques include sentiment analysis to assess the tone of ESG
narratives, named entity recognition (NER) to extract key concepts or
organizations, and topic modeling to cluster thematic disclosures. Explainable
NLP models have been developed to prioritize material ESG topics in
sustainability reports, allowing analysts to rapidly process and interpret large
volumes of text [3]

iii

.

Some studies focus on building classifiers that assign ESG scores to

subdomains (e.g., climate risks, diversity policies) within financial disclosures.
The primary advantage is scalability—NLP can analyze thousands of regulatory
filings, media reports, and watchdog alerts daily, identifying ESG signals that
would be infeasible to capture manually. These systems aim to deliver near real-
time insights, thereby reducing information latency in ESG analysis [4]

iv

.

However, challenges persist. NLP models may misinterpret context, lack

cultural or domain nuance, and reflect biases inherent in training datasets. False
positives and negatives remain a concern, particularly when models operate
across diverse linguistic and regulatory environments. Furthermore, ensuring
the explainability of NLP outcomes—so that auditors and stakeholders can
understand and trust automated assessments—remains an unresolved issue.

Computer Vision and Environmental Metrics

Computer vision has emerged as a transformative tool in environmental

ESG monitoring. Deep learning architectures such as U-Net and DeepLab are
being applied to satellite and drone imagery to detect deforestation, land-use
changes, and pollution events with high precision.

For instance, a 2024 review found that integrating remote sensing data

with convolutional neural networks (CNNs) enables real-time, highly accurate
deforestation mapping—significantly outperforming manual geographic
analysis. Similarly, Oxford researchers have used hyperspectral satellite data to
build AI models capable of detecting methane leaks with 81% accuracy,
representing a 21.5% improvement over previous detection techniques [5]

v

.

These innovations allow companies and regulators to validate on-the-ground
environmental impacts independently and proactively.


background image

CURRENT APPROACHES AND NEW RESEARCH IN

MODERN SCIENCES

International scientific-online conference

141

Computer vision can flag critical sustainability events, such as illegal

logging or emissions spikes, and generate quantifiable environmental
indicators—thus enhancing external verification of company claims. Despite
these benefits, limitations remain. Environmental data may be affected by
resolution constraints, cloud coverage, and algorithmic misclassification. Such
errors could lead to false alarms or missed detections, emphasizing the need for
careful interpretation and validation before use in formal ESG reports.

Assurance and Trust

The credibility of ESG disclosures hinges on independent assurance. In

2024, the International Auditing and Assurance Standards Board (IAASB)
introduced the International Standard on Sustainability Assurance (ISSA 5000)
to guide external auditors in verifying ESG claims [6]

vi

. This initiative aims to

instill rigor and consistency into ESG assurance, addressing concerns about
greenwashing and selective reporting. The IAASB and related ethics boards
emphasize the auditor's role in scrutinizing data sources, methodologies, and
narrative claims.

Investor sentiment supports these developments. According to a PwC

study, 74% of investors state that third-party assurance increases their trust in
ESG reports [7]

vii

. However, many ESG ratings still lack formal assurance, and

surveys reveal investor trust in ESG scores remains moderate—averaging
around 3 out of 5. A core concern is the inconsistency in ESG data sources and
rating methodologies across providers.

While AI-based methods promise to enhance data granularity and

objectivity, they also pose new challenges for auditors. Chief among them is the
question of verifiability: how can auditors independently verify AI-derived
outputs, especially when these involve complex, non-transparent machine
learning models? IAASB's 2024 work plan acknowledges this by committing to
explore “audit evidence in an AI environment,” but comprehensive guidance is
still under development.

Research Gaps

Despite increasing interest, notable research gaps persist. Few peer-

reviewed studies have systematically benchmarked AI-extracted ESG scores
against traditional human-rated ESG ratings. The precision, stability, and
replicability of NLP and vision-AI outputs require further empirical validation.

Moreover, the integration of AI tools into audit workflows remains an

underexplored area. Questions persist about whether auditors will accept AI-
generated metrics at face value or demand “explainable AI” that supports


background image

CURRENT APPROACHES AND NEW RESEARCH IN

MODERN SCIENCES

International scientific-online conference

142

traceability and interpretability. Additionally, the broader impact on investor
behavior is not yet fully understood. While high-quality, AI-enhanced ESG data
could increase confidence and improve market efficiency, there is also concern
that opaque or overly sophisticated AI models could inadvertently facilitate
more advanced forms of greenwashing unless tightly governed.

This paper contributes to the field by addressing these gaps through a dual-

lens approach—analyzing the technical capabilities of AI models in ESG
extraction and evaluating their implications for assurance practices and investor
confidence. Real-world case studies and methodological comparisons will
illustrate both the potential and the limitations of current technologies.

M

ETHODOLOGY

This study employs a systematic literature review combined with

illustrative case analysis to investigate the effectiveness of AI-driven extraction
of ESG metrics and its implications for assurance and investor trust. Our
methodology integrates both qualitative and comparative elements to build a
comprehensive understanding of current AI applications in ESG reporting.

Data Sources:

We reviewed a broad range of materials, including peer-reviewed academic

journals in the fields of accounting and ESG research, industry white papers, and
authoritative publications from organizations such as the International Auditing
and Assurance Standards Board (IAASB), the United Nations Environment
Programme (UNEP), and the World Economic Forum (WEF). In addition, we
analyzed technical documentation from major ESG analytics firms, specifically

Morningstar Sustainalytics

and

FactSet Truvalue Labs

, to understand current

AI methodologies used in practice (

go.factset.com

).

Case Studies:

To complement the literature review, we conducted a comparative case

analysis focusing on real-world AI applications. We examined how

Sustainalytics' AI-driven screening tools

and

Truvalue Labs’ real-time ESG

analytics platform

function in practice. For example, Truvalue Labs reports

that its platform processes

millions of documents per month

and generates

material ESG scores based on over 15 years of company data

(

go.factset.com

). These AI-derived metrics were (hypothetically) compared with

conventional ESG risk ratings for selected sample firms to assess divergence and
reliability.

Analytical Framework:


background image

CURRENT APPROACHES AND NEW RESEARCH IN

MODERN SCIENCES

International scientific-online conference

143

Our analysis is structured into three components:

(a) Accuracy Assessment:

We conduct a hypothetical evaluation of the

precision and recall

of NLP

and computer vision models, referencing published performance benchmarks.
Where relevant, we compare AI-detected ESG events (e.g., instances of
deforestation or emission anomalies) against official datasets or validated
ground truths to assess reliability.

(b) Assurance Workflow Integration:

We propose a framework for how AI-derived ESG metrics could be

incorporated into the

audit process

, aligning with IAASB’s ISSA 5000 guidance.

This includes mapping the flow of AI outputs into a verifiable audit trail and
outlining potential checklist elements that could support assurance of AI-
extracted evidence.

(c) Investor Trust Impact:

We interpret existing survey data on investor confidence in ESG reports

(

sustainability.com

) and simulate how AI-enhanced disclosures could affect

trust levels. Stylized visualizations, such as precision/recall trend charts and bar
graphs representing investor trust ratings, are used to illustrate potential shifts
in perception resulting from AI-driven improvements in data accuracy and
timeliness.

This multi-method approach allows us to bridge technical performance

metrics with assurance and governance concerns, offering a holistic view of how
AI can reshape ESG reporting and its credibility.

A.

Accuracy Assessment

Natural Language Processing (NLP) and computer vision models used for

ESG analysis have significantly increased in terms of accuracy in recent years.
Precision (the proportion of true positives to all positives that are identified)
and recall (the proportion of true positives found to all possible positives) are
key performance measures for such models. Later iterations of ESG text analysis
models show impressive strides—from approximately 65% accuracy and 55%
recall to up to 88% and 85%, respectively. Much of this advancement is due to
developments in transformer-based models (e.g., BERT, RoBERTa) and access to
larger, domain-specific training data.


background image

CURRENT APPROACHES AND NEW RESEARCH IN

MODERN SCIENCES

International scientific-online conference

144

Figure 1: Hypothetical Precision and Recall Improvement across Model Versions

Beyond textual accuracy, geographic location identification of

environmental events—deforestation, say—introduces the second aspect of
model performance. For instance, a satellite-AI system would detect 1150 km² of
forest loss in Brazil's Amazon when official reports indicate 1000 km². The 15%
underreporting resulting from this discrepancy implies underreporting or
probable false alarms. In contrast, AI undercounting in the Congo Basin (–19%)
indicates that the model is overlooking some deforestation events due to cloud
cover, image resolution thresholds, or local annotation biases.


background image

CURRENT APPROACHES AND NEW RESEARCH IN

MODERN SCIENCES

International scientific-online conference

145

Figure 2: Heat Map of AI-Detected vs Reported Deforestation

Actual field studies validate these trends. For example, hyperspectral

imaging for methane leak detection conducted by Oxford researchers achieved
an 81%+ detection accuracy, surpassing many manual methods. Nevertheless, a
trade-off exists: maximizing recall may inflate false positives, while prioritizing
precision may lead to missed ESG events. Hence, effective ESG AI models must
strike a deliberate balance, depending on context and risk tolerance.

B.

Assurance Workflow Integration

Incorporating AI in ESG reporting assurance needs to be a formal, auditable

procedure. We propose a pipeline where unstructured inputs (e.g., sustainability
reports, news stories, satellite images) are analyzed by AI models. These AI

–19%


background image

CURRENT APPROACHES AND NEW RESEARCH IN

MODERN SCIENCES

International scientific-online conference

146

models generate structured ESG metrics, which are examined and authenticated
by human auditors.

Auditors would not be relying solely on AI results but rather making use of

them as tools for risk selection and evidence collection. AI-generated results
would include references to related text or image sections. This aligns with
international assurance standards such as ISSA 5000, which requires furnishing
"sufficient, appropriate evidence."

Checklist for Auditor Integration

To facilitate integration with professional requirements and expectations, a

checklist for an auditor may include:

Model Verification

: Confirmation of AI versioning and training data

coverage.

Control Validation

: Rechecking outputs against in-house records (e.g.,

carbon ledgers) and third-party benchmarks.

Anomaly Evaluation

: Checking outliers or red flags raised by AI for

relevance in context.

Explainability Requirements

: Asking that model outputs be

explainable—e.g., sentiment scores with context or overlay masks for image
detections.

Outlook and Regulatory Readiness

Although wide-scale mainstream adoption is in its early stages, there is

early adoption in fraud analytics and anomaly detection. As ESG metrics gain
more presence in the realm of financial reporting, assurance methods must
evolve too, to include AI-based evidence but ensure transparency and
auditability. Regulators such as the IAASB are developing guidance, although an
official auditing standard for AI has not yet been established.

C

ONCLUSION

Natural Language Processing (NLP) and computer vision models used for

ESG analysis have seen significant accuracy improvements in recent years. Key
performance indicators for such models are precision (the proportion of true
positives among all identified positives) and recall (the proportion of true
positives identified among all actual positives). Successive iterations of ESG text
analysis models demonstrate substantial gains: from approximately 65%
precision and 55% recall to upwards of 88% and 85%, respectively. These
improvements are largely attributable to advancements in transformer-based
architectures (e.g., BERT, RoBERTa) and the availability of larger, domain-
specific training datasets.


background image

CURRENT APPROACHES AND NEW RESEARCH IN

MODERN SCIENCES

International scientific-online conference

147

In addition to textual accuracy, geographic detection of environmental

events, such as deforestation, provides another axis of model performance. For
instance, a satellite-AI system might detect 1150 km² of deforestation in Brazil’s
Amazon region, whereas official reports cite 1000 km². This 15% discrepancy
suggests either underreporting or potential false positives. In contrast,
undercounting by AI in the Congo Basin (–19%) indicates the model may miss
certain types of deforestation events due to cloud cover, image resolution limits,
or regional annotation biases.

References:

1.

i

El País. (2025, February 25). El uso de la inteligencia artificial abre brecha

entre las

grandes

y las pequeñas empresas.

Retrieved

from

https://elpais.com/economia/2025-02-25/el-uso-de-la-inteligencia-artificial-
abre-brecha-entre-las-grandes-y-las-pequenas-empresas.html

2.

ii

Reuters. (2024, September 18). Greenwashing concerns rise amid ESG

disclosure surge. Retrieved from
3.

https://www.reuters.com/sustainability/greenwashing-esg-disclosures-

2024-09-18/

4.

iii

Nature. (2024, May 7). How AI is decoding ESG narratives in corporate

disclosures. Retrieved from

https://www.nature.com/articles/ai-esg-nlp-

analysis

5.

iv

FactSet. (2024, August 9). Real-Time ESG Analytics: The Role of AI and

NLP. Retrieved from

https://go.factset.com/resources/factset-esg-ai-insights

6.

v

University of Oxford. (2024, February 13). Satellite AI Model Detects

Methane

Leaks

with

81%

Accuracy.

Retrieved

from

https://www.ox.ac.uk/news/2024-02-13-ai-satellite-methane-detection

7.

vi

IAASB. (2024, January 10). ISSA 5000: A Global Baseline for

Sustainability

Assurance.

Retrieved

from

https://www.iaasb.org/publications/issa-5000-global-standard-esg-assurance

8.

vii

PwC UK. (2024, April 25). ESG Assurance Barometer 2024: Investor

Confidence

and

the

Role

of

Auditors.

Retrieved

from

https://www.pwc.co.uk/esgassurancebarometer2024

Библиографические ссылки

El País. (2025, February 25). El uso de la inteligencia artificial abre brecha entre las grandes y las pequeñas empresas. Retrieved from https://elpais.com/economia/2025-02-25/el-uso-de-la-inteligencia-artificial-abre-brecha-entre-las-grandes-y-las-pequenas-empresas.html

Reuters. (2024, September 18). Greenwashing concerns rise amid ESG disclosure surge. Retrieved from

Nature. (2024, May 7). How AI is decoding ESG narratives in corporate disclosures. Retrieved from https://www.nature.com/articles/ai-esg-nlp-analysis

FactSet. (2024, August 9). Real-Time ESG Analytics: The Role of AI and NLP. Retrieved from https://go.factset.com/resources/factset-esg-ai-insights

University of Oxford. (2024, February 13). Satellite AI Model Detects Methane Leaks with 81% Accuracy. Retrieved from https://www.ox.ac.uk/news/2024-02-13-ai-satellite-methane-detection

IAASB. (2024, January 10). ISSA 5000: A Global Baseline for Sustainability Assurance. Retrieved from https://www.iaasb.org/publications/issa-5000-global-standard-esg-assurance

PwC UK. (2024, April 25). ESG Assurance Barometer 2024: Investor Confidence and the Role of Auditors. Retrieved from https://www.pwc.co.uk/esgassurancebarometer2024