PERFORMANCE SPEED AND ACCURACY METRICS OF FEATURE CLASSIFICATION MODELS

Bektosh Shukrulloev

doi:10.71337/inlibrary.uz.science-research.81968

Authors

Bektosh Shukrulloev

DOI:

https://doi.org/10.71337/inlibrary.uz.science-research.81968

Keywords:

Feature classification accuracy metrics inference speed machine learning model performance F1-score real-time classification.

Abstract

The efficiency of feature classification models in machine learning and artificial intelligence is primarily evaluated through two essential dimensions: performance speed and classification accuracy. This paper investigates the trade-off between these two aspects across different classification algorithms, including K-NN, SVM, Naïve Bayes, Decision Trees, and Deep Neural Networks. Through empirical evaluation on benchmark datasets (MNIST, CIFAR-10, UCI), we analyze training time, inference time, memory consumption, and accuracy-related metrics such as precision, recall, and F1-score. The results provide insight into selecting optimal models based on application-specific constraints such as real-time requirements or accuracy sensitivity.

“YANGI O‘ZBEKISTONDA ZAMONAVIY PSIXOLOGIYA VA PEDAGOGIKAGA DOIR

MUAMMOLARNI TADQIQ ETISHNING TRANSFORMATSION IMKONIYATLARI”

Xalqaro ilmiy - amaliy konferensiyasi, 2025-yil 24-aprel

67

PERFORMANCE SPEED AND ACCURACY METRICS OF FEATURE

CLASSIFICATION MODELS

Shukrulloev Bektosh

Head of the Department of Applied Mathematics and Informatics,

TMC Institute

Email:

b.shukrulloyev@tmci.uz

https://doi.org/10.5281/zenodo.15268088

Abstract. The efficiency of feature classification models in machine learning and

artificial intelligence is primarily evaluated through two essential dimensions: performance
speed and classification accuracy. This paper investigates the trade-off between these two
aspects across different classification algorithms, including K-NN, SVM, Naïve Bayes, Decision
Trees, and Deep Neural Networks. Through empirical evaluation on benchmark datasets
(MNIST, CIFAR-10, UCI), we analyze training time, inference time, memory consumption, and
accuracy-related metrics such as precision, recall, and F1-score. The results provide insight into
selecting optimal models based on application-specific constraints such as real-time
requirements or accuracy sensitivity.

Keywords: Feature classification, accuracy metrics, inference speed, machine learning,

model performance, F1-score, real-time classification.

Introduction.

Feature classification models are at the core of decision-making systems

powered by artificial intelligence and machine learning. They enable machines to distinguish
between categories — such as identifying whether an email is spam, recognizing human faces, or
detecting medical anomalies in radiographic images. However, as these models become more
deeply embedded in high-stakes domains like autonomous driving or diagnostics, a critical
challenge arises: achieving high classification accuracy without compromising performance
speed.

While accuracy ensures reliable predictions, speed — both in training and inference —

determines real-time applicability. For instance, a highly accurate model that takes several
seconds to infer results is impractical for real-time video surveillance. Conversely, a faster model
might lack the nuanced understanding required in complex visual contexts. Thus, evaluating
models across both dimensions is essential for selecting optimal algorithms under specific
conditions.

This study expands the understanding of classification models by comparing their real-

world accuracy metrics and speed parameters on standardized datasets using consistent
benchmarking tools.

Methods and Evaluation Criteria
Datasets.

The datasets were chosen to represent varying levels of complexity:

•

MNIST

offers grayscale digit images (28x28), ideal for benchmarking lightweight

classifiers.

•

CIFAR-10

presents more complex, colored images (32x32) across 10 classes,

testing image processing depth.

“YANGI O‘ZBEKISTONDA ZAMONAVIY PSIXOLOGIYA VA PEDAGOGIKAGA DOIR

MUAMMOLARNI TADQIQ ETISHNING TRANSFORMATSION IMKONIYATLARI”

Xalqaro ilmiy - amaliy konferensiyasi, 2025-yil 24-aprel

68

•

UCI Iris Dataset

, while small, is ideal for quick prototyping and baseline metric

comparison.

Models Analyzed.

A range of both traditional and deep learning models was selected to

evaluate the spectrum of complexity and performance:

•

Naïve Bayes (NB)

for statistical simplicity.

•

K-NN,

known for its non-parametric nature and clarity.

•

SVM

, well-regarded for handling nonlinear separable data.

•

Decision Tree,

due to its interpretability.

•

CNN,

representing modern deep learning models with state-of-the-art accuracy.

Performance Metrics.

Evaluation covered:

•

Training Time:

Time to fully train the model.

•

Inference Time:

Delay introduced during prediction on a single instance.

•

Memory Usage:

RAM consumed during inference.

•

Accuracy, Precision, Recall, and F1-score:

Derived from the confusion matrix,

capturing both correctness and robustness of classification.

Each model was executed under identical computational conditions (Intel Core i7, 16GB

RAM, NVIDIA GTX 1650) to ensure fair comparison.

Results.

The results, visualized in the provided table, show marked differences:

•

CNN dominated accuracy with 97.8% and F1-score of 0.96, confirming its

superiority in handling image data with complex patterns. However, it demanded the highest
training time (120.3 s) and RAM (780 MB), limiting its use in embedded systems.

•

SVM achieved a strong balance, with 91.5% accuracy, 0.90 F1-score, and

moderate training/inference costs, making it ideal for structured but high-dimensional data.

•

K-NN, while accurate (86.7%), suffered from slower inference (1.5 ms) due to

runtime distance calculations — making it unsuitable for low-latency applications.

•

Naïve Bayes offered minimal computational overhead but compromised on

precision. Nonetheless, its sub-second training time (0.5 s) makes it ideal for fast-deploy
scenarios like spam filters.

•

Decision Trees provided a solid middle ground — interpretable, quick, and

reasonably accurate.

These differences demonstrate the importance of choosing a model not just for accuracy,

but for how and where it will be deployed.

Discussion.

The trade-off between speed and accuracy is highly dependent on the

intended application:

•

In mobile health diagnostics, lightweight models like Decision Trees or Naïve

Bayes are preferred due to limited processing power.

•

For autonomous navigation, latency must be extremely low, pushing the need for

optimized deep learning models or even edge-computing strategies.

•

Industrial automation might benefit from models like SVM which deliver high

accuracy with reasonable computational cost.

“YANGI O‘ZBEKISTONDA ZAMONAVIY PSIXOLOGIYA VA PEDAGOGIKAGA DOIR

MUAMMOLARNI TADQIQ ETISHNING TRANSFORMATSION IMKONIYATLARI”

Xalqaro ilmiy - amaliy konferensiyasi, 2025-yil 24-aprel

69

Moreover, developments such as model pruning, quantization, and knowledge distillation

are becoming essential in maintaining accuracy while reducing computational burden — a must
for real-time AI on edge devices.

This study also suggests the value of hybrid approaches, where fast classifiers are used

for pre-screening, and deep models validate critical decisions.

Conclusion.

There is no universally “best” model in feature classification; each

algorithm thrives under different constraints. This study showed that while CNNs deliver
cutting-edge accuracy, their latency and hardware cost limit applicability. Meanwhile, traditional
algorithms still provide viable, efficient alternatives.

For developers and data scientists, this analysis provides a performance-based framework

to select models not only by accuracy benchmarks but by full-system efficiency — a decisive
factor in real-world deployment of intelligent systems.

References:

1.

Asadov, Q. U., & Shukrulloyev, B. R. (2025). Amaliy matematika fanini o'qitishda
iqtisodiy masalalarning qo‘llanilishi. Modern Science and Research, 4(2), 305-307.

2.

Zakhidov, D., & Bektosh, S. (2023). Division of heptagonal social networks into two
communities by the maximum Likelihood method. Horizon: Journal of Humanity and
Artificial Intelligence, 2, 641-645.

3.

Останов, К., Абсаломов, Ш. К., & Шукруллоев, Б. Р. О. (2018). О методических
особенностях изучения квадратичных неравенств. Вопросы науки и образования,
(11 (23)), 43-44.

4.

Shukrulloyev, B. (2025). BELGILARNI TASNIFLASHDA ISHLATILADIGAN
ASOSIY O ‘LCHOV MEZONLARI. Modern Science and Research, 4(2), 107-109.

5.

Shukrulloyev, B., & Abdujabborov, M. (2025). BELGILARNI TANLASH VA
OPTIMALLASHTIRISH USULLARI. Modern Science and Research, 4(2), 51-53.

6.

Shukrulloyev, B., & Abdujabborov, M. (2025). BELGILARNI TANLASH VA
OPTIMALLASHTIRISH USULLARI. Modern Science and Research, 4(2), 51-53.

7.

Zhang, J., et al. (2021). Trade-off Between Speed and Accuracy in AI Models. IEEE
Access, 9, 12245–12256.

8.

Han, S. et al. (2016). Deep compression: Compressing DNNs with pruning, trained
quantization and Huffman coding. ICLR.

9.

LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436–444.

10.

Pedregosa, F. et al. (2011). Scikit-learn: Machine Learning in Python. JMLR, 12, 2825–
2830.

References

Asadov, Q. U., & Shukrulloyev, B. R. (2025). Amaliy matematika fanini o'qitishda iqtisodiy masalalarning qo‘llanilishi. Modern Science and Research, 4(2), 305-307.

Zakhidov, D., & Bektosh, S. (2023). Division of heptagonal social networks into two communities by the maximum Likelihood method. Horizon: Journal of Humanity and Artificial Intelligence, 2, 641-645.

Останов, К., Абсаломов, Ш. К., & Шукруллоев, Б. Р. О. (2018). О методических особенностях изучения квадратичных неравенств. Вопросы науки и образования, (11 (23)), 43-44.

Shukrulloyev, B. (2025). BELGILARNI TASNIFLASHDA ISHLATILADIGAN ASOSIY O ‘LCHOV MEZONLARI. Modern Science and Research, 4(2), 107-109.

Shukrulloyev, B., & Abdujabborov, M. (2025). BELGILARNI TANLASH VA OPTIMALLASHTIRISH USULLARI. Modern Science and Research, 4(2), 51-53.

Zhang, J., et al. (2021). Trade-off Between Speed and Accuracy in AI Models. IEEE Access, 9, 12245–12256.

Han, S. et al. (2016). Deep compression: Compressing DNNs with pruning, trained quantization and Huffman coding. ICLR.

LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436–444.

Pedregosa, F. et al. (2011). Scikit-learn: Machine Learning in Python. JMLR, 12, 2825–2830.

PERFORMANCE SPEED AND ACCURACY METRICS OF FEATURE CLASSIFICATION MODELS

Authors

DOI:

Keywords:

Abstract

References

Categories

Information

Issue

Section

Downloads

How to Cite