DETECTION OF FAKE AD CLICKS USING DATA-DRIVEN MACHINE LEARNING MODELS

Authors

  • B. Gayatri Author
  • A. Sai Charitha Author
  • A. Sri Saketh Reddy Author
  • G. Gowtham Author
  • Prof. (Dr.) Ravi Kiran Author

DOI:

https://doi.org/10.62643/ijerst.2026.v22.n1(2).pp228-232

Keywords:

Ad Click Fraud; Anomaly Detection; Class Imbalance; Feature Engineering; Isolation Forest; Risk Scoring; XGBoost.

Abstract

The proliferation of pay-per-click (PPC) digital advertising has given rise to a critical threat: fake ad click fraud, where bots or malicious actors generate illegitimate clicks causing billions of dollars in annual losses. This paper presents a hybrid machine learning system combining a supervised XGBoost classifier and an unsupervised Isolation Forest anomaly detector to identify fraudulent clicks in real time. Behavioral feature engineering extracts IP-level click counts, hourly burst patterns, and app-channel interaction frequencies from raw click-log data. The XGBoost model achieves 98.2% accuracy and an AUC-ROC of 0.974 with a scale_pos_weight of 438.56 to handle severe class imbalance. A weighted risk score (70% XGBoost + 30% Isolation Forest) drives a three-tier fraud classification: Genuine, Suspicious, and Fraud. A Streamlit dashboard provides real-time scoring, burst detection, feature importance visualization, and session export functionality

Downloads

Published

28-03-2026

How to Cite

DETECTION OF FAKE AD CLICKS USING DATA-DRIVEN MACHINE LEARNING MODELS. (2026). International Journal of Engineering Research and Science & Technology, 22(1(2), 228-232. https://doi.org/10.62643/ijerst.2026.v22.n1(2).pp228-232