0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Balanced Learning with Optimized Extra Trees Classifier for Reliable Lithology Identification in Imbalanced Well Log Data
Authors :
Ali Daneshpour
1
Behnam Yousefimehr
2
Mehdi Ghatee
3
1- Department of Mathematics and Computer Science, Amirkabir University of Technology, Tehran, Iran
2- Department of Mathematics and Computer Science, Amirkabir University of Technology, Tehran, Iran
3- Department of Mathematics and Computer Science, Amirkabir University of Technology, Tehran, Iran
Keywords :
Lithology Identification،Extra Trees Classifier،Machine Learning،Well Logs،Imbalanced Data
Abstract :
Accurate lithology identification is critical for subsurface characterization in hydrocarbon exploration, yet conventional methods often fail to capture complex nonlinear relationships in well log data. To address this challenge, we propose a robust machine learning framework based on an optimized Extra Trees Classifier, enhanced by a hybrid resampling strategy to mitigate severe class imbalance. Our approach combines random oversampling of minority lithologies (e.g., coal and dolomite) with strategic undersampling of dominant classes, ensuring balanced representation while preserving critical geological patterns. Hyperparameter tuning via optimization further refines model performance, achieving an accuracy of 83.91% with a penalty score of -0.4087, demonstrating superior reliability, particularly for underrepresented facies. A comparative computational analysis confirms our framework’s efficiency, outperforming complex models such as GrowNet, Blender, and deep neural networks in both speed and scalability. To promote reproducibility, we provide the complete implementation, including preprocessing scripts and trained models, at https://github.com/alidaneshpour/ICCKE-2025
Papers List
List of archived papers
Cluster Sampling: A Cluster-Driven Sampling Strategy for Deep Metric Learning
Hamideh Rafiee - Ahmad Ali Abin - Seyed Soroush Majd
Fine-tuned Generative Adversarial Network-based Model for Medical Image Super-Resolution
Alireza Aghelan - Modjtaba Rouhani
City Intersection Clustering and Analysis Based on Traffic Time Series
Mohammad Aminazadeh - Fakhroddin Noorbehbahani
Parallel Local Feature Selection For High-dimensional Data
Zhaleh Manbari - Chiman Salavati - Fardin AkhlaghianTab - Barzan Saeedpoor - Himan Delbina - Mahmud Abdulla Mohammad
Solving the influence maximization problem by using entropy and weight of edges
Farzaneh Kazemzadeh - Amir Karian - Mitra Mirzarezaee - Ali Asghar Safaei
Reversible Data Insertion in Encryption Domain Based on Reduced Quad Difference Expansion
Alireza Ghaemi - Mohammad Zare Ehteshami - Amirhossein Ghaemi
Adaptive Pronunciation Scoring: Aligning Automated Assessments with Human Expert Evaluations
Omid Aghdaei - Mohammad Sadegh Safari - Mohammad Hassan Rasoolizadeh - Abedeh Mirzaee
Optimizing Foreign Exchange Trading Performance Through Reinforcement Machine Learning Framework
Ervin Gubin Moung - Hani Yasmin Binti Murnizam - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Lorita Angeline
Semi-Supervised Supply Chain Fraud Detection with Unsupervised Pre-Filtering
Fatemeh Moradi - Mehran Tarif - Mohammadhossein Homaei
Adaptive Multi-Scale Attentional Network for Semantic Segmentation of Remote Sensing Images
Melika Zare - Sattar Hashemi
more
Samin Hamayesh - Version 43.7.0