Publications

FormulAI: Designing Rule-Based Datasets for Interpretable and Challenging Machine Learning Tasks

 

Abstract
In a period marked by the transformative impact of machine learning algorithms across different disciplines, challenges in achieving model interpretability persist. Existing evaluation datasets often lack transparency, obscuring the decision-making process of machine learning models, particularly in complex deep learning architectures. This opacity raises concerns spanning sectors like healthcare, emphasizing the pivotal part of explainability in breeding trust and clinging to nonsupervisory norms. While progress has been made through interpretable model developments, the absence of formalized, interpretable datasets hampers technique validation and comparison. Rule-based datasets, distinct from general synthetic datasets, offer an avenue to pretend real-world challenges while retaining interpretability. This paper presents FormulAI, a framework for generating comprehensive rule-grounded datasets, encompassing categorical and continuous features, calibrated noise, and imbalanced class distribution. Emphasizing scalability and reproducibility, these datasets function as a robust standard, fostering exploration in interpretability and robustness.

 

URL
https://ojs.bonviewpress.com/index.php/AIA/article/view/1781

 

DOI
10.47852/bonviewAIA42021781

 

LaTeX
@article{FormulAI2024, 
    title   = {FormulAI: Designing Rule-Based Datasets for Interpretable and Challenging Machine Learning Tasks}, 
    url     = {https://ojs.bonviewpress.com/index.php/AIA/article/view/1781}, 
    DOI     = {10.47852/bonviewAIA42021781}, 
    journal = {Artificial Intelligence and Applications}, 
    author  = {Tissot, Hegler}, 
    year    = {2024}, 
    month   = {Mar.} 
}