Publication
Subgroup Discovery with SD4Py
Dan Hudson; Martin Atzmueller
In: Proc. European Conference on Artificial Intelligence Workshops. European Conference on Artificial Intelligence (ECAI-2023), ECAI Workshops, Pages 338-348, Springer, 2024.
Abstract
We present SD4Py, a free open-source Python package for performing subgroup discovery and analysis. SD4Py makes it easy to discover subgroups from data stored in a Pandas data frame, to undertake follow-on analysis to examine the variability in the quality of the subgroups and to visualise important parameters. The core algorithms for discovering subgroups are implemented by an existing well-established and efficient Java back-end, but are exposed through a user-friendly Python interface. SD4Py offers a concise workflow for not only discovering but also comparing subgroups, in order to select those of interest, and for gaining insights into what is distinctive about individual subgroups.