GlassBox¶
A transparent, white-box AutoML library built from scratch with NumPy.
GlassBox is a lightweight AutoML library designed for auditability and clarity. Every algorithm is implemented from first principles using only NumPy — no hidden black-box dependencies.
Features¶
-
Frame
A minimal
Datasetcontainer with CSV I/O, column selection, row slicing, and in-place mutations — all backed by NumPy arrays. -
Inspector
Non-destructive Exploratory Data Analysis: statistical profiling, auto-typing, outlier detection, and Pearson / Cramér's V associations.
-
Cleaner
A scikit-learn-style
fit/transformpipeline for imputation, outlier capping, scaling, and encoding. -
Models
Decision Trees, Random Forests, and K-Nearest Neighbors — for both classification and regression.
Quick Install¶
Philosophy¶
| Principle | Meaning |
|---|---|
| Clarity over cleverness | Every line should be readable. |
| Explicit over implicit | No hidden transformations. |
| NumPy is the engine | Zero heavy dependencies. |
| Auditability first | Built for white-box inspection. |