Data Science MS Thesis Defense by Sudhanshu Mukherjee
Date: Monday, April 14, 2025
Time: 2:30pm-4pm
Title: Rapid Insight Data Engine (R.I.D.E.): An Open-Source Python Framework for Automated Analysis of Tabular Data
Zoom link: https://umassd.zoom.us/j/99649322280?pwd=ayS8nmJcIXhzJEiAcgRBdAHQGbvr0N.1
Meeting ID: 996 4932 2280
Passcode: 021889
Abstract:
Data practitioners rely on easy-to-use tools to streamline data investigation and interpretation workflows.
This work introduces Rapid Insight Data Engine (R.I.D.E.), an open-source Python framework and command-line interface (CLI)
designed for tabular data analysis. The framework is accessible to users from diverse backgrounds with minimal programming
knowledge, allowing them to focus on analysis, prepare reports, and decide what to do next with the data. R.I.D.E. offers
NoCode capabilities for data enthusiasts, accelerating their development cycle through integrated backend systems that handle
universal data processing. We tested the framework across diverse datasets, evaluating its performance in data preprocessing,
feature scaling, transformation, and AutoML for regression, classification, and clustering. We also report performance tests on
small- to medium-sized datasets, covering processing time, memory utilization, and CPU load. The open-source nature of
the framework allows users to modify the code to suit their workflow.
Acknowledgment:
This research is partially supported by the UMassD Mathematics Department.
Advisor:
Dr. Alfa Heryudono, Department of Mathematics
Committee members:
Dr. Bo Dong, Department of Mathematics
Dr. Zheng Chen, Department of Mathematics
For additional information, please contact Alfa Heryudono
All data science graduate students are encouraged to attend.
Location: Via Zoom (see Zoom details above)
Contact: Alfa Heryudono, aheryudono@umassd.edu