About Gators#
What is Gators?#
Gators is a high-performance machine learning preprocessing library built on top of Polars, designed to streamline your entire ML workflow from raw data to production-ready models. Leveraging Polarsβ blazing-fast multi-core processing, Gators makes data preprocessing and feature engineering both faster and simpler.
Built by the PSP Data Team at PayPal, Gators solves a critical pain point: bridging the gap between Python-based model development and production deployment. With Gators, you can develop and deploy using only Python β no more reimplementing preprocessing logic in other languages for production!
Why Gators?#
The Problem#
Traditional ML workflows face a critical challenge: data preprocessing is developed in Python (pandas/sklearn) but often needs to be reimplemented in faster languages (C++, Java, Scala) for production deployment. This creates:
π΄ Maintenance burden - Two codebases to maintain
π΄ Bugs and inconsistencies - Different implementations can behave differently
π΄ Slower development - Every change needs to be implemented twice
The Solution#
Gators solves this by combining:
β Python-first development - Write once, deploy everywhere
β Production-grade performance - Polars enables Rust speeds in Python
β Unified workflow - Same code from experimentation to production
Use Cases#
Gators is perfect for:
Fraud Detection - Extensive feature engineering for anomaly detection
Risk Modeling - Create powerful predictive features
Customer Analytics - Transform complex customer data
Time Series - Rich datetime feature engineering
NLP Tasks - String feature extraction and encoding
Production ML - Deploy preprocessing pipelines without rewriting code
Key Features#
π Lightning Fast: Built on Polars for multi-core parallel processing
π Unified API: Consistent sklearn-style
.fit()and.transform()interfaceπ¦ Production Ready: Deploy the same Python code from notebook to production
π― Comprehensive: 60+ preprocessing transformers covering every use case
π Pipeline Support: Chain transformers seamlessly with the Pipeline class
π Easy to Learn: If you know sklearn, you already know Gators
Credits#
Developed by the PSP Data Team at PayPal.
Built with β‘ by data scientists, for data scientists