HDR Gateway logo
HDR Gateway logo

Bookmarks

pyxplorer

Description

The goal of pyxplorer is to provide a simple tool that allows interactive profiling of datasets that are accessible via a SQL like interface. The only requirement to run data profiling is that you are able to provide a Python DBAPI like interface to your data source and the data source is able to understand simplistic SQL queries. Supported Features: -Column Count (Database / Table) -Table Count -Tuple Count (Database / Table) -Min / Max -Most Frequent / Least Frequent -Top-K Most Frequent / Top-K Least Frequent -Top-K Value Distribution (Database / Table ) -Uniqueness -Constancy -Distinct Value Count

Results/Insights

The goal of pyxplorer is to provide a simple tool that allows interactive profiling of datasets that are accessible via a SQL like interface. The only requirement to run data profiling is that you are able to provide a Python DBAPI like interface to your data source and the data source is able to understand simplistic SQL queries. Supported Features: -Column Count (Database / Table) -Table Count -Tuple Count (Database / Table) -Min / Max -Most Frequent / Least Frequent -Top-K Most Frequent / Top-K Least Frequent -Top-K Value Distribution (Database / Table ) -Uniqueness -Constancy -Distinct Value Count

Details

License

2-Clause BSD License

Last Updated

2024-10-08T13:17:54.000000Z