I would like to automate data profiling on PostgreSQL with a free tool, a tool that inspects data content through a column profile or percentage distribution of values. like max, min, avg.
generate PostgreSQL stats / data profiling
1.5k Views Asked by rachid At
1
There are 1 best solutions below
Related Questions in SQL
- SQL schema for a fill-in-the-blank exercise
- Hibernate: JOIN inheritance question - why the need for two left joins
- What's supposed to be the problem in this query?
- Compare fields in two tables
- How to change woocomerce or full wordpress currency with value from USD to AUD
- Dynamic query creation with Array like implementation
- SQL query to get student enrolled in this month in a course - Moodle
- SQL LAG() function returning 0 for every row despite available previous rows
- Convert C# DateTime.Ticks to Bigquery DateTime Format
- Use row values from another table to select them as columns and establish relations between them (pivot table)
- SQL: Generate combination table based on source and destination column from same table
- how to use system's environnement variables in sql script
- PHP fetchAll on JOIN
- Multitable joining in Sql
- How to display name starting from 'z' by using BETWEEN cmd only?
Related Questions in POSTGRESQL
- Only the first SQL script gets executed inside Docker Postgres container
- Compare fields in two tables
- Hibernate ClobJdbcType bindings: what are the diferences?
- Postgres && statement Error in Mybatis Mapper?
- Can this query be optimized? (Choosing a random row to insert, that excludes previously inserted Rows)
- Connection terminated unexpectedly while performing multi row insert using pg-promise
- Processing multiple forms in nodejs and postgresql
- How to copy data from SQLite to postgreSQL?
- PGAdmin4 configured behind a reverse proxy but unable to connect to Postgresql server
- Updates to pgsodium encrypted values don't use specified key_id
- Connecting to Postgres running in a Docker container using psql
- Can't connect to local postgresql server from my docker container
- Django Arrayfield migration to cloud sql (Postgresql) not creating the column
- Get list of matching keywords for each post
- docker-compose can't reset postgresql database
Related Questions in DATA-PROFILING
- Why to_notebook_iframe (ydata-profiling) does not render the report on SageMaker notebook?
- How to identify all possible differences in duplicate data from two different datasets and calculate frequency?
- Extract specific values from pandas profiling to a data frame
- AttributeError when attempting to generate a report with ydata-profiling in Python
- Saving and Reloading a ydata-profiling / pandas-profiling ProfileReport object for later use
- Spark report with pandas profiling
- how can we create alerts for datadrift by giving threshold
- Data Profiling using Pyspark
- How to customize customize alerts + other metrics in pandas_profiling / y_data_profiling alerts
- Is it possible in snowflake to write a query that lists the columns that have all null values?
- Databricks : Export data profiling report
- Using Pydequu on Jupyter Notebook and having this "An error occurred while calling o70.run.'
- Detecting similar columns across multiple files based on statistical profile
- How can I connect a local delta lake with talend for data profiling purpose?
- Not able to perform operations on resulting dataframe after "join" operation in PySpark
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
https://www.postgresql.org/docs/current/static/view-pg-stats.html will give you the idea of data distribution for column. It is populated by autovacuum based on your settings. Or manual runs.
Also yo can run queries like
select max(c), min(c), avg(c) from tnameto get exact data the is of interest for you.To do that I would recommend using
psql- it is free and extremely handy for querying Postgres. Also you can easilycronpsql -c "your select here"to format any report by your needs.You can save profiles and data either to files or database. It can be interactive and scripted. It works with local and remote databases. You can easily mix SQL with bash or any other scripting language variables.
All this (and much much more) cool features you will find with psql. Documentation is here. You don't need to download it if you already have Postgres client - it is part of the package.