Description
Input: A sample set of values from each column, or from the API (initial metadata).
Output: For each table, output the corresponding metadata for each column.
Write code to extract from each column:
- Number of non-empty cells
- Number of empty cells
- Number of distinct values
- Top 5 most frequent values
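
The four statistics above can be sketched for a single column as follows. This is a minimal illustration, not a prescribed implementation: the names `is_empty` and `profile_column` are hypothetical, and treating `None` and blank strings as "empty" is an assumption that may need adjusting to the actual data.

```python
from collections import Counter


def is_empty(value):
    # Assumption: None and blank/whitespace-only strings count as empty cells.
    return value is None or (isinstance(value, str) and value.strip() == "")


def profile_column(values, top_k=5):
    """Compute the per-column statistics in one pass over the values."""
    counts = Counter()  # frequency of each non-empty value
    empty = 0
    for v in values:
        if is_empty(v):
            empty += 1
        else:
            counts[v] += 1
    return {
        "non_empty": sum(counts.values()),
        "empty": empty,
        "distinct": len(counts),
        # most_common returns (value, count) pairs, highest count first
        "top": counts.most_common(top_k),
    }


# Hypothetical sample column, for illustration only.
column = ["NY", "NY", "", "LA", None, "NY", "SF", "LA"]
stats = profile_column(column)
print(stats)
```

Distinct values are counted over non-empty cells only; if empty cells should count as a value of their own, that line would need to change.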
Considerations:
- Skewed data
- Profiling runtime
- Optimize as much as possible
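
One possible way to address the runtime consideration is to profile all columns of a table in a single pass over its rows and to time that pass. The sketch below assumes the table is available as CSV text; `profile_table` and the sample data are hypothetical, and blank strings and missing cells are assumed to be empty.

```python
import csv
import io
import time
from collections import Counter


def profile_table(rows, columns, top_k=5):
    """Profile every column while reading each row only once."""
    counters = {c: Counter() for c in columns}
    empties = dict.fromkeys(columns, 0)
    for row in rows:
        for c in columns:
            v = row.get(c)
            # Assumption: missing cells (None) and blank strings are empty.
            if v is None or v.strip() == "":
                empties[c] += 1
            else:
                counters[c][v] += 1
    return {
        c: {
            "non_empty": sum(counters[c].values()),
            "empty": empties[c],
            "distinct": len(counters[c]),
            "top": counters[c].most_common(top_k),
        }
        for c in columns
    }


# Hypothetical sample table, for illustration only.
sample = "city,zip\nNY,10001\nNY,\nLA,90001\n,10001\n"
reader = csv.DictReader(io.StringIO(sample))

start = time.perf_counter()
result = profile_table(reader, reader.fieldnames)
elapsed = time.perf_counter() - start

print(result)
print(f"profiled in {elapsed:.6f} s")
```

With skewed data the exact counters stay small, since a few values account for most cells. The memory cost grows with the number of distinct values, so columns with very high cardinality may need a different strategy.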