Skip to content

Releases: capitalone/DataProfiler

0.10.1

17 Jul 18:21
6cb789a
Compare
Choose a tag to compare

Profiler

  • Hot Fix: .astype("bool") #960

Documentation

  • Staging: Update docs 0.10.1 #961
  • Update docs 0.10.1 #962

Full Changelog: 0.10.0...0.10.1

What's Changed

Full Changelog: 0.10.0...0.10.1

0.10.0

30 Jun 15:04
77ddb29
Compare
Choose a tag to compare

Profiler

  • Forking workflow directions CONTRIBUTING.md #857
  • Fixing diagram rendering in CONTRIBUTING.md #862
  • Fix initial value of processor_type #863
  • fix: test bug due to bad mocks #878
  • added differences section to unstructured data example #877
  • Reservoir sampling refactor #910
  • feat: add dev to workfow for testing #897
  • Cms for categorical #892
  • Hotfix: fix post feature serialization merge #942
  • Update version to 0.10.0 #944
  • Staging/main/0.10.0 #943

Profiler: Profile Serialization

  • Staging/dev/profile serialization #940
  • fix: order bug #939
  • fix: null_rep mat should calculate even if datetime #933
  • Profiler: load_method hotfix #932
  • Top level hotfix: save / load .lower() #931
  • Notebook Example save/load Profile #930
  • refactor: use seed for sample for consistency #927
  • Profile Builder load() serialization #925
  • Reuse passed labeler #924
  • BaseProfiler save() for json #923
  • Added testing for values for test_json_decode_after_update #915
  • UnstructuredProfiler: Added NoImplementationError #907
  • fix: bug and add tests for structuredcolprofiler #904
  • Stuctured profiler encode decode #903
  • refactor: allow options to go through all #902
  • StructuredColProfiler Encode / Decode #901
  • Decode options #894
  • Quick Test update #893
  • Deserialization of datalabeler #891
  • ColumnDataLabelerCompiler: serialize / deserialize #888
  • Add Serialization and Deserialization Tests for Stats Compiler, plus refactors for order Typing #887
  • Adds deserialization for compilers and validates tests for Primitive; fixes numerical deserialization #886
  • Adds tests validating serialization with Primitive type for compiler #885
  • feat: add test and compiler serialization #884
  • ready datalabeler for deserialization and improvement on serializatio… #879
  • Encode Options #875
  • Encode/Decode TextColumnProfiler #870
  • Created encoder for the datalabelercolumn #869
  • Added test to ensure order attribute for ordered column profiler functions correctly after deserialization #868
  • Added decoding for encoding of ordered column profiles #864
  • Json decode date time column #861
  • Float column profiler encode decode #854
  • hot fixes for encode and decode of numeric stats mixin and intcol pro… #852

Profiler: Options

  • staging/dev/options #909
  • RowStatisticsOptions: Implementing option #871
  • New preset implementation and test #867
  • RowStatisticsOptions: Add option #865

Documentation

  • Staging update docs 0.10.0 #945
  • Documentation: Fix Req #922
  • Documentation: Update for Reservoir Sampling #919
  • documentation update for cms specific options to category #917
  • Add forking / branch workflow image #858

Documentation: Profile Serialization

  • Merge staging/dev-gh-pages/profile-serialization into dev-gh-pages #937
  • Docs: Profiler Serialization Clean Up #936
  • Docs: Profiler Serialization #928

Documentation: Options

  • Documentation: feature/options branch docs updates #921
  • Row statistics option documentation #883
  • updating docs for preset name #882
  • Add documentation for median_abs_deviation option #881
  • Preset test updated w new names and different toggles #880
  • reset ignore, update .gitignore, update documentation on presets #874
  • Fixed documentation for sampling_ratio option #873

Full Changelog: 0.9.0...0.10.0

What's Changed

Full Changelog: 0.9.0...0.10.0

0.9.0

01 Jun 16:05
4d157c8
Compare
Choose a tag to compare

Profiler

  • Encode int column #780
  • Decode categorical #786
  • Encode update format #789
  • Optimization for text column profile ksneab #791
  • Remove unnecessary cast() in csv_data.py (1) #796
  • Remove unnecessary cast() in csv_data.py (2) #798
  • Update main with change in memory-optimization #799
  • Remove unnecessary cast() in data.py #800
  • Remove unnecessary cast() in graph_data.py #801
  • Fix CatgoricalColumn test #804
  • Specify init calls in data readers reload() methods #805
  • Fix dask dataframe import #812
  • Fix CharsetMatches type error #813
  • Json Decoder Code Cleanup #814
  • Fix override errors #819
  • Sampling ratio option #825
  • Memory Optimization to main #832
    • Fixed testing to run on all feature branches for PRs #793
    • Part 1 fix for categorical mem opt issue #795
    • cleanup time space analysis code #797
    • quick update to feature/memory-optimization for merge to main #802
    • Update feat mem #803
    • Categorical Stop Condition Options #808
    • Space time analysis improvement #809
    • implementation of setting stop conds via options for cat column profiler #810
    • Fix for histogram merging #815
    • Fixes categorical bug when stop condition is met #816
    • hotfix for more conservatitive stop condition in categorical columns #817
    • Coverage Fix Memory Optimization Feature Branch #823
    • Added option to remove calculations for updating row statistics #827
    • Fix to doc strings #829
    • Preset Option Fix: presets docsstring added #830
  • Fix LSP violations #840
  • Fix argument types in doc comments #843

Documentation

  • Fix minor typo #788
  • Github pages memory optimization #833
    • added new options to docs #828
    • Preset Option Fix: Added presets documentation to profiler options section #831
  • Update docs for 0.9.0 #851

Other Changes

  • Memory testing and data gen scripts #781
  • Update for new Dask version in Validator test #784
  • Space analysis dataset sampling addition #787
  • fix bug in dataset generation #790
  • Update pre-commit mypy dependencies #811
  • Coverage Fix to Main Branch #822
  • Update version to 0.9.0 #848

Full Changelog: 0.8.9...0.9.0

What's Changed

New Contributors

Full Changelog: 0.8.9...0.9.0

0.8.9

12 Apr 15:18
a7f0d3e
Compare
Choose a tag to compare

Profiler

  • Create BaseColumnProfiler.to_dict to make JSONable #766
  • Chi2 docs update #767
  • Create Profile Encoder to JSONify BaseColumnProfiler #769
  • Encode categorical column #770
  • Encode order column #772
  • Add and test JSONify DateTimeColumn #774

Documentation

  • Update docs 0.8.9 #779

Other Changes

  • fix: update ml reqs #777
  • Update to version 0.8.9 #778

Full Changelog: 0.8.8...0.8.9

What's Changed

New Contributors

Full Changelog: 0.8.8...0.8.9

0.8.8

21 Feb 22:56
7613a1a
Compare
Choose a tag to compare

Profiler

  • Quick chi2 test fix #763

Documentation

  • Update docs 0.8.8 #765
  • Chi2 docs update #767

Other Changes

  • Update to version 0.8.8 #764
  • PyPi image rendering issue #761
  • [BUG] update isort version pin #760
  • [BUG] isort version change #759

Full Changelog: 0.8.7.post1...0.8.8

What's Changed

Full Changelog: 0.8.7.post1...0.8.8

0.8.7.post1

27 Jan 19:56
7f72ec0
Compare
Choose a tag to compare

Profiler

  • Bug: requirements-ml fix #754

Other Changes

  • Update to version 0.8.7.post1 #755

Full Changelog: 0.8.7...0.8.7.post1

What's Changed

Full Changelog: 0.8.7...0.8.7.post1

0.8.7

23 Jan 15:34
5270b5d
Compare
Choose a tag to compare

Profiler

  • relax requests and networkx dependencies #750

Documentation

  • Generate docs for v0.8.7 #752

Other Changes

  • Update version to 0.8.7 #751

Full Changelog: 0.8.6...0.8.7

What's Changed

Full Changelog: 0.8.6...0.8.7

0.8.6

06 Jan 17:16
a6a0b85
Compare
Choose a tag to compare

Profiler

  • Removes futures from required libs #746

Documentation

  • Generate Docs for v0.8.6 #749

Other Changes

  • Update version to 0.8.6 #748

Full Changelog: 0.8.5...0.8.6

0.8.5

20 Dec 18:54
51cdb8b
Compare
Choose a tag to compare

Profiler

  • Rework Graph Test for mocking missing imports #736
  • Windows Install error - Path Resouces - Fixes 738 #739

Documentation

  • Generate Docs for v0.8.5 #744

Other Changes

  • adding pyupgrade & autoflake #734
  • loosening pin on typing-extensions #735
  • removing six #740
  • Use tensorflow-macos and clean up some test running warning noise #741
  • Update version to 0.8.5 #742

Full Changelog: 0.8.4...0.8.5

What's Changed

  • loosening pin on typing-extensions by @leos in #735
  • Rework Graph Test for mocking missing imports by @JGSweets in #736
  • Windows Install error - Path Resouces - Fixes #738 by @rxm7706 in #739
  • adding pyupgrade & autoflake by @leos in #734
  • removing six by @leos in #740
  • Use tensorflow-macos and clean up some test running warning noise by @leos in #741
  • Update version to 0.8.5 by @JGSweets in #742

New Contributors

Full Changelog: 0.8.4...0.8.5

0.8.4

02 Dec 21:26
6d5cc38
Compare
Choose a tag to compare

Profiler

  • Replaces Merging dict func #731

Documentation

  • Great Expectations Examples Fix #726
  • WIP Generate Docs for v0.8.4 #729
  • Generate Docs for v0.8.4 #732

Other Changes

  • Fix numpy version and drop python 3.7 in checks #725
  • Updating the version to v0.8.4 #728

Full Changelog: 0.8.3...0.8.4

What's Changed

Full Changelog: 0.8.3...0.8.4