Skip to content

Latest commit

 

History

History
122 lines (86 loc) · 5.74 KB

CHANGELOG.md

File metadata and controls

122 lines (86 loc) · 5.74 KB

NVTabular v0.6.1 (11 August 2021)

Bug Fixes

  • Fix installing package via pip #1030
  • Fix inference with groupby operator #1019
  • Install tqdm with conda package #1030
  • Fix workflow output_dtypes with empty partitions #1028

NVTabular v0.6.0 (5 August 2021)

Improvements

  • Add CPU support #534
  • Speed up inference on Triton Inference Server #744
  • Add support for session based recommenders #355
  • Add PyTorch Dataloader support for Sparse Tensors #500
  • Add ListSlice operator for truncating list columns #734
  • Categorical ids sorted by frequency #799
  • Add ability to select a subset of a ColumnGroup #809
  • Add option to use Rename op to give a single column a new fixed name #825
  • Add a 'map' function to KerasSequenceLoader, which enables sample weights #667
  • Add JoinExternal option on nvt.Dataset in addition to cudf #370
  • Allow passing ColumnGroup to get_embedding_sizes #732
  • Add ability to name LambdaOp and provide a better default name in graph visualizations #860

Bug Fixes

  • Fix make_feature_column_workflow for Categorical columns #763
  • Fix Categorify output dtypes for list columns #963
  • Fix inference for Outbrain example #669
  • Fix dask metadata after calling workflow.to_ddf() #852
  • Fix out of memory errors #896, #971
  • Fix normalize output when stdev is zero #993
  • Fix using UCX with a dask cluster on Merlin containers #872

NVTabular v0.5.3 (1 June 2021)

Bug Fixes

  • Fix Shuffling in Torch DataLoader #818
  • Fix "Unsupported type_id conversion" in triton inference for string columns #813
  • Fix HugeCTR inference backend Merlin#8

NVTabular v0.5.1 (4 May 2021)

Improvements

  • Update dependencies to use cudf 0.19
  • Removed conda from docker containers, leading to much smaller container sizes
  • Added CUDA 11.2 support
  • Added FastAI v2.3 support

Bug Fixes

  • Fix NVTabular preprocessing with HugeCTR inference

NVTabular v0.5.0 (13 April 2021)

Improvements

  • Adding Horovod integration to NVTabular's dataloaders, allowing you to use multiple GPU's to train TensorFlow and PyTorch models
  • Adding a Groupby operation for use with session based recommender models
  • Added ability to read and write datasets partitioned by a column, allowing
  • Add example notebooks for using Triton Inference Server with NVTabular
  • Restructure and simplify Criteo example notebooks
  • Add support for PyTorch inference with Triton Inference Server

Bug Fixes

  • Fix bug with preprocessing categorical columns with NVTabular not working with HugeCTR and Triton Inference Server #707

NVTabular v0.4.0 (9 March 2021)

Breaking Changes

  • The API for NVTabular has been signficantly refactored, and existing code targetting the 0.3 API will need to be updated. Workflows are now represented as graphs of operations, and applied using a sklearn 'transformers' style api. Read more by checking out the examples

Improvements

  • Triton integration support for NVTabular with TensorFlow and HugeCTR models
  • Recommended cloud configuration and support for AWS and GCP
  • Reorganized examples and documentation
  • Unified Docker containers for Merlin components (NVTabular, HugeCTR and Triton)
  • Dataset analysis and generation tools

NVTabular v0.3.0 (23 November 2020)

Improvements

  • Add MultiHot categorical support for both preprocessing and dataloading
  • Add support for pretrained embeddings to the dataloaders
  • Add a Recsys2020 competition example notebook
  • Add ability to automatically map tensorflow feature columns to a NVTabular workflow
  • Multi-Node support

NVTabular v0.2.0 (10 September 2020)

Improvements

  • Add Multi-GPU support using Dask-cuDF
  • Add support for reading datasets from S3, GCS and HDFS
  • Add 11 new operators: ColumnSimilarity, Dropna, Filter, FillMedian, HashBucket, JoinGroupBy, JoinExternal, LambdaOp, NormalizeMinMax, TargetEncoding and DifferenceLag
  • Add HugeCTR integration and an example notebook showing an end to end workflow
  • Signicantly faster dataloaders featuring a unified backend between TensorFlow and PyTorch

NVTabular v0.1.1 (3 June 2020)

Improvements

  • Switch to using the release version of cudf 0.14

Bug Fixes

  • Fix PyTorch dataloader for compatability with deep learning examples
  • Fix FillMissing operator with constant fill
  • Fix missing yaml dependency on conda install
  • Fix get_emb_sz off-by-one error

NVTabular v0.1.0 - (13 May 2020)

  • Initial public release of NVTabular