As model training becomes more distributed in nature, tf.data has evolved to be more distribution aware and performant. This talk presents tf.data tools for scaling TensorFlow data processing. In particular: tf.data service that allows your tf.data pipeline to run on a cluster of machines, and tf.data.snapshot that materializes the results to disk for reuses across multiple invocations.

Speaker:
Rohan Jain - Staff Software Engineer

#tensorflow #python #machinelearning

Scaling Tensorflow data processing with tf.data
5.25 GEEK