Batch Training vs. Online Learning

Written by fewshot | Published 2024/08/27
Tech Story Tags: neural-networks | machine-learning-benchmarks | continual-learning-for-llm | class-agnostic-network | disentangled-learning | open-set-classification | class-incremental-learning | forward-and-backward-transfer

TLDRIn online learning experiments, our method, when trained with streaming data and single-pass updates, shows continuous improvement over time, though batch training still yields better accuracy. The results indicate that our framework is adaptable to both online and offline learning scenarios, with potential convergence across different training regimes.via the TL;DR App

Authors:

(1) Sebastian Dziadzio, University of Tübingen ([email protected]);

(2) Çagatay Yıldız, University of Tübingen;

(3) Gido M. van de Ven, KU Leuven;

(4) Tomasz Trzcinski, IDEAS NCBR, Warsaw University of Technology, Tooploox;

(5) Tinne Tuytelaars, KU Leuven;

(6) Matthias Bethge, University of Tübingen.

Table of Links

Abstract and 1. Introduction

2. Two problems with the current approach to class-incremental continual learning

3. Methods and 3.1. Infinite dSprites

3.2. Disentangled learning

4. Related work

4.1. Continual learning and 4.2. Benchmarking continual learning

5. Experiments

5.1. Regularization methods and 5.2. Replay-based methods

5.3. Do we need equivariance?

5.4. One-shot generalization and 5.5. Open-set classification

5.6. Online vs. offline

Conclusion, Acknowledgments and References

Supplementary Material

5.6. Online vs. offline

In all previous experiments, we applied our method in batch mode: we performed multiple training passes over the data for each task. However, efficiently learning from streaming data might require observing each training sample only once to make sure computation is not becoming a bottleneck. This is why we test our method in the online learning regime and compare it to two batch learning scenarios. The results are shown in Fig. 9. Unsurprisingly, training for multiple epochs results in better and more robust accuracy on past tasks; it is however worth noting that our method still improves over time in the online learning scenario. It is possible that, given enough tasks, all three curves would converge.

This paper is available on arxiv under CC 4.0 license.


Written by fewshot | Spearheading research, publications, and advancements in few-shot learning, and redefining artificial intelligence.
Published by HackerNoon on 2024/08/27