Showing 1–2 of 2 results for author: Jaghouar, S

  1. arXiv:2407.07852

    cs.LG cs.DC

    OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

    Authors: Sami Jaghouar, Jack Min Ong, Johannes Hagemann

    Abstract: OpenDiLoCo is an open-source implementation and replication of the Distributed Low-Communication (DiLoCo) training method for large language models. We provide a reproducible implementation of the DiLoCo experiments, offering it within a scalable, decentralized training framework using the Hivemind library. We demonstrate its effectiveness by training a model across two continents and three countr…

    Submitted 10 July, 2024; originally announced July 2024.

  2. Improving traffic sign recognition by active search

    Authors: S. Jaghouar, H. Gustafsson, B. Mehlig, E. Werner, N. Gustafsson

    Abstract: We describe an iterative active-learning algorithm to recognise rare traffic signs. A standard ResNet is trained on a training set containing only a single sample of the rare class. We demonstrate that by sorting the samples of a large, unlabeled set by the estimated probability of belonging to the rare class, we can efficiently identify samples from the rare class. This works despite the fact tha…

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: 6 pages, 7 Figures

    Journal ref: DAGM GCPR 2022 Pattern Recognition pp. 594--606 (2022)