Skip to main content

Showing 1–8 of 8 results for author: Masson, C

  1. arXiv:2407.07874  [pdf, ps, other

    cs.LG cs.AI

    Toto: Time Series Optimized Transformer for Observability

    Authors: Ben Cohen, Emaad Khwaja, Kan Wang, Charles Masson, Elise Ramé, Youssef Doubli, Othmane Abou-Amal

    Abstract: This technical report describes the Time Series Optimized Transformer for Observability (Toto), a new state of the art foundation model for time series forecasting developed by Datadog. In addition to advancing the state of the art on generalized time series benchmarks in domains such as electricity and weather, this model is the first general-purpose time series forecasting foundation model to be… ▽ More

    Submitted 11 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2402.17549  [pdf, other

    cs.DS

    FlipHash: A Constant-Time Consistent Range-Hashing Algorithm

    Authors: Charles Masson, Homin K. Lee

    Abstract: Consistent range-hashing is a technique used in distributed systems, either directly or as a subroutine for consistent hashing, commonly to realize an even and stable data distribution over a variable number of resources. We introduce FlipHash, a consistent range-hashing algorithm with constant time complexity and low memory requirements. Like Jump Consistent Hash, FlipHash is intended for applica… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 16 pages, 3 figures, 4 tables

    ACM Class: E.2; E.1

  3. arXiv:2402.08604  [pdf, other

    cs.DS cs.DB

    Sampling Space-Saving Set Sketches

    Authors: Homin K. Lee, Charles Masson

    Abstract: Large, distributed data streams are now ubiquitous. High-accuracy sketches with low memory overhead have become the de facto method for analyzing this data. For instance, if we wish to group data by some label and report the largest counts using fixed memory, we need to turn to mergeable heavy hitter sketches that can provide highly accurate approximate counts. Similarly, if we wish to keep track… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 12 pages, 5 figures

    ACM Class: E.1

  4. arXiv:2205.10158  [pdf, other

    cs.CV cs.LG

    Swapping Semantic Contents for Mixing Images

    Authors: Rémy Sun, Clément Masson, Gilles Hénaff, Nicolas Thome, Matthieu Cord

    Abstract: Deep architecture have proven capable of solving many tasks provided a sufficient amount of labeled data. In fact, the amount of available labeled data has become the principal bottleneck in low label settings such as Semi-Supervised Learning. Mixing Data Augmentations do not typically yield new labeled samples, as indiscriminately mixing contents creates between-class samples. In this work, we in… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: Accepted at ICPR 2022, 7 pages, 4 figures, 6 tables

  5. arXiv:2205.10139  [pdf, other

    cs.LG

    Towards efficient feature sharing in MIMO architectures

    Authors: Rémy Sun, Alexandre Ramé, Clément Masson, Nicolas Thome, Matthieu Cord

    Abstract: Multi-input multi-output architectures propose to train multiple subnetworks within one base network and then average the subnetwork predictions to benefit from ensembling for free. Despite some relative success, these architectures are wasteful in their use of parameters. Indeed, we highlight in this paper that the learned subnetwork fail to share even generic features which limits their applicab… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: 7 pages, 6 figures, 1 table

  6. DDSketch: A fast and fully-mergeable quantile sketch with relative-error guarantees

    Authors: Charles Masson, Jee E. Rim, Homin K. Lee

    Abstract: Summary statistics such as the mean and variance are easily maintained for large, distributed data streams, but order statistics (i.e., sample quantiles) can only be approximately summarized. There is extensive literature on maintaining quantile sketches where the emphasis has been on bounding the rank error of the sketch while using little memory. Unfortunately, rank error guarantees do not precl… ▽ More

    Submitted 28 August, 2019; originally announced August 2019.

    Comments: 11 pages, 11 figures, VLDB

    Journal ref: PVLDB, 12(12): 2195-2205, 2019

  7. arXiv:1610.01910  [pdf, ps, other

    cs.CL

    Toward Automatic Understanding of the Function of Affective Language in Support Groups

    Authors: Amit Navindgi, Caroline Brun, Cécile Boulard Masson, Scott Nowson

    Abstract: Understanding expressions of emotions in support forums has considerable value and NLP methods are key to automating this. Many approaches understandably use subjective categories which are more fine-grained than a straightforward polarity-based spectrum. However, the definition of such categories is non-trivial and, in fact, we argue for a need to incorporate communicative elements even beyond su… ▽ More

    Submitted 6 October, 2016; originally announced October 2016.

    Comments: 9 pages, 1 figure, conference workshop

  8. arXiv:1512.03201  [pdf, ps, other

    cs.LG

    Gated networks: an inventory

    Authors: Olivier Sigaud, Clément Masson, David Filliat, Freek Stulp

    Abstract: Gated networks are networks that contain gating connections, in which the outputs of at least two neurons are multiplied. Initially, gated networks were used to learn relationships between two input sources, such as pixels from two images. More recently, they have been applied to learning activity recognition or multi-modal representations. The aims of this paper are threefold: 1) to explain the b… ▽ More

    Submitted 10 December, 2015; originally announced December 2015.

    Comments: Unpublished manuscript, 17 pages