Skip to main content

Showing 1–10 of 10 results for author: Wild, C

  1. arXiv:2407.07848  [pdf, other

    cs.LG cs.AI

    Uncovering Layer-Dependent Activation Sparsity Patterns in ReLU Transformers

    Authors: Cody Wild, Jesper Anderson

    Abstract: Previous work has demonstrated that MLPs within ReLU Transformers exhibit high levels of sparsity, with many of their activations equal to zero for any given token. We build on that work to more deeply explore how token-level sparsity evolves over the course of training, and how it connects to broader sparsity patterns over the course of a sequence or batch, demonstrating that the different layers… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2205.07886  [pdf, other

    cs.LG cs.AI

    An Empirical Investigation of Representation Learning for Imitation

    Authors: Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah

    Abstract: Imitation learning often needs a large demonstration set in order to handle the full range of situations that an agent might find itself in during deployment. However, collecting expert demonstrations can be expensive. Recent work in vision, reinforcement learning, and NLP has shown that auxiliary representation learning objectives can reduce the need for large amounts of expensive, task-specific… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: Accepted to NeurIPS2021 Datasets and Benchmarks Track

  3. arXiv:2204.07123  [pdf, other

    cs.AI

    Retrospective on the 2021 BASALT Competition on Learning from Human Feedback

    Authors: Rohin Shah, Steven H. Wang, Cody Wild, Stephanie Milani, Anssi Kanervisto, Vinicius G. Goecks, Nicholas Waytowich, David Watkins-Valls, Bharat Prakash, Edmund Mills, Divyansh Garg, Alexander Fries, Alexandra Souly, Chan Jun Shern, Daniel del Castillo, Tom Lieberum

    Abstract: We held the first-ever MineRL Benchmark for Agents that Solve Almost-Lifelike Tasks (MineRL BASALT) Competition at the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021). The goal of the competition was to promote research towards agents that use learning from human feedback (LfHF) techniques to solve open-world tasks. Rather than mandating the use of LfHF techniques,… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted to the PMLR NeurIPS 2021 Demo & Competition Track volume

  4. arXiv:2107.01969  [pdf, other

    cs.LG cs.AI

    The MineRL BASALT Competition on Learning from Human Feedback

    Authors: Rohin Shah, Cody Wild, Steven H. Wang, Neel Alex, Brandon Houghton, William Guss, Sharada Mohanty, Anssi Kanervisto, Stephanie Milani, Nicholay Topin, Pieter Abbeel, Stuart Russell, Anca Dragan

    Abstract: The last decade has seen a significant increase of interest in deep learning research, with many public successes that have demonstrated its potential. As such, these systems are now being incorporated into commercial products. With this comes an additional challenge: how can we build AI systems that solve tasks where there is not a crisp, well-defined specification? While multiple solutions have… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: NeurIPS 2021 Competition Track

  5. arXiv:2103.03386  [pdf, other

    cs.NE

    Clusterability in Neural Networks

    Authors: Daniel Filan, Stephen Casper, Shlomi Hod, Cody Wild, Andrew Critch, Stuart Russell

    Abstract: The learned weights of a neural network have often been considered devoid of scrutable internal structure. In this paper, however, we look for structure in the form of clusterability: how well a network can be divided into groups of neurons with strong internal connectivity but weak external connectivity. We find that a trained neural network is typically more clusterable than randomly initialized… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: 20 pages, 22 figures. arXiv admin note: text overlap with arXiv:2003.04881

  6. arXiv:2010.07017  [pdf

    cs.CY cs.CL stat.OT

    Computational Skills by Stealth in Secondary School Data Science

    Authors: Wesley Burr, Fanny Chevalier, Christopher Collins, Alison L Gibbs, Raymond Ng, Chris Wild

    Abstract: The unprecedented growth in the availability of data of all types and qualities and the emergence of the field of data science has provided an impetus to finally realizing the implementation of the full breadth of the Nolan and Temple Lang proposed integration of computing concepts into statistics curricula at all levels in statistics and new data science programs and courses. Moreover, data scien… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: 38 pages, 8 figures

  7. arXiv:2003.04881  [pdf, other

    cs.NE cs.LG

    Pruned Neural Networks are Surprisingly Modular

    Authors: Daniel Filan, Shlomi Hod, Cody Wild, Andrew Critch, Stuart Russell

    Abstract: The learned weights of a neural network are often considered devoid of scrutable internal structure. To discern structure in these weights, we introduce a measurable notion of modularity for multi-layer perceptrons (MLPs), and investigate the modular structure of MLPs trained on datasets of small images. Our notion of modularity comes from the graph clustering literature: a "module" is a set of ne… ▽ More

    Submitted 7 February, 2022; v1 submitted 10 March, 2020; originally announced March 2020.

    Comments: 25 pages, 12 figures

  8. arXiv:1905.10615  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Adversarial Policies: Attacking Deep Reinforcement Learning

    Authors: Adam Gleave, Michael Dennis, Cody Wild, Neel Kant, Sergey Levine, Stuart Russell

    Abstract: Deep reinforcement learning (RL) policies are known to be vulnerable to adversarial perturbations to their observations, similar to adversarial examples for classifiers. However, an attacker is not usually able to directly modify another agent's observations. This might lead one to wonder: is it possible to attack an RL agent simply by choosing an adversarial policy acting in a multi-agent environ… ▽ More

    Submitted 17 January, 2021; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: Presented at ICLR 2020

    ACM Class: I.2.6

  9. arXiv:1903.05700  [pdf, other

    cs.CR cs.LG stat.ML

    ALOHA: Auxiliary Loss Optimization for Hypothesis Augmentation

    Authors: Ethan M. Rudd, Felipe N. Ducau, Cody Wild, Konstantin Berlin, Richard Harang

    Abstract: Malware detection is a popular application of Machine Learning for Information Security (ML-Sec), in which an ML classifier is trained to predict whether a given file is malware or benignware. Parameters of this classifier are typically optimized such that outputs from the model over a set of input samples most closely match the samples' true malicious/benign (1/0) target labels. However, there ar… ▽ More

    Submitted 13 March, 2019; originally announced March 2019.

    Comments: Pre-print of a manuscript submitted to Usenix Security Symposium 2019

  10. arXiv:1804.05020  [pdf, other

    cs.CR cs.LG stat.ML

    A Deep Learning Approach to Fast, Format-Agnostic Detection of Malicious Web Content

    Authors: Joshua Saxe, Richard Harang, Cody Wild, Hillary Sanders

    Abstract: Malicious web content is a serious problem on the Internet today. In this paper we propose a deep learning approach to detecting malevolent web pages. While past work on web content detection has relied on syntactic parsing or on emulation of HTML and Javascript to extract features, our approach operates directly on a language-agnostic stream of tokens extracted directly from static HTML files wit… ▽ More

    Submitted 13 April, 2018; originally announced April 2018.