Build history
270 submissions recorded
anonymous
Deep Neural Networks (DNNs) are powerful models that have achieved excellent performance on difficult learning tasks. Although DNNs work well whenever large labeled training sets are available, they cannot be used to map sequences to sequences. In this paper, we present a general end-to-end approach to sequence learning that makes minimal assumptions on the sequence structure. Our method uses a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector. Our main result is that on an English to French translation task from the WMT'14 dataset, the translations produced by the LSTM achieve a BLEU score of 34.8 on the entire test set, where the LSTM's BLEU score was penalized on out-of-vocabulary words. Additionally, the LSTM did not have difficulty on long sentences. For comparison, a phrase-based SMT system achieves a BLEU score of 33.3 on the same dataset. When we used the LSTM to rerank the 1000 hypotheses produced by the aforementioned SMT system, its BLEU score increases to 36.5, which is close to the previous best result on this task. The LSTM also learned sensible phrase and sentence representations that are sensitive to word order and are relatively invariant to the active and the passive voice. Finally, we found that reversing the order of the words in all source sentences (but not target sentences) improved the LSTM's performance markedly, because doing so introduced many short term dependencies between the source and the target sentence which made the optimization problem easier.
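A minimal sketch of the encoder-decoder idea with the paper's source-reversal trick, in PyTorch; layer sizes and names here are illustrative assumptions, not the paper's 4-layer, 1000-unit configuration:

```python
# Minimal sketch of the encoder-decoder LSTM idea. Sizes are
# illustrative; the paper used deeper, wider LSTMs.
import torch
import torch.nn as nn

class Seq2Seq(nn.Module):
    def __init__(self, src_vocab, tgt_vocab, dim=256, layers=2):
        super().__init__()
        self.src_emb = nn.Embedding(src_vocab, dim)
        self.tgt_emb = nn.Embedding(tgt_vocab, dim)
        self.encoder = nn.LSTM(dim, dim, layers, batch_first=True)
        self.decoder = nn.LSTM(dim, dim, layers, batch_first=True)
        self.proj = nn.Linear(dim, tgt_vocab)

    def forward(self, src, tgt):
        # Reverse the source, as the paper found this introduces
        # short-term dependencies that ease optimization.
        src = torch.flip(src, dims=[1])
        _, state = self.encoder(self.src_emb(src))   # fixed-size summary
        out, _ = self.decoder(self.tgt_emb(tgt), state)
        return self.proj(out)                        # next-token logits
```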
$ python main.py
anonymous
The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.
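The core primitive this abstract refers to is scaled dot-product attention, softmax(QKᵀ/√d_k)V; a minimal NumPy sketch with illustrative shapes:

```python
# Scaled dot-product attention, the building block of the Transformer.
import numpy as np

def attention(Q, K, V):
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)               # (n_q, n_k) similarities
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V                            # weighted sum of values

rng = np.random.default_rng(0)
Q, K, V = (rng.normal(size=(5, 64)) for _ in range(3))
print(attention(Q, K, V).shape)  # (5, 64)
```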
$ python main.py
anonymous
ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras
$ python main.py
anonymous
Autonomous Demand-Side Management Based on Game-Theoretic Energy Consumption Scheduling for the Future Smart Grid
$ python main.py
anonymous
ORB-SLAM: A Versatile and Accurate Monocular SLAM System
$ python main.py
anonymous
Suppressing errors is the central challenge for useful quantum computing, requiring quantum error correction for large-scale processing. However, the overhead in the realization of error-corrected ``logical'' qubits, where information is encoded across many physical qubits for redundancy, poses significant challenges to large-scale logical quantum computing. Here we report the realization of a programmable quantum processor based on encoded logical qubits operating with up to 280 physical qubits. Utilizing logical-level control and a zoned architecture in reconfigurable neutral atom arrays, our system combines high two-qubit gate fidelities, arbitrary connectivity, as well as fully programmable single-qubit rotations and mid-circuit readout. Operating this logical processor with various types of encodings, we demonstrate improvement of a two-qubit logic gate by scaling surface code distance from d=3 to d=7, preparation of color code qubits with break-even fidelities, fault-tolerant creation of logical GHZ states and feedforward entanglement teleportation, as well as operation of 40 color code qubits. Finally, using three-dimensional [[8,3,2]] code blocks, we realize computationally complex sampling circuits with up to 48 logical qubits entangled with hypercube connectivity with 228 logical two-qubit gates and 48 logical CCZ gates. We find that this logical encoding substantially improves algorithmic performance with error detection, outperforming physical qubit fidelities at both cross-entropy benchmarking and quantum simulations of fast scrambling. These results herald the advent of early error-corrected quantum computation and chart a path toward large-scale logical processors.
$ python main.py
anonymous
Quantum computing promises to offer substantial speed-ups over its classical counterpart for certain problems. However, the greatest impediment to realizing its full potential is noise that is inherent to these systems. The widely accepted solution to this challenge is the implementation of fault-tolerant quantum circuits, which is out of reach for current processors. Here we report experiments on a noisy 127-qubit processor and demonstrate the measurement of accurate expectation values for circuit volumes at a scale beyond brute-force classical computation. We argue that this represents evidence for the utility of quantum computing in a pre-fault-tolerant era. These experimental results are enabled by advances in the coherence and calibration of a superconducting processor at this scale and the ability to characterize and controllably manipulate noise across such a large device. We establish the accuracy of the measured expectation values by comparing them with the output of exactly verifiable circuits. In the regime of strong entanglement, the quantum computer provides correct results for which leading classical approximations such as pure-state-based 1D (matrix product states, MPS) and 2D (isometric tensor network states, isoTNS) tensor network methods break down. These experiments demonstrate a foundational tool for the realization of near-term quantum applications.
$ python main.py
anonymous
Quantum Machine Learning in Feature Hilbert Spaces
$ python main.py
anonymous
Good quantum error-correcting codes exist
$ python main.py
anonymous
Efficient sampling of equilibrium states: Molecular dynamics or Monte Carlo methods can be used to sample equilibrium states, but these methods become computationally expensive for complex systems, where the transition from one equilibrium state to another may only occur through rare events. Noé et al. used neural networks and deep learning to generate distributions of independent soft condensed-matter samples at equilibrium (see the Perspective by Tuckerman). Supervised training is used to construct invertible transformations between the coordinates of the complex system of interest and simple Gaussian coordinates of the same dimensionality. Thus, configurations can be sampled in this simpler coordinate system and then transformed back into the complex one using the correct statistical weighting. Science, this issue p. eaaw1147; see also p. 982
$ python main.py
anonymous
Density matrix formulation for quantum renormalization groups
$ python main.py
anonymous
Observation of a new boson at a mass of 125 GeV with the CMS experiment at the LHC
$ python main.py
anonymous
Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC
$ python main.py
anonymous
Ballistic Focusing of Polyenergetic Protons Driven by Petawatt Laser Pulses
$ python main.py
anonymous
Computer "Experiments" on Classical Fluids. I. Thermodynamical Properties of Lennard-Jones Molecules
$ python main.py
anonymous
We introduce Mixtral 8x7B, a Sparse Mixture of Experts (SMoE) language model. Mixtral has the same architecture as Mistral 7B, with the difference that each layer is composed of 8 feedforward blocks (i.e. experts). For every token, at each layer, a router network selects two experts to process the current state and combine their outputs. Even though each token only sees two experts, the selected experts can be different at each timestep. As a result, each token has access to 47B parameters, but only uses 13B active parameters during inference. Mixtral was trained with a context size of 32k tokens and it outperforms or matches Llama 2 70B and GPT-3.5 across all evaluated benchmarks. In particular, Mixtral vastly outperforms Llama 2 70B on mathematics, code generation, and multilingual benchmarks. We also provide a model fine-tuned to follow instructions, Mixtral 8x7B - Instruct, that surpasses GPT-3.5 Turbo, Claude-2.1, Gemini Pro, and Llama 2 70B - chat model on human benchmarks. Both the base and instruct models are released under the Apache 2.0 license.
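A toy sketch of the per-token top-2 routing described above, for a single token in NumPy; the router and expert weights are random stand-ins, not Mixtral's:

```python
# Top-2 expert routing: the router picks two experts per token and
# combines their outputs with renormalized softmax weights.
import numpy as np

def moe_layer(x, router_W, experts):
    logits = router_W @ x                      # one logit per expert
    top2 = np.argsort(logits)[-2:]             # indices of the two best
    w = np.exp(logits[top2] - logits[top2].max())
    w /= w.sum()                               # softmax over the top 2 only
    return sum(wi * experts[i](x) for wi, i in zip(w, top2))

rng = np.random.default_rng(0)
experts = [lambda x, W=rng.normal(size=(16, 16)): W @ x for _ in range(8)]
x = rng.normal(size=16)
print(moe_layer(x, rng.normal(size=(8, 16)), experts).shape)  # (16,)
```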
$ python main.py
anonymous
Model-free deep reinforcement learning (RL) algorithms have been demonstrated on a range of challenging decision making and control tasks. However, these methods typically suffer from two major challenges: very high sample complexity and brittle convergence properties, which necessitate meticulous hyperparameter tuning. Both of these challenges severely limit the applicability of such methods to complex, real-world domains. In this paper, we propose soft actor-critic, an off-policy actor-critic deep RL algorithm based on the maximum entropy reinforcement learning framework. In this framework, the actor aims to maximize expected reward while also maximizing entropy; that is, it aims to succeed at the task while acting as randomly as possible. Prior deep RL methods based on this framework have been formulated as Q-learning methods. By combining off-policy updates with a stable stochastic actor-critic formulation, our method achieves state-of-the-art performance on a range of continuous control benchmark tasks, outperforming prior on-policy and off-policy methods. Furthermore, we demonstrate that, in contrast to other off-policy algorithms, our approach is very stable, achieving very similar performance across different random seeds.
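A minimal sketch of the entropy-regularized actor update, assuming a Gaussian policy head and an external `q_net` (both hypothetical stand-ins); real SAC also applies tanh squashing with a log-density correction, omitted here:

```python
# Maximum-entropy actor loss: maximize E[Q(s,a) - alpha * log pi(a|s)]
# via the reparameterization trick; negated because optimizers minimize.
import torch

def actor_loss(policy, q_net, states, alpha=0.2):
    mean, log_std = policy(states)                 # Gaussian policy head
    dist = torch.distributions.Normal(mean, log_std.exp())
    actions = dist.rsample()                       # reparameterized sample
    log_prob = dist.log_prob(actions).sum(-1)
    return (alpha * log_prob - q_net(states, actions)).mean()
```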
$ python main.py
anonymous
We propose a new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a "surrogate" objective function using stochastic gradient ascent. Whereas standard policy gradient methods perform one gradient update per data sample, we propose a novel objective function that enables multiple epochs of minibatch updates. The new methods, which we call proximal policy optimization (PPO), have some of the benefits of trust region policy optimization (TRPO), but they are much simpler to implement, more general, and have better sample complexity (empirically). Our experiments test PPO on a collection of benchmark tasks, including simulated robotic locomotion and Atari game playing, and we show that PPO outperforms other online policy gradient methods, and overall strikes a favorable balance between sample complexity, simplicity, and wall-time.
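The clipped surrogate objective is compact enough to state directly; a PyTorch sketch, with advantage estimation assumed done elsewhere:

```python
# PPO clipped surrogate: ratio is pi_new(a|s) / pi_old(a|s), adv is an
# advantage estimate; the min makes the objective a pessimistic bound.
import torch

def ppo_clip_loss(log_prob_new, log_prob_old, adv, eps=0.2):
    ratio = (log_prob_new - log_prob_old).exp()
    unclipped = ratio * adv
    clipped = torch.clamp(ratio, 1 - eps, 1 + eps) * adv
    return -torch.min(unclipped, clipped).mean()  # negate to minimize
```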
$ python main.py
anonymous
This work presents Neural Equivariant Interatomic Potentials (NequIP), an E(3)-equivariant neural network approach for learning interatomic potentials from ab-initio calculations for molecular dynamics simulations. While most contemporary symmetry-aware models use invariant convolutions and only act on scalars, NequIP employs E(3)-equivariant convolutions for interactions of geometric tensors, resulting in a more information-rich and faithful representation of atomic environments. The method achieves state-of-the-art accuracy on a challenging and diverse set of molecules and materials while exhibiting remarkable data efficiency. NequIP outperforms existing models with up to three orders of magnitude fewer training data, challenging the widely held belief that deep neural networks require massive training sets. The high data efficiency of the method allows for the construction of accurate potentials using high-order quantum chemical level of theory as reference and enables high-fidelity molecular dynamics simulations over long time scales.
$ python main.py
anonymous
In this paper, we question if self-supervised learning provides new properties to Vision Transformer (ViT) that stand out compared to convolutional networks (convnets). Beyond the fact that adapting self-supervised methods to this architecture works particularly well, we make the following observations: first, self-supervised ViT features contain explicit information about the semantic segmentation of an image, which does not emerge as clearly with supervised ViTs, nor with convnets. Second, these features are also excellent k-NN classifiers, reaching 78.3% top-1 on ImageNet with a small ViT. Our study also underlines the importance of momentum encoder, multi-crop training, and the use of small patches with ViTs. We implement our findings into a simple self-supervised method, called DINO, which we interpret as a form of self-distillation with no labels. We show the synergy between DINO and ViTs by achieving 80.1% top-1 on ImageNet in linear evaluation with ViT-Base.
$ python main.py
anonymous
While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in conjunction with convolutional networks, or used to replace certain components of convolutional networks while keeping their overall structure in place. We show that this reliance on CNNs is not necessary and a pure transformer applied directly to sequences of image patches can perform very well on image classification tasks. When pre-trained on large amounts of data and transferred to multiple mid-sized or small image recognition benchmarks (ImageNet, CIFAR-100, VTAB, etc.), Vision Transformer (ViT) attains excellent results compared to state-of-the-art convolutional networks while requiring substantially fewer computational resources to train.
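A NumPy sketch of the patch-embedding step the abstract describes; the 16-pixel patch size and the projection matrix are illustrative:

```python
# Split an image into non-overlapping patches, flatten each one, and
# project it linearly to get a sequence of tokens for the Transformer.
import numpy as np

def patchify(img, patch=16):
    H, W, C = img.shape
    img = img.reshape(H // patch, patch, W // patch, patch, C)
    patches = img.transpose(0, 2, 1, 3, 4).reshape(-1, patch * patch * C)
    return patches                       # (num_patches, patch*patch*C)

rng = np.random.default_rng(0)
proj = rng.normal(size=(768, 16 * 16 * 3))       # learned in practice
tokens = patchify(rng.normal(size=(224, 224, 3))) @ proj.T
print(tokens.shape)                              # (196, 768)
```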
$ python main.py
anonymous
The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.
$ python main.py
anonymous
Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific fine-tuning datasets of thousands or tens of thousands of examples. By contrast, humans can generally perform a new language task from only a few examples or from simple instructions - something which current NLP systems still largely struggle to do. Here we show that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches. Specifically, we train GPT-3, an autoregressive language model with 175 billion parameters, 10x more than any previous non-sparse language model, and test its performance in the few-shot setting. For all tasks, GPT-3 is applied without any gradient updates or fine-tuning, with tasks and few-shot demonstrations specified purely via text interaction with the model. GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic. At the same time, we also identify some datasets where GPT-3's few-shot learning still struggles, as well as some datasets where GPT-3 faces methodological issues related to training on large web corpora. Finally, we find that GPT-3 can generate samples of news articles which human evaluators have difficulty distinguishing from articles written by humans. We discuss broader societal impacts of this finding and of GPT-3 in general.
$ python main.py
anonymous
In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.
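A worked check of the design rationale: stacking small 3x3 filters matches the receptive field of one larger filter with fewer weights (channel count illustrative):

```python
# Parameter counts for equal receptive fields, C channels in and out.
C = 256
stack_3x3 = 2 * (3 * 3 * C * C)      # two 3x3 layers, receptive field 5x5
single_5x5 = 5 * 5 * C * C           # one 5x5 layer, same receptive field
print(stack_3x3, single_5x5)         # 1179648 < 1638400
three_3x3 = 3 * (3 * 3 * C * C)      # three 3x3 layers, receptive field 7x7
single_7x7 = 7 * 7 * C * C
print(three_3x3, single_7x7)         # 1769472 < 3211264
```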
$ python main.py
anonymous
Exploration in environments with sparse rewards has been a persistent problem in reinforcement learning (RL). Many tasks are natural to specify with a sparse reward, and manually shaping a reward function can result in suboptimal performance. However, finding a non-zero reward is exponentially more difficult with increasing task horizon or action dimensionality. This puts many real-world tasks out of practical reach of RL methods. In this work, we use demonstrations to overcome the exploration problem and successfully learn to perform long-horizon, multi-step robotics tasks with continuous control such as stacking blocks with a robot arm. Our method, which builds on top of Deep Deterministic Policy Gradients and Hindsight Experience Replay, provides an order of magnitude of speedup over RL on simulated robotics tasks. It is simple to implement and makes only the additional assumption that we can collect a small set of demonstrations. Furthermore, our method is able to solve tasks not solvable by either RL or behavior cloning alone, and often ends up outperforming the demonstrator policy.
$ python main.py
anonymous
Designing agile locomotion for quadruped robots often requires extensive expertise and tedious manual tuning. In this paper, we present a system to automate this process by leveraging deep reinforcement learning techniques. Our system can learn quadruped locomotion from scratch using simple reward signals. In addition, users can provide an open loop reference to guide the learning process when more control over the learned gait is needed. The control policies are learned in a physics simulator and then deployed on real robots. In robotics, policies trained in simulation often do not transfer to the real world. We narrow this reality gap by improving the physics simulator and learning robust policies. We improve the simulation using system identification, developing an accurate actuator model and simulating latency. We learn robust controllers by randomizing the physical environments, adding perturbations and designing a compact observation space. We evaluate our system on two agile locomotion gaits: trotting and galloping. After learning in simulation, a quadruped robot can successfully perform both gaits in the real world.
$ python main.py
anonymous
Using synthetic data for training deep neural networks for robotic manipulation holds the promise of an almost unlimited amount of pre-labeled training data, generated safely out of harm's way. One of the key challenges of synthetic data, to date, has been to bridge the so-called reality gap, so that networks trained on synthetic data operate correctly when exposed to real-world data. We explore the reality gap in the context of 6-DoF pose estimation of known objects from a single RGB image. We show that for this problem the reality gap can be successfully spanned by a simple combination of domain randomized and photorealistic data. Using synthetic data generated in this manner, we introduce a one-shot deep neural network that is able to perform competitively against a state-of-the-art network trained on a combination of real and synthetic data. To our knowledge, this is the first deep network trained only on synthetic data that is able to achieve state-of-the-art performance on 6-DoF object pose estimation. Our network also generalizes better to novel environments including extreme lighting conditions, for which we show qualitative results. Using this network we demonstrate a real-time system estimating object poses with sufficient accuracy for real-world semantic grasping of known household objects in clutter by a real robot.
$ python main.py
anonymous
Minimum snap trajectory generation and control for quadrotors
$ python main.py
anonymous
To reduce data collection time for deep learning of robust robotic grasp plans, we explore training from a synthetic dataset of 6.7 million point clouds, grasps, and analytic grasp metrics generated from thousands of 3D models from Dex-Net 1.0 in randomized poses on a table. We use the resulting dataset, Dex-Net 2.0, to train a Grasp Quality Convolutional Neural Network (GQ-CNN) model that rapidly predicts the probability of success of grasps from depth images, where grasps are specified as the planar position, angle, and depth of a gripper relative to an RGB-D sensor. Experiments with over 1,000 trials on an ABB YuMi comparing grasp planning methods on singulated objects suggest that a GQ-CNN trained with only synthetic data from Dex-Net 2.0 can be used to plan grasps in 0.8sec with a success rate of 93% on eight known objects with adversarial geometry and is 3x faster than registering point clouds to a precomputed dataset of objects and indexing grasps. The Dex-Net 2.0 grasp planner also has the highest success rate on a dataset of 10 novel rigid objects and achieves 99% precision (one false positive out of 69 grasps classified as robust) on a dataset of 40 novel household objects, some of which are articulated or deformable. Code, datasets, videos, and supplementary material are available at http://berkeleyautomation.github.io/dex-net .
$ python main.py
anonymous
Climate change impacts on wind energy: A review
$ python main.py
anonymous
PerCom 2008 advertisement
$ python main.py
anonymous
In this paper, we propose a locomotion training framework where a control policy and a state estimator are trained concurrently. The framework consists of a policy network which outputs the desired joint positions and a state estimation network which outputs estimates of the robot's states such as the base linear velocity, foot height, and contact probability. We exploit a fast simulation environment to train the networks and the trained networks are transferred to the real robot. The trained policy and state estimator are capable of traversing diverse terrains such as a hill, slippery plate, and bumpy road. We also demonstrate that the learned policy can run at up to 3.75 m/s on normal flat ground and 3.54 m/s on a slippery plate with a coefficient of friction of 0.22.
$ python main.py
anonymous
ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras
$ python main.py
anonymous
Autonomous Demand-Side Management Based on Game-Theoretic Energy Consumption Scheduling for the Future Smart Grid
$ python main.py
anonymous
Intelligent control algorithms for robotic-assisted beating heart surgery
$ python main.py
anonymous
ORB-SLAM: A Versatile and Accurate Monocular SLAM System
$ python main.py
anonymous
Quantum computing promises to offer substantial speed-ups over its classical counterpart for certain problems. However, the greatest impediment to realizing its full potential is noise that is inherent to these systems. The widely accepted solution to this challenge is the implementation of fault-tolerant quantum circuits, which is out of reach for current processors. Here we report experiments on a noisy 127-qubit processor and demonstrate the measurement of accurate expectation values for circuit volumes at a scale beyond brute-force classical computation. We argue that this represents evidence for the utility of quantum computing in a pre-fault-tolerant era. These experimental results are enabled by advances in the coherence and calibration of a superconducting processor at this scale and the ability to characterize and controllably manipulate noise across such a large device. We establish the accuracy of the measured expectation values by comparing them with the output of exactly verifiable circuits. In the regime of strong entanglement, the quantum computer provides correct results for which leading classical approximations such as pure-state-based 1D (matrix product states, MPS) and 2D (isometric tensor network states, isoTNS) tensor network methods break down. These experiments demonstrate a foundational tool for the realization of near-term quantum applications.
$ python main.py
anonymous
Suppressing errors is the central challenge for useful quantum computing, requiring quantum error correction for large-scale processing. However, the overhead in the realization of error-corrected ``logical'' qubits, where information is encoded across many physical qubits for redundancy, poses significant challenges to large-scale logical quantum computing. Here we report the realization of a programmable quantum processor based on encoded logical qubits operating with up to 280 physical qubits. Utilizing logical-level control and a zoned architecture in reconfigurable neutral atom arrays, our system combines high two-qubit gate fidelities, arbitrary connectivity, as well as fully programmable single-qubit rotations and mid-circuit readout. Operating this logical processor with various types of encodings, we demonstrate improvement of a two-qubit logic gate by scaling surface code distance from d=3 to d=7, preparation of color code qubits with break-even fidelities, fault-tolerant creation of logical GHZ states and feedforward entanglement teleportation, as well as operation of 40 color code qubits. Finally, using three-dimensional [[8,3,2]] code blocks, we realize computationally complex sampling circuits with up to 48 logical qubits entangled with hypercube connectivity with 228 logical two-qubit gates and 48 logical CCZ gates. We find that this logical encoding substantially improves algorithmic performance with error detection, outperforming physical qubit fidelities at both cross-entropy benchmarking and quantum simulations of fast scrambling. These results herald the advent of early error-corrected quantum computation and chart a path toward large-scale logical processors.
$ python main.py
anonymous
We propose a classical-quantum hybrid algorithm for machine learning on near-term quantum processors, which we call quantum circuit learning. A quantum circuit driven by our framework learns a given task by tuning parameters implemented on it. The iterative optimization of the parameters allows us to circumvent the high-depth circuit. Theoretical investigation shows that a quantum circuit can approximate nonlinear functions, which is further confirmed by numerical simulations. Hybridizing a low-depth quantum circuit and a classical computer for machine learning, the proposed framework paves the way toward applications of near-term quantum devices for quantum machine learning.
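A toy version of the hybrid loop, assuming a one-parameter RY circuit whose Z expectation is cos(theta); the parameter-shift rule used for the gradient is exact for such gates:

```python
# Classical optimizer tuning a quantum-circuit parameter to hit a
# target expectation value: the essence of the hybrid framework.
import numpy as np

def expectation_z(theta):
    # RY(theta)|0> = cos(theta/2)|0> + sin(theta/2)|1>  ->  <Z> = cos(theta)
    return np.cos(theta)

def parameter_shift_grad(theta, s=np.pi / 2):
    return (expectation_z(theta + s) - expectation_z(theta - s)) / 2

theta, target, lr = 0.1, -0.5, 0.5
for _ in range(100):
    err = expectation_z(theta) - target
    theta -= lr * 2 * err * parameter_shift_grad(theta)  # chain rule
print(theta, expectation_z(theta))  # <Z> converges to the target -0.5
```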
$ python main.py
anonymous
Quantum Machine Learning in Feature Hilbert Spaces
$ python main.py
anonymous
Supervised learning with quantum-enhanced feature spaces
$ python main.py
anonymous
Probing many-body dynamics on a 51-atom quantum simulator
$ python main.py
anonymous
Feynman's 1982 conjecture, that quantum computers can be programmed to simulate any local quantum system, is shown to be correct.
$ python main.py
anonymous
Long-distance quantum communication with atomic ensembles and linear optics
$ python main.py
anonymous
Quantum cryptography based on Bell’s theorem
$ python main.py
anonymous
Quantum cryptography: Public key distribution and coin tossing
$ python main.py
anonymous
Quantum dynamics of single trapped ions
$ python main.py
anonymous
Circuit quantum electrodynamics
$ python main.py
anonymous
Cavity quantum electrodynamics for superconducting electrical circuits: An architecture for quantum computation
$ python main.py
anonymous
Practical quantum computing will require error rates well below those achievable with physical qubits. Quantum error correction offers a path to algorithmically relevant error rates by encoding logical qubits within many physical qubits, for which increasing the number of physical qubits enhances protection against physical errors. However, introducing more qubits also increases the number of error sources, so the density of errors must be sufficiently low for logical performance to improve with increasing code size. Here we report the measurement of logical qubit performance scaling across several code sizes, and demonstrate that our system of superconducting qubits has sufficient performance to overcome the additional errors from increasing qubit number. We find that our distance-5 surface code logical qubit modestly outperforms an ensemble of distance-3 logical qubits on average, in terms of both logical error probability over 25 cycles and logical error per cycle ((2.914 ± 0.016)% compared to (3.028 ± 0.023)%). To investigate damaging, low-probability error sources, we run a distance-25 repetition code and observe a 1.7 × 10⁻⁶ logical error per cycle floor set by a single high-energy event (1.6 × 10⁻⁷ excluding this event). We accurately model our experiment, extracting error budgets that highlight the biggest challenges for future systems. These results mark an experimental demonstration in which quantum error correction begins to improve performance with increasing qubit number, illuminating the path to reaching the logical error rates required for computation.
$ python main.py
anonymous
Practical quantum computing will require error rates that are well below what is achievable with physical qubits. Quantum error correction offers a path to algorithmically-relevant error rates by encoding logical qubits within many physical qubits, where increasing the number of physical qubits enhances protection against physical errors. However, introducing more qubits also increases the number of error sources, so the density of errors must be sufficiently low in order for logical performance to improve with increasing code size. Here, we report the measurement of logical qubit performance scaling across multiple code sizes, and demonstrate that our system of superconducting qubits has sufficient performance to overcome the additional errors from increasing qubit number. We find our distance-5 surface code logical qubit modestly outperforms an ensemble of distance-3 logical qubits on average, both in terms of logical error probability over 25 cycles and logical error per cycle ($2.914\%\pm 0.016\%$ compared to $3.028\%\pm 0.023\%$). To investigate damaging, low-probability error sources, we run a distance-25 repetition code and observe a $1.7\times10^{-6}$ logical error per round floor set by a single high-energy event ($1.6\times10^{-7}$ when excluding this event). We are able to accurately model our experiment, and from this model we can extract error budgets that highlight the biggest challenges for future systems. These results mark the first experimental demonstration where quantum error correction begins to improve performance with increasing qubit number, illuminating the path to reaching the logical error rates required for computation.
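A back-of-envelope illustration of the scaling argument, using a classical repetition code with independent bit-flip probability p per qubit; the surface-code experiment is far richer (stabilizer cycles, decoding), but the sub-threshold suppression with distance is the same phenomenon:

```python
# For a distance-d repetition code, a logical error needs more than
# d//2 flips; below threshold, growing d suppresses it exponentially.
from math import comb

def logical_error(p, d):
    return sum(comb(d, k) * p**k * (1 - p)**(d - k)
               for k in range(d // 2 + 1, d + 1))

for d in (3, 5, 7, 25):
    print(d, logical_error(0.01, d))  # drops rapidly with distance
```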
$ python main.py
anonymous
We demonstrate a decoherence-free quantum memory of one qubit. By encoding the qubit into the decoherence-free subspace (DFS) of a pair of trapped ⁹Be⁺ ions, we protect the qubit from environment-induced dephasing that limits the storage time of a qubit composed of a single ion. We measured the storage time under ambient conditions and under interaction with an engineered noisy environment and observed that encoding into the DFS increases the storage time by up to an order of magnitude. The encoding reversibly transfers an arbitrary qubit stored in a single ion to the DFS of two ions.
$ python main.py
anonymous
Magic-state distillation with low overhead
$ python main.py
anonymous
Good quantum error-correcting codes exist
$ python main.py
anonymous
Scheme for reducing decoherence in quantum computer memory
$ python main.py
anonymous
A light approach to quantum advantage: Quantum computational advantage or supremacy is a long-anticipated milestone toward practical quantum computers. Recent work claimed to have reached this point, but subsequent work managed to speed up the classical simulation and pointed toward a sample size–dependent loophole. Quantum computational advantage, rather than being a one-shot experimental proof, will be the result of a long-term competition between quantum devices and classical simulation. Zhong et al. sent 50 indistinguishable single-mode squeezed states into a 100-mode ultralow-loss interferometer and sampled the output using 100 high-efficiency single-photon detectors. By obtaining up to 76-photon coincidence, yielding a state-space dimension of about 10³⁰, they measured a sampling rate that is about 10¹⁴-fold faster than using state-of-the-art classical simulation strategies and supercomputers. Science, this issue p. 1460
$ python main.py
anonymous
Quantum supremacy using a programmable superconducting processor
$ python main.py
anonymous
Hardware-efficient variational quantum eigensolver for small molecules and quantum magnets
$ python main.py
anonymous
We introduce a quantum algorithm that produces approximate solutions for combinatorial optimization problems. The algorithm depends on a positive integer p and the quality of the approximation improves as p is increased. The quantum circuit that implements the algorithm consists of unitary gates whose locality is at most the locality of the objective function whose optimum is sought. The depth of the circuit grows linearly with p times (at worst) the number of constraints. If p is fixed, that is, independent of the input size, the algorithm makes use of efficient classical preprocessing. If p grows with the input size a different strategy is proposed. We study the algorithm as applied to MaxCut on regular graphs and analyze its performance on 2-regular and 3-regular graphs for fixed p. For p = 1, on 3-regular graphs the quantum algorithm always finds a cut that is at least 0.6924 times the size of the optimal cut.
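A brute-force statevector sketch of the p = 1 algorithm on a toy MaxCut instance; the graph, qubit count, and grid search over the two angles are illustrative assumptions:

```python
# QAOA at p = 1: prepare |+...+>, apply the diagonal cost phase
# e^{-i gamma C}, then the transverse-field mixer RX(2 beta) per qubit.
import numpy as np

edges = [(0, 1), (1, 2), (2, 0), (2, 3)]       # hypothetical toy graph
n = 4
z = np.arange(2**n)
bits = (z[:, None] >> np.arange(n)) & 1        # bit j of each basis state
cut = sum(bits[:, u] ^ bits[:, v] for u, v in edges)  # MaxCut cost C(z)

def qaoa_p1(gamma, beta):
    psi = np.ones(2**n, dtype=complex) / 2**(n / 2)   # |+...+>
    psi = psi * np.exp(-1j * gamma * cut)             # cost layer
    for q in range(n):                                # mixer layer
        psi = psi.reshape(2**(n - q - 1), 2, 2**q)
        a, b = psi[:, 0, :].copy(), psi[:, 1, :].copy()
        psi[:, 0, :] = np.cos(beta) * a - 1j * np.sin(beta) * b
        psi[:, 1, :] = np.cos(beta) * b - 1j * np.sin(beta) * a
        psi = psi.reshape(-1)
    return np.abs(psi)**2 @ cut                       # expected cut size

grid = np.linspace(0, np.pi, 40)                      # crude angle search
best = max((qaoa_p1(g, b), g, b) for g in grid for b in grid)
print(best[0], "of optimal", cut.max())
```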
$ python main.py
anonymous
A variational eigenvalue solver on a photonic quantum processor
$ python main.py
anonymous
Surface codes: Towards practical large-scale quantum computation
$ python main.py
anonymous
Quantum Algorithm for Linear Systems of Equations
$ python main.py
anonymous
A class of problems is described which can be solved more efficiently by quantum computation than by any classical or stochastic method. The quantum computation solves the problem with certainty in exponentially less time than any classical deterministic computation.
$ python main.py
anonymous
Quantum Mechanics Helps in Searching for a Needle in a Haystack
$ python main.py
anonymous
Polynomial-Time Algorithms for Prime Factorization and Discrete Logarithms on a Quantum Computer
$ python main.py
anonymous
This work presents Neural Equivariant Interatomic Potentials (NequIP), an E(3)-equivariant neural network approach for learning interatomic potentials from ab-initio calculations for molecular dynamics simulations. While most contemporary symmetry-aware models use invariant convolutions and only act on scalars, NequIP employs E(3)-equivariant convolutions for interactions of geometric tensors, resulting in a more information-rich and faithful representation of atomic environments. The method achieves state-of-the-art accuracy on a challenging and diverse set of molecules and materials while exhibiting remarkable data efficiency. NequIP outperforms existing models with up to three orders of magnitude fewer training data, challenging the widely held belief that deep neural networks require massive training sets. The high data efficiency of the method allows for the construction of accurate potentials using high-order quantum chemical level of theory as reference and enables high-fidelity molecular dynamics simulations over long time scales.
$ python main.py
anonymous
Efficient sampling of equilibrium states: Molecular dynamics or Monte Carlo methods can be used to sample equilibrium states, but these methods become computationally expensive for complex systems, where the transition from one equilibrium state to another may only occur through rare events. Noé et al. used neural networks and deep learning to generate distributions of independent soft condensed-matter samples at equilibrium (see the Perspective by Tuckerman). Supervised training is used to construct invertible transformations between the coordinates of the complex system of interest and simple Gaussian coordinates of the same dimensionality. Thus, configurations can be sampled in this simpler coordinate system and then transformed back into the complex one using the correct statistical weighting. Science, this issue p. eaaw1147; see also p. 982
$ python main.py
anonymous
Hydrodynamics of soft active matter
$ python main.py
anonymous
Novel Type of Phase Transition in a System of Self-Driven Particles
$ python main.py
anonymous
Density matrix formulation for quantum renormalization groups
$ python main.py
anonymous
Noisy intermediate-scale quantum algorithms
$ python main.py
anonymous
On September 14, 2015 at 09:50:45 UTC the two detectors of the Laser Interferometer Gravitational-Wave Observatory simultaneously observed a transient gravitational-wave signal. The signal sweeps upwards in frequency from 35 to 250 Hz with a peak gravitational-wave strain of $1.0\times10^{-21}$. It matches the waveform predicted by general relativity for the inspiral and merger of a pair of black holes and the ringdown of the resulting single black hole. The signal was observed with a matched-filter signal-to-noise ratio of 24 and a false alarm rate estimated to be less than 1 event per 203 000 years, equivalent to a significance greater than 5.1σ. The source lies at a luminosity distance of $410^{+160}_{-180}$ Mpc corresponding to a redshift $z = 0.09^{+0.03}_{-0.04}$. In the source frame, the initial black hole masses are $36^{+5}_{-4}\,M_\odot$ and $29^{+4}_{-4}\,M_\odot$, and the final black hole mass is $62^{+4}_{-4}\,M_\odot$, with $3.0^{+0.5}_{-0.5}\,M_\odot c^2$ radiated in gravitational waves. All uncertainties define 90% credible intervals. These observations demonstrate the existence of binary stellar-mass black hole systems. This is the first direct detection of gravitational waves and the first observation of a binary black hole merger.
$ python main.py
anonymous
We present cosmological parameter results from the final full-mission Planck measurements of the cosmic microwave background (CMB) anisotropies, combining information from the temperature and polarization maps and the lensing reconstruction. Compared to the 2015 results, improved measurements of large-scale polarization allow the reionization optical depth to be measured with higher precision, leading to significant gains in the precision of other correlated parameters. Improved modelling of the small-scale polarization leads to more robust constraints on many parameters, with residual modelling uncertainties estimated to affect them only at the 0.5σ level. We find good consistency with the standard spatially-flat 6-parameter ΛCDM cosmology having a power-law spectrum of adiabatic scalar perturbations (denoted “base ΛCDM” in this paper), from polarization, temperature, and lensing, separately and in combination. A combined analysis gives dark matter density $\Omega_c h^2 = 0.120 \pm 0.001$, baryon density $\Omega_b h^2 = 0.0224 \pm 0.0001$, scalar spectral index $n_s = 0.965 \pm 0.004$, and optical depth $\tau = 0.054 \pm 0.007$ (in this abstract we quote 68% confidence regions on measured parameters and 95% on upper limits). The angular acoustic scale is measured to 0.03% precision, with $100\theta_* = 1.0411 \pm 0.0003$. These results are only weakly dependent on the cosmological model and remain stable, with somewhat increased errors, in many commonly considered extensions. Assuming the base-ΛCDM cosmology, the inferred (model-dependent) late-Universe parameters are: Hubble constant $H_0 = (67.4 \pm 0.5)$ km s$^{-1}$ Mpc$^{-1}$.
$ python main.py
anonymous
Observation of a new boson at a mass of 125 GeV with the CMS experiment at the LHC
$ python main.py
anonymous
Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC
$ python main.py
anonymous
The predictions of gyrokinetic and gyrofluid simulations of ion-temperature-gradient (ITG) instability and turbulence in tokamak plasmas as well as some tokamak plasma thermal transport models, which have been widely used for predicting the performance of the proposed International Thermonuclear Experimental Reactor (ITER) tokamak [Plasma Physics and Controlled Nuclear Fusion Research, 1996 (International Atomic Energy Agency, Vienna, 1997), Vol. 1, p. 3], are compared. These comparisons provide information on effects of differences in the physics content of the various models and on the fusion-relevant figures of merit of plasma performance predicted by the models. Many of the comparisons are undertaken for a simplified plasma model and geometry which is an idealization of the plasma conditions and geometry in a Doublet III-D [Plasma Physics and Controlled Nuclear Fusion Research, 1986 (International Atomic Energy Agency, Vienna, 1987), Vol. 1, p. 159] high confinement (H-mode) experiment. Most of the models show good agreements in their predictions and assumptions for the linear growth rates and frequencies. There are some differences associated with different equilibria. However, there are significant differences in the transport levels between the models. The causes of some of the differences are examined in some detail, with particular attention to numerical convergence in the turbulence simulations (with respect to simulation mesh size, system size and, for particle-based simulations, the particle number). The implications for predictions of fusion plasma performance are also discussed.
$ python main.py
anonymous
Ballistic Focusing of Polyenergetic Protons Driven by Petawatt Laser Pulses
$ python main.py
anonymous
$Z_2$ Topological Order and the Quantum Spin Hall Effect
$ python main.py
anonymous
New Method for High-Accuracy Determination of the Fine-Structure Constant Based on Quantized Hall Resistance
$ python main.py
anonymous
We present an overview of the lattice Boltzmann method (LBM), a parallel and efficient algorithm for simulating single-phase and multiphase fluid flows and for incorporating additional physical complexities. The LBM is especially useful for modeling complicated boundary conditions and multiphase interfaces. Recent extensions of this method are described, including simulations of fluid turbulence, suspension flows, and reaction diffusion systems.
$ python main.py
anonymous
Recovery of the Navier-Stokes equations using a lattice-gas Boltzmann method
$ python main.py
anonymous
New Monte Carlo technique for studying phase transitions
$ python main.py
anonymous
Nonuniversal critical dynamics in Monte Carlo simulations
$ python main.py
anonymous
Computer "Experiments" on Classical Fluids. I. Thermodynamical Properties of Lennard-Jones Molecules
$ python main.py
anonymous
Fast Parallel Algorithms for Short-Range Molecular Dynamics
$ python main.py
anonymous
Ab initio effective core potentials (ECP’s) have been generated to replace the Coulomb, exchange, and core-orthogonality effects of the chemically inert core electrons in the transition metal atoms Sc to Hg. For the second and third transition series, relativistic ECP’s have been generated which also incorporate the mass–velocity and Darwin relativistic effects into the potential. The ab initio ECP’s should facilitate valence electron calculations on molecules containing transition-metal atoms with accuracies approaching all-electron calculations at a fraction of the computational cost. Analytic fits to the potentials are presented for use in multicenter integral evaluation. Gaussian orbital valence basis sets are developed for the (3d,4s,4p), (4d,5s,5p), and (5d,6s,6p) orbitals of the first, second, and third transition series atoms, respectively. All-electron and valence-electron atomic excitation energies are also compared for the low-lying states of Sc–Hg, and the valence-electron calculations are found to reproduce the all-electron excitation energies (typically within a few tenths of an eV).
$ python main.py
anonymous
In molecular dynamics (MD) simulations the need often arises to maintain such parameters as temperature or pressure rather than energy and volume, or to impose gradients for studying transport properties in nonequilibrium MD. A method is described to realize coupling to an external bath with constant temperature or pressure with adjustable time constants for the coupling. The method is easily extendable to other variables and to gradients, and can be applied also to polyatomic molecules involving internal constraints. The influence of coupling time constants on dynamical variables is evaluated. A leap-frog algorithm is presented for the general case involving constraints with coupling to both a constant temperature and a constant pressure bath.
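A sketch of the weak-coupling velocity-rescaling step, using the scaling factor λ = [1 + (Δt/τ)(T₀/T − 1)]^½ from the paper; the kinetic-temperature estimate is simplified (kB = 1, no constraint or center-of-mass corrections):

```python
# Each step scales velocities toward the bath temperature T0 with
# time constant tau, relaxing T geometrically by (1 - dt/tau) per step.
import numpy as np

def berendsen_scale(vel, masses, T0, dt, tau, kB=1.0):
    ndof = vel.size                               # simplified DOF count
    T = (masses[:, None] * vel**2).sum() / (ndof * kB)
    lam = np.sqrt(1.0 + (dt / tau) * (T0 / T - 1.0))
    return lam * vel

rng = np.random.default_rng(0)
v, m = rng.normal(size=(100, 3)), np.ones(100)
for _ in range(50):
    v = berendsen_scale(v, m, T0=0.5, dt=0.01, tau=0.1)
# the kinetic temperature has relaxed toward T0 = 0.5
```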
$ python main.py
anonymous
In the molecular dynamics simulation method for fluids, the equations of motion for a collection of particles in a fixed volume are solved numerically. The energy, volume, and number of particles are constant for a particular simulation, and it is assumed that time averages of properties of the simulated fluid are equal to microcanonical ensemble averages of the same properties. In some situations, it is desirable to perform simulations of a fluid for particular values of temperature and/or pressure or under conditions in which the energy and volume of the fluid can fluctuate. This paper proposes and discusses three methods for performing molecular dynamics simulations under conditions of constant temperature and/or pressure, rather than constant energy and volume. For these three methods, it is shown that time averages of properties of the simulated fluid are equal to averages over the isoenthalpic–isobaric, canonical, and isothermal–isobaric ensembles. Each method is a way of describing the dynamics of a certain number of particles in a volume element of a fluid while taking into account the influence of surrounding particles in changing the energy and/or density of the simulated volume element. The influence of the surroundings is taken into account without introducing unwanted surface effects. Examples of situations where these methods may be useful are discussed.
$ python main.py
anonymous
Computer "Experiments" on Classical Fluids. I. Thermodynamical Properties of Lennard-Jones Molecules
$ python main.py
anonymous
Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set
$ python main.py
anonymous
Despite the remarkable thermochemical accuracy of Kohn–Sham density-functional theories with gradient corrections for exchange-correlation [see, for example, A. D. Becke, J. Chem. Phys. 96, 2155 (1992)], we believe that further improvements are unlikely unless exact-exchange information is considered. Arguments to support this view are presented, and a semiempirical exchange-correlation functional containing local-spin-density, gradient, and exact-exchange terms is tested on 56 atomization energies, 42 ionization potentials, 8 proton affinities, and 10 total atomic energies of first- and second-row systems. This functional performs significantly better than previous functionals with gradient corrections only, and fits experimental atomization energies with an impressively small average absolute deviation of 2.4 kcal/mol.
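The semiempirical three-parameter hybrid form the abstract describes, with the fitted coefficients as we read them from the paper:

```latex
E_{xc} = E_{xc}^{\mathrm{LSDA}}
       + a_0\,\bigl(E_x^{\mathrm{exact}} - E_x^{\mathrm{LSDA}}\bigr)
       + a_x\,\Delta E_x^{\mathrm{B88}}
       + a_c\,\Delta E_c^{\mathrm{PW91}},
\qquad a_0 = 0.20,\; a_x = 0.72,\; a_c = 0.81
```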
$ python main.py
anonymous
Generalized Gradient Approximation Made Simple
$ python main.py
anonymous
Inhomogeneous Electron Gas
$ python main.py
anonymous
Self-Consistent Equations Including Exchange and Correlation Effects
$ python main.py
anonymous
The capacity of a neural network to absorb information is limited by its number of parameters. Conditional computation, where parts of the network are active on a per-example basis, has been proposed in theory as a way of dramatically increasing model capacity without a proportional increase in computation. In practice, however, there are significant algorithmic and performance challenges. In this work, we address these challenges and finally realize the promise of conditional computation, achieving greater than 1000x improvements in model capacity with only minor losses in computational efficiency on modern GPU clusters. We introduce a Sparsely-Gated Mixture-of-Experts layer (MoE), consisting of up to thousands of feed-forward sub-networks. A trainable gating network determines a sparse combination of these experts to use for each example. We apply the MoE to the tasks of language modeling and machine translation, where model capacity is critical for absorbing the vast quantities of knowledge available in the training corpora. We present model architectures in which a MoE with up to 137 billion parameters is applied convolutionally between stacked LSTM layers. On large language modeling and machine translation benchmarks, these models achieve significantly better results than state-of-the-art at lower computational cost.
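A single-example NumPy sketch of noisy top-k gating: trainable noise is added to the gate logits, all but the top k are masked to negative infinity, and a softmax yields sparse mixture weights; all weight matrices here are random stand-ins:

```python
# Noisy top-k gating for a sparsely-gated MoE layer (single example).
import numpy as np

def noisy_topk_gate(x, Wg, Wnoise, k=2, rng=np.random.default_rng(0)):
    clean = Wg @ x
    noise = rng.normal(size=clean.shape) * np.log1p(np.exp(Wnoise @ x))
    logits = clean + noise                      # softplus noise scale
    mask = np.full_like(logits, -np.inf)
    top = np.argsort(logits)[-k:]
    mask[top] = logits[top]                     # keep only top-k logits
    w = np.exp(mask - mask[top].max())
    return w / w.sum()                          # sparse mixture weights

rng = np.random.default_rng(1)
g = noisy_topk_gate(rng.normal(size=32), rng.normal(size=(8, 32)),
                    rng.normal(size=(8, 32)))
print(np.count_nonzero(g))  # 2 of 8 experts active
```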
$ python main.py
anonymous
We introduce Mixtral 8x7B, a Sparse Mixture of Experts (SMoE) language model. Mixtral has the same architecture as Mistral 7B, with the difference that each layer is composed of 8 feedforward blocks (i.e. experts). For every token, at each layer, a router network selects two experts to process the current state and combine their outputs. Even though each token only sees two experts, the selected experts can be different at each timestep. As a result, each token has access to 47B parameters, but only uses 13B active parameters during inference. Mixtral was trained with a context size of 32k tokens and it outperforms or matches Llama 2 70B and GPT-3.5 across all evaluated benchmarks. In particular, Mixtral vastly outperforms Llama 2 70B on mathematics, code generation, and multilingual benchmarks. We also provide a model fine-tuned to follow instructions, Mixtral 8x7B - Instruct, that surpasses GPT-3.5 Turbo, Claude-2.1, Gemini Pro, and Llama 2 70B - chat model on human benchmarks. Both the base and instruct models are released under the Apache 2.0 license.
$ python main.py
anonymous
Position encoding has recently proven effective in the transformer architecture. It enables valuable supervision for dependency modeling between elements at different positions of the sequence. In this paper, we first investigate various methods to integrate positional information into the learning process of transformer-based language models. Then, we propose a novel method named Rotary Position Embedding (RoPE) to effectively leverage the positional information. Specifically, the proposed RoPE encodes the absolute position with a rotation matrix and meanwhile incorporates the explicit relative position dependency in the self-attention formulation. Notably, RoPE enables valuable properties, including the flexibility of sequence length, decaying inter-token dependency with increasing relative distances, and the capability of equipping the linear self-attention with relative position encoding. Finally, we evaluate the enhanced transformer with rotary position embedding, also called RoFormer, on various long text classification benchmark datasets. Our experiments show that it consistently outperforms its alternatives. Furthermore, we provide a theoretical analysis to explain some experimental results. RoFormer is already integrated into Huggingface: \url{https://huggingface.co/docs/transformers/model_doc/roformer}.
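A NumPy sketch of the rotation, in the common "rotate-half" layout (the paper pairs adjacent dimensions, which is equivalent up to a permutation); base 10000 follows the paper:

```python
# Each pair of feature dimensions is rotated by an angle proportional
# to position, so q.k dot products depend only on relative position.
import numpy as np

def rope(x, base=10000.0):
    seq, dim = x.shape
    half = dim // 2
    freqs = base ** (-np.arange(half) / half)          # per-pair frequencies
    angles = np.arange(seq)[:, None] * freqs[None, :]  # position * frequency
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, :half], x[:, half:]
    return np.concatenate([x1 * cos - x2 * sin,
                           x1 * sin + x2 * cos], axis=-1)

q = rope(np.random.default_rng(0).normal(size=(128, 64)))
```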
$ python main.py
anonymous
Transformers are slow and memory-hungry on long sequences, since the time and memory complexity of self-attention are quadratic in sequence length. Approximate attention methods have attempted to address this problem by trading off model quality to reduce the compute complexity, but often do not achieve wall-clock speedup. We argue that a missing principle is making attention algorithms IO-aware -- accounting for reads and writes between levels of GPU memory. We propose FlashAttention, an IO-aware exact attention algorithm that uses tiling to reduce the number of memory reads/writes between GPU high bandwidth memory (HBM) and GPU on-chip SRAM. We analyze the IO complexity of FlashAttention, showing that it requires fewer HBM accesses than standard attention, and is optimal for a range of SRAM sizes. We also extend FlashAttention to block-sparse attention, yielding an approximate attention algorithm that is faster than any existing approximate attention method. FlashAttention trains Transformers faster than existing baselines: 15% end-to-end wall-clock speedup on BERT-large (seq. length 512) compared to the MLPerf 1.1 training speed record, 3$\times$ speedup on GPT-2 (seq. length 1K), and 2.4$\times$ speedup on long-range arena (seq. length 1K-4K). FlashAttention and block-sparse FlashAttention enable longer context in Transformers, yielding higher quality models (0.7 better perplexity on GPT-2 and 6.4 points of lift on long-document classification) and entirely new capabilities: the first Transformers to achieve better-than-chance performance on the Path-X challenge (seq. length 16K, 61.4% accuracy) and Path-256 (seq. length 64K, 63.1% accuracy).
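A NumPy sketch of the tiling idea with the online-softmax rescaling, ignoring the GPU memory hierarchy that motivates the real algorithm; the block size is illustrative:

```python
# Attention one key/value block at a time, with a running max and
# normalizer, so the full n-by-n score matrix is never materialized.
import numpy as np

def tiled_attention(Q, K, V, block=64):
    n, d = Q.shape
    out = np.zeros_like(Q)
    m = np.full(n, -np.inf)              # running row max
    l = np.zeros(n)                      # running softmax denominator
    for s in range(0, K.shape[0], block):
        Kb, Vb = K[s:s + block], V[s:s + block]
        scores = Q @ Kb.T / np.sqrt(d)
        m_new = np.maximum(m, scores.max(axis=1))
        scale = np.exp(m - m_new)                  # rescale old statistics
        p = np.exp(scores - m_new[:, None])
        out = out * scale[:, None] + p @ Vb
        l = l * scale + p.sum(axis=1)
        m = m_new
    return out / l[:, None]

rng = np.random.default_rng(0)
Q, K, V = (rng.normal(size=(256, 64)) for _ in range(3))
assert np.allclose(tiled_attention(Q, K, V),
                   tiled_attention(Q, K, V, block=256))
```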
$ python main.py
anonymous
Foundation models, now powering most of the exciting applications in deep learning, are almost universally based on the Transformer architecture and its core attention module. Many subquadratic-time architectures such as linear attention, gated convolution and recurrent models, and structured state space models (SSMs) have been developed to address Transformers' computational inefficiency on long sequences, but they have not performed as well as attention on important modalities such as language. We identify that a key weakness of such models is their inability to perform content-based reasoning, and make several improvements. First, simply letting the SSM parameters be functions of the input addresses their weakness with discrete modalities, allowing the model to selectively propagate or forget information along the sequence length dimension depending on the current token. Second, even though this change prevents the use of efficient convolutions, we design a hardware-aware parallel algorithm in recurrent mode. We integrate these selective SSMs into a simplified end-to-end neural network architecture without attention or even MLP blocks (Mamba). Mamba enjoys fast inference (5$\times$ higher throughput than Transformers) and linear scaling in sequence length, and its performance improves on real data up to million-length sequences. As a general sequence model backbone, Mamba achieves state-of-the-art performance across several modalities such as language, audio, and genomics. On language modeling, our Mamba-3B model outperforms Transformers of the same size and matches Transformers twice its size, both in pretraining and downstream evaluation.
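A one-channel toy of the selective recurrence: the discretization step, input gate, and readout all depend on the current input, which is the "selection" mechanism; real Mamba uses per-channel state vectors, ZOH discretization, and a hardware-aware parallel scan:

```python
# Selective SSM toy: h_t = Abar_t * h_{t-1} + Bbar_t * x_t, where the
# discretized parameters are functions of the input (per-token gating).
import numpy as np

def selective_scan(x, W_dt, W_B, W_C, A=-1.0):
    h, ys = 0.0, []
    for xt in x:
        dt = np.log1p(np.exp(W_dt * xt))   # input-dependent step (softplus)
        Abar = np.exp(dt * A)              # discretized decay in (0, 1]
        Bbar = dt * (W_B * xt)             # input-dependent input gate
        h = Abar * h + Bbar * xt           # linear recurrence in h
        ys.append((W_C * xt) * h)          # input-dependent readout
    return np.array(ys)

y = selective_scan(np.sin(np.linspace(0, 6, 50)), 1.0, 0.5, 2.0)
```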
$ python main.py
anonymous
Mastering the game of Go without human knowledge
$ python main.py
anonymous
While large-scale unsupervised language models (LMs) learn broad world knowledge and some reasoning skills, achieving precise control of their behavior is difficult due to the completely unsupervised nature of their training. Existing methods for gaining such steerability collect human labels of the relative quality of model generations and fine-tune the unsupervised LM to align with these preferences, often with reinforcement learning from human feedback (RLHF). However, RLHF is a complex and often unstable procedure, first fitting a reward model that reflects the human preferences, and then fine-tuning the large unsupervised LM using reinforcement learning to maximize this estimated reward without drifting too far from the original model. In this paper we introduce a new parameterization of the reward model in RLHF that enables extraction of the corresponding optimal policy in closed form, allowing us to solve the standard RLHF problem with only a simple classification loss. The resulting algorithm, which we call Direct Preference Optimization (DPO), is stable, performant, and computationally lightweight, eliminating the need for sampling from the LM during fine-tuning or performing significant hyperparameter tuning. Our experiments show that DPO can fine-tune LMs to align with human preferences as well as or better than existing methods. Notably, fine-tuning with DPO exceeds PPO-based RLHF in ability to control sentiment of generations, and matches or improves response quality in summarization and single-turn dialogue while being substantially simpler to implement and train.
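The heart of the method is a single logistic loss on the difference of policy/reference log-ratios for the chosen and rejected responses. A minimal NumPy sketch over per-example sequence log-probabilities (beta = 0.1 is an illustrative value, not a recommendation from this abstract):

import numpy as np

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    # Implicit reward of a response = beta * (log pi(y|x) - log pi_ref(y|x));
    # the loss pushes the chosen (w) reward above the rejected (l) one.
    margin = beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l))
    return np.mean(np.logaddexp(0.0, -margin))   # -log sigmoid(margin), stable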
$ python main.py
anonymous
Making language models bigger does not inherently make them better at following a user's intent. For example, large language models can generate outputs that are untruthful, toxic, or simply not helpful to the user. In other words, these models are not aligned with their users. In this paper, we show an avenue for aligning language models with user intent on a wide range of tasks by fine-tuning with human feedback. Starting with a set of labeler-written prompts and prompts submitted through the OpenAI API, we collect a dataset of labeler demonstrations of the desired model behavior, which we use to fine-tune GPT-3 using supervised learning. We then collect a dataset of rankings of model outputs, which we use to further fine-tune this supervised model using reinforcement learning from human feedback. We call the resulting models InstructGPT. In human evaluations on our prompt distribution, outputs from the 1.3B parameter InstructGPT model are preferred to outputs from the 175B GPT-3, despite having 100x fewer parameters. Moreover, InstructGPT models show improvements in truthfulness and reductions in toxic output generation while having minimal performance regressions on public NLP datasets. Even though InstructGPT still makes simple mistakes, our results show that fine-tuning with human feedback is a promising direction for aligning language models with human intent.
$ python main.py
anonymous
Model-free deep reinforcement learning (RL) algorithms have been demonstrated on a range of challenging decision making and control tasks. However, these methods typically suffer from two major challenges: very high sample complexity and brittle convergence properties, which necessitate meticulous hyperparameter tuning. Both of these challenges severely limit the applicability of such methods to complex, real-world domains. In this paper, we propose soft actor-critic, an off-policy actor-critic deep RL algorithm based on the maximum entropy reinforcement learning framework. In this framework, the actor aims to maximize expected reward while also maximizing entropy; that is, to succeed at the task while acting as randomly as possible. Prior deep RL methods based on this framework have been formulated as Q-learning methods. By combining off-policy updates with a stable stochastic actor-critic formulation, our method achieves state-of-the-art performance on a range of continuous control benchmark tasks, outperforming prior on-policy and off-policy methods. Furthermore, we demonstrate that, in contrast to other off-policy algorithms, our approach is very stable, achieving very similar performance across different random seeds.
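The entropy term appears directly in the Bellman backup. A minimal NumPy sketch of the critic's target value, assuming the clipped double-Q trick used in common SAC implementations (two Q estimates, take the min), which this abstract does not spell out:

import numpy as np

def sac_target(r, q1_next, q2_next, logp_next, gamma=0.99, alpha=0.2, done=0.0):
    # Soft value of the next state: Q minus alpha * log pi, so the policy
    # is rewarded for keeping its entropy high while solving the task.
    v_next = np.minimum(q1_next, q2_next) - alpha * logp_next
    return r + gamma * (1.0 - done) * v_next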
$ python main.py
anonymous
We propose a new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a "surrogate" objective function using stochastic gradient ascent. Whereas standard policy gradient methods perform one gradient update per data sample, we propose a novel objective function that enables multiple epochs of minibatch updates. The new methods, which we call proximal policy optimization (PPO), have some of the benefits of trust region policy optimization (TRPO), but they are much simpler to implement, more general, and have better sample complexity (empirically). Our experiments test PPO on a collection of benchmark tasks, including simulated robotic locomotion and Atari game playing, and we show that PPO outperforms other online policy gradient methods, and overall strikes a favorable balance between sample complexity, simplicity, and wall-time.
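The clipped surrogate is short enough to state directly; a minimal NumPy sketch (the objective is maximized, so a training loop would ascend it or minimize its negative):

import numpy as np

def ppo_clip_objective(logp_new, logp_old, adv, eps=0.2):
    # Probability ratio between current and data-collecting policy; clipping
    # removes the incentive to move the ratio outside [1 - eps, 1 + eps].
    ratio = np.exp(logp_new - logp_old)
    return np.mean(np.minimum(ratio * adv,
                              np.clip(ratio, 1.0 - eps, 1.0 + eps) * adv))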
$ python main.py
anonymous
We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For conditional image synthesis, we further improve sample quality with classifier guidance: a simple, compute-efficient method for trading off diversity for fidelity using gradients from a classifier. We achieve an FID of 2.97 on ImageNet 128$\times$128, 4.59 on ImageNet 256$\times$256, and 7.72 on ImageNet 512$\times$512, and we match BigGAN-deep even with as few as 25 forward passes per sample, all while maintaining better coverage of the distribution. Finally, we find that classifier guidance combines well with upsampling diffusion models, further improving FID to 3.94 on ImageNet 256$\times$256 and 3.85 on ImageNet 512$\times$512. We release our code at https://github.com/openai/guided-diffusion
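The guidance step itself is one line: shift the reverse-diffusion mean by the scaled classifier gradient. A toy NumPy sketch with an analytic logistic "classifier" standing in for a real network (the weight vector w is a placeholder, and sigma2 is the reverse-step variance):

import numpy as np

def guided_mean(mu, sigma2, x_t, w, scale=1.0):
    # mu' = mu + scale * Sigma * grad_x log p(y=1 | x_t), here with
    # p(y=1 | x) = sigmoid(w @ x), whose log-gradient is (1 - p) * w.
    p = 1.0 / (1.0 + np.exp(-(w @ x_t)))
    return mu + scale * sigma2 * (1.0 - p) * w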
$ python main.py
anonymous
We introduce a new paradigm for generative modeling built on Continuous Normalizing Flows (CNFs), allowing us to train CNFs at unprecedented scale. Specifically, we present the notion of Flow Matching (FM), a simulation-free approach for training CNFs based on regressing vector fields of fixed conditional probability paths. Flow Matching is compatible with a general family of Gaussian probability paths for transforming between noise and data samples -- which subsumes existing diffusion paths as specific instances. Interestingly, we find that employing FM with diffusion paths results in a more robust and stable alternative for training diffusion models. Furthermore, Flow Matching opens the door to training CNFs with other, non-diffusion probability paths. An instance of particular interest is using Optimal Transport (OT) displacement interpolation to define the conditional probability paths. These paths are more efficient than diffusion paths, provide faster training and sampling, and result in better generalization. Training CNFs using Flow Matching on ImageNet leads to consistently better performance than alternative diffusion-based methods in terms of both likelihood and sample quality, and allows fast and reliable sample generation using off-the-shelf numerical ODE solvers.
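For the straight-line (OT displacement) conditional path, the regression target is strikingly simple. A minimal NumPy sketch with the sigma_min -> 0 simplification, where v_model is a hypothetical callable for the learned vector field:

import numpy as np

def cfm_loss(v_model, x0, x1, rng):
    # x0: (B, d) noise samples; x1: (B, d) data samples.
    # Path x_t = (1 - t) x0 + t x1 has constant target field u_t = x1 - x0.
    t = rng.uniform(size=(x0.shape[0], 1))
    xt = (1.0 - t) * x0 + t * x1
    return np.mean((v_model(xt, t) - (x1 - x0)) ** 2)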
$ python main.py
anonymous
By decomposing the image formation process into a sequential application of denoising autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image data and beyond. Additionally, their formulation allows for a guiding mechanism to control the image generation process without retraining. However, since these models typically operate directly in pixel space, optimization of powerful DMs often consumes hundreds of GPU days and inference is expensive due to sequential evaluations. To enable DM training on limited computational resources while retaining their quality and flexibility, we apply them in the latent space of powerful pretrained autoencoders. In contrast to previous work, training diffusion models on such a representation allows for the first time to reach a near-optimal point between complexity reduction and detail preservation, greatly boosting visual fidelity. By introducing cross-attention layers into the model architecture, we turn diffusion models into powerful and flexible generators for general conditioning inputs such as text or bounding boxes and high-resolution synthesis becomes possible in a convolutional manner. Our latent diffusion models (LDMs) achieve a new state of the art for image inpainting and highly competitive performance on various tasks, including unconditional image generation, semantic scene synthesis, and super-resolution, while significantly reducing computational requirements compared to pixel-based DMs. Code is available at https://github.com/CompVis/latent-diffusion .
$ python main.py
anonymous
We present high quality image synthesis results using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics. Our best results are obtained by training on a weighted variational bound designed according to a novel connection between diffusion probabilistic models and denoising score matching with Langevin dynamics, and our models naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding. On the unconditional CIFAR10 dataset, we obtain an Inception score of 9.46 and a state-of-the-art FID score of 3.17. On 256x256 LSUN, we obtain sample quality similar to ProgressiveGAN. Our implementation is available at https://github.com/hojonathanho/diffusion
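The simplified training objective reduces to noise regression. A minimal NumPy sketch, where eps_model is a hypothetical callable for the denoising network and alphas_bar is the cumulative product of the noise schedule:

import numpy as np

def ddpm_loss(eps_model, x0, alphas_bar, rng):
    # Sample a timestep and Gaussian noise, form the noised x_t in closed
    # form, and regress the network on the noise it must remove.
    t = rng.integers(0, len(alphas_bar), size=x0.shape[0])
    eps = rng.standard_normal(x0.shape)
    ab = alphas_bar[t][:, None]                     # x0: (B, d)
    xt = np.sqrt(ab) * x0 + np.sqrt(1.0 - ab) * eps
    return np.mean((eps_model(xt, t) - eps) ** 2)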
$ python main.py
anonymous
The recent breakthroughs in natural language processing for model pretraining on large quantities of data have opened the way for similar foundation models in computer vision. These models could greatly simplify the use of images in any system by producing all-purpose visual features, i.e., features that work across image distributions and tasks without finetuning. This work shows that existing pretraining methods, especially self-supervised methods, can produce such features if trained on enough curated data from diverse sources. We revisit existing approaches and combine different techniques to scale our pretraining in terms of data and model size. Most of the technical contributions aim at accelerating and stabilizing the training at scale. In terms of data, we propose an automatic pipeline to build a dedicated, diverse, and curated image dataset instead of uncurated data, as typically done in the self-supervised literature. In terms of models, we train a ViT model (Dosovitskiy et al., 2020) with 1B parameters and distill it into a series of smaller models that surpass the best available all-purpose features, OpenCLIP (Ilharco et al., 2021), on most of the benchmarks at image and pixel levels.
$ python main.py
anonymous
In this paper, we question if self-supervised learning provides new properties to Vision Transformers (ViT) that stand out compared to convolutional networks (convnets). Beyond the fact that adapting self-supervised methods to this architecture works particularly well, we make the following observations: first, self-supervised ViT features contain explicit information about the semantic segmentation of an image, which does not emerge as clearly with supervised ViTs, nor with convnets. Second, these features are also excellent k-NN classifiers, reaching 78.3% top-1 on ImageNet with a small ViT. Our study also underlines the importance of the momentum encoder, multi-crop training, and the use of small patches with ViTs. We implement our findings into a simple self-supervised method, called DINO, which we interpret as a form of self-distillation with no labels. We show the synergy between DINO and ViTs by achieving 80.1% top-1 on ImageNet in linear evaluation with ViT-Base.
$ python main.py
anonymous
State-of-the-art computer vision systems are trained to predict a fixed set of predetermined object categories. This restricted form of supervision limits their generality and usability since additional labeled data is needed to specify any other visual concept. Learning directly from raw text about images is a promising alternative which leverages a much broader source of supervision. We demonstrate that the simple pre-training task of predicting which caption goes with which image is an efficient and scalable way to learn SOTA image representations from scratch on a dataset of 400 million (image, text) pairs collected from the internet. After pre-training, natural language is used to reference learned visual concepts (or describe new ones) enabling zero-shot transfer of the model to downstream tasks. We study the performance of this approach by benchmarking on over 30 different existing computer vision datasets, spanning tasks such as OCR, action recognition in videos, geo-localization, and many types of fine-grained object classification. The model transfers non-trivially to most tasks and is often competitive with a fully supervised baseline without the need for any dataset specific training. For instance, we match the accuracy of the original ResNet-50 on ImageNet zero-shot without needing to use any of the 1.28 million training examples it was trained on. We release our code and pre-trained model weights at https://github.com/OpenAI/CLIP.
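The pre-training task amounts to a symmetric cross-entropy over a batch of (image, text) pairs. A minimal NumPy sketch over already-computed embeddings (the temperature 0.07 is an illustrative value; in the paper it is learned):

import numpy as np

def clip_loss(img, txt, temp=0.07):
    # img, txt: (B, d) embeddings of matched pairs; after normalization the
    # matched similarities sit on the diagonal of the logit matrix.
    img = img / np.linalg.norm(img, axis=1, keepdims=True)
    txt = txt / np.linalg.norm(txt, axis=1, keepdims=True)
    logits = img @ txt.T / temp

    def xent(l):                         # cross-entropy against the diagonal
        l = l - l.max(axis=1, keepdims=True)
        i = np.arange(len(l))
        return np.mean(np.log(np.exp(l).sum(axis=1)) - l[i, i])

    return 0.5 * (xent(logits) + xent(logits.T))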
$ python main.py
anonymous
While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in conjunction with convolutional networks, or used to replace certain components of convolutional networks while keeping their overall structure in place. We show that this reliance on CNNs is not necessary and a pure transformer applied directly to sequences of image patches can perform very well on image classification tasks. When pre-trained on large amounts of data and transferred to multiple mid-sized or small image recognition benchmarks (ImageNet, CIFAR-100, VTAB, etc.), Vision Transformer (ViT) attains excellent results compared to state-of-the-art convolutional networks while requiring substantially fewer computational resources to train.
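The "sequence of image patches" is just a reshape. A minimal NumPy sketch of the patch extraction that precedes the linear projection (patch size 16 follows the ViT-Base/16 convention):

import numpy as np

def patchify(img, patch=16):
    # (H, W, C) image -> (N, patch*patch*C) flattened patches, row-major,
    # with N = (H / patch) * (W / patch); each row is then linearly projected.
    H, W, C = img.shape
    g = img.reshape(H // patch, patch, W // patch, patch, C)
    return g.transpose(0, 2, 1, 3, 4).reshape(-1, patch * patch * C)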
$ python main.py
anonymous
We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget. We find that current large language models are significantly undertrained, a consequence of the recent focus on scaling language models whilst keeping the amount of training data constant. By training over 400 language models ranging from 70 million to over 16 billion parameters on 5 to 500 billion tokens, we find that for compute-optimal training, the model size and the number of training tokens should be scaled equally: for every doubling of model size the number of training tokens should also be doubled. We test this hypothesis by training a predicted compute-optimal model, Chinchilla, that uses the same compute budget as Gopher but with 70B parameters and 4$\times$ more data. Chinchilla uniformly and significantly outperforms Gopher (280B), GPT-3 (175B), Jurassic-1 (178B), and Megatron-Turing NLG (530B) on a large range of downstream evaluation tasks. This also means that Chinchilla uses substantially less compute for fine-tuning and inference, greatly facilitating downstream usage. As a highlight, Chinchilla reaches a state-of-the-art average accuracy of 67.5% on the MMLU benchmark, greater than a 7% improvement over Gopher.
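The headline rule is easy to turn into arithmetic. A rough sketch, assuming the common reading of the paper's results as roughly 20 training tokens per parameter and the standard C ≈ 6ND FLOP estimate:

def chinchilla_optimal(C, tokens_per_param=20.0):
    # C ≈ 6 * N * D with D/N held near ~20 gives N ≈ sqrt(C / 120).
    N = (C / (6.0 * tokens_per_param)) ** 0.5
    return N, tokens_per_param * N        # (parameters, training tokens)

# chinchilla_optimal(5.8e23) -> roughly 7e10 params and 1.4e12 tokens,
# i.e. the 70B-parameter, ~1.4T-token operating point described above.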
$ python main.py
anonymous
In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be a suitable substitute for closed-source models. We provide a detailed description of our approach to fine-tuning and safety improvements of Llama 2-Chat in order to enable the community to build on our work and contribute to the responsible development of LLMs.
$ python main.py
anonymous
We introduce LLaMA, a collection of foundation language models ranging from 7B to 65B parameters. We train our models on trillions of tokens, and show that it is possible to train state-of-the-art models using publicly available datasets exclusively, without resorting to proprietary and inaccessible datasets. In particular, LLaMA-13B outperforms GPT-3 (175B) on most benchmarks, and LLaMA-65B is competitive with the best models, Chinchilla-70B and PaLM-540B. We release all our models to the research community.
$ python main.py
anonymous
Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific fine-tuning datasets of thousands or tens of thousands of examples. By contrast, humans can generally perform a new language task from only a few examples or from simple instructions - something which current NLP systems still largely struggle to do. Here we show that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches. Specifically, we train GPT-3, an autoregressive language model with 175 billion parameters, 10x more than any previous non-sparse language model, and test its performance in the few-shot setting. For all tasks, GPT-3 is applied without any gradient updates or fine-tuning, with tasks and few-shot demonstrations specified purely via text interaction with the model. GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic. At the same time, we also identify some datasets where GPT-3's few-shot learning still struggles, as well as some datasets where GPT-3 faces methodological issues related to training on large web corpora. Finally, we find that GPT-3 can generate samples of news articles which human evaluators have difficulty distinguishing from articles written by humans. We discuss broader societal impacts of this finding and of GPT-3 in general.
$ python main.py
anonymous
We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers. Unlike recent language representation models, BERT is designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers. As a result, the pre-trained BERT model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks, such as question answering and language inference, without substantial task-specific architecture modifications. BERT is conceptually simple and empirically powerful. It obtains new state-of-the-art results on eleven natural language processing tasks, including pushing the GLUE score to 80.5% (7.7% point absolute improvement), MultiNLI accuracy to 86.7% (4.6% absolute improvement), SQuAD v1.1 question answering Test F1 to 93.2 (1.5 point absolute improvement) and SQuAD v2.0 Test F1 to 83.1 (5.1 point absolute improvement).
$ python main.py
anonymous
The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.
$ python main.py
anonymous
In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.
$ python main.py
anonymous
When a large feedforward neural network is trained on a small training set, it typically performs poorly on held-out test data. This "overfitting" is greatly reduced by randomly omitting half of the feature detectors on each training case. This prevents complex co-adaptations in which a feature detector is only helpful in the context of several other specific feature detectors. Instead, each neuron learns to detect a feature that is generally helpful for producing the correct answer given the combinatorially large variety of internal contexts in which it must operate. Random "dropout" gives big improvements on many benchmark tasks and sets new records for speech and object recognition.
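The modern "inverted" formulation of the idea fits in a few lines; note the paper instead halves the weights at test time, which is equivalent in expectation. A minimal NumPy sketch:

import numpy as np

def dropout(x, rng, p=0.5, train=True):
    # Randomly zero each unit with probability p during training, rescaling
    # the survivors by 1/(1 - p) so the expected activation is unchanged.
    if not train:
        return x
    mask = rng.random(x.shape) >= p
    return x * mask / (1.0 - p)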
$ python main.py
anonymous
We introduce Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments. The method is straightforward to implement, is computationally efficient, has low memory requirements, is invariant to diagonal rescaling of the gradients, and is well suited for problems that are large in terms of data and/or parameters. The method is also appropriate for non-stationary objectives and problems with very noisy and/or sparse gradients. The hyper-parameters have intuitive interpretations and typically require little tuning. Some connections to related algorithms, by which Adam was inspired, are discussed. We also analyze the theoretical convergence properties of the algorithm and provide a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework. Empirical results demonstrate that Adam works well in practice and compares favorably to other stochastic optimization methods. Finally, we discuss AdaMax, a variant of Adam based on the infinity norm.
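A minimal NumPy sketch of one Adam update with the bias-corrected moment estimates (the default hyper-parameters follow the values commonly associated with the method):

import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    # First and second moment estimates, exponentially decayed.
    m = b1 * m + (1.0 - b1) * grad
    v = b2 * v + (1.0 - b2) * grad ** 2
    m_hat = m / (1.0 - b1 ** t)          # bias correction; t starts at 1
    v_hat = v / (1.0 - b2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v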
$ python main.py
anonymous
Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates and careful parameter initialization, and makes it notoriously hard to train models with saturating nonlinearities. We refer to this phenomenon as internal covariate shift, and address the problem by normalizing layer inputs. Our method draws its strength from making normalization a part of the model architecture and performing the normalization for each training mini-batch. Batch Normalization allows us to use much higher learning rates and be less careful about initialization. It also acts as a regularizer, in some cases eliminating the need for Dropout. Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin. Using an ensemble of batch-normalized networks, we improve upon the best published result on ImageNet classification: reaching 4.9% top-5 validation error (and 4.8% test error), exceeding the accuracy of human raters.
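A minimal NumPy sketch of the training-mode transform for a fully connected layer (the running statistics used at inference, and their updates, are omitted):

import numpy as np

def batch_norm(x, gamma, beta, eps=1e-5):
    # x: (batch, features). Normalize each feature over the mini-batch,
    # then restore representational power with learned scale and shift.
    mu = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mu) / np.sqrt(var + eps)
    return gamma * x_hat + beta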
$ python main.py
anonymous
Learning representations by back-propagating errors
$ python main.py
anonymous
We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion masks on 11M licensed and privacy respecting images. The model is designed and trained to be promptable, so it can transfer zero-shot to new image distributions and tasks. We evaluate its capabilities on numerous tasks and find that its zero-shot performance is impressive -- often competitive with or even superior to prior fully supervised results. We are releasing the Segment Anything Model (SAM) and corresponding dataset (SA-1B) of 1B masks and 11M images at https://segment-anything.com to foster research into foundation models for computer vision.
$ python main.py
anonymous
Imitation learning provides an efficient way to teach robots dexterous skills; however, learning complex skills robustly and generalizably usually requires large amounts of human demonstrations. To tackle this challenging problem, we present 3D Diffusion Policy (DP3), a novel visual imitation learning approach that incorporates the power of 3D visual representations into diffusion policies, a class of conditional action generative models. The core design of DP3 is the utilization of a compact 3D visual representation, extracted from sparse point clouds with an efficient point encoder. In our experiments involving 72 simulation tasks, DP3 successfully handles most tasks with just 10 demonstrations and surpasses baselines with a 24.2% relative improvement. In 4 real robot tasks, DP3 demonstrates precise control with a high success rate of 85%, given only 40 demonstrations of each task, and shows excellent generalization abilities in diverse aspects, including space, viewpoint, appearance, and instance. Interestingly, in real robot experiments, DP3 rarely violates safety requirements, in contrast to baseline methods which frequently do, necessitating human intervention. Our extensive evaluation highlights the critical importance of 3D representations in real-world robot learning. Videos, code, and data are available at https://3d-diffusion-policy.github.io.
$ python main.py
anonymous
Fine manipulation tasks, such as threading cable ties or slotting a battery, are notoriously difficult for robots because they require precision, careful coordination of contact forces, and closed-loop visual feedback. Performing these tasks typically requires high-end robots, accurate sensors, or careful calibration, which can be expensive and difficult to set up. Can learning enable low-cost and imprecise hardware to perform these fine manipulation tasks? We present a low-cost system that performs end-to-end imitation learning directly from real demonstrations, collected with a custom teleoperation interface. Imitation learning, however, presents its own challenges, particularly in high-precision domains: errors in the policy can compound over time, and human demonstrations can be non-stationary. To address these challenges, we develop a simple yet novel algorithm, Action Chunking with Transformers (ACT), which learns a generative model over action sequences. ACT allows the robot to learn 6 difficult tasks in the real world, such as opening a translucent condiment cup and slotting a battery with 80-90% success, with only 10 minutes worth of demonstrations. Project website: https://tonyzhaozh.github.io/aloha/
$ python main.py
anonymous
We study how vision-language models trained on Internet-scale data can be incorporated directly into end-to-end robotic control to boost generalization and enable emergent semantic reasoning. Our goal is to enable a single end-to-end trained model to both learn to map robot observations to actions and enjoy the benefits of large-scale pretraining on language and vision-language data from the web. To this end, we propose to co-fine-tune state-of-the-art vision-language models on both robotic trajectory data and Internet-scale vision-language tasks, such as visual question answering. In contrast to other approaches, we propose a simple, general recipe to achieve this goal: in order to fit both natural language responses and robotic actions into the same format, we express the actions as text tokens and incorporate them directly into the training set of the model in the same way as natural language tokens. We refer to this category of models as vision-language-action models (VLA) and instantiate an example of such a model, which we call RT-2. Our extensive evaluation (6k evaluation trials) shows that our approach leads to performant robotic policies and enables RT-2 to obtain a range of emergent capabilities from Internet-scale training. This includes significantly improved generalization to novel objects, the ability to interpret commands not present in the robot training data (such as placing an object onto a particular number or icon), and the ability to perform rudimentary reasoning in response to user commands (such as picking up the smallest or largest object, or the one closest to another object). We further show that incorporating chain of thought reasoning allows RT-2 to perform multi-stage semantic reasoning, for example figuring out which object to pick up for use as an improvised hammer (a rock), or which type of drink is best suited for someone who is tired (an energy drink).
$ python main.py
anonymous
By transferring knowledge from large, diverse, task-agnostic datasets, modern machine learning models can solve specific downstream tasks either zero-shot or with small task-specific datasets to a high level of performance. While this capability has been demonstrated in other fields such as computer vision, natural language processing or speech recognition, it remains to be shown in robotics, where the generalization capabilities of the models are particularly critical due to the difficulty of collecting real-world robotic data. We argue that one of the keys to the success of such general robotic models lies with open-ended task-agnostic training, combined with high-capacity architectures that can absorb all of the diverse robotic data. In this paper, we present a model class, dubbed Robotics Transformer, that exhibits promising scalable model properties. We verify our conclusions in a study of different model classes and their ability to generalize as a function of the data size, model size, and data diversity based on a large-scale data collection on real robots performing real-world tasks. The project's website and videos can be found at robotics-transformer1.github.io
$ python main.py
anonymous
We present a new optimization-based approach for robotic motion planning among obstacles. Like CHOMP (Covariant Hamiltonian Optimization for Motion Planning), our algorithm can be used to find collision-free trajectories from naïve, straight-line initializations that might be in collision. At the core of our approach are (a) a sequential convex optimization procedure, which penalizes collisions with a hinge loss and increases the penalty coefficients in an outer loop as necessary, and (b) an efficient formulation of the no-collisions constraint that directly considers continuous-time safety. Our algorithm is implemented in a software package called TrajOpt. We report results from a series of experiments comparing TrajOpt with CHOMP and randomized planners from OMPL, with regard to planning time and path quality. We consider motion planning for 7 DOF robot arms, 18 DOF full-body robots, statically stable walking motion for the 34 DOF Atlas humanoid robot, and physical experiments with the 18 DOF PR2. We also apply TrajOpt to plan curvature-constrained steerable needle trajectories in the SE(3) configuration space and multiple non-intersecting curved channels within 3D-printed implants for intracavitary brachytherapy. Details, videos, and source code are freely available at: http://rll.berkeley.edu/trajopt/ijrr .
$ python main.py
anonymous
Minimum snap trajectory generation and control for quadrotors
$ python main.py
anonymous
In this paper, we propose a locomotion training framework where a control policy and a state estimator are trained concurrently. The framework consists of a policy network which outputs the desired joint positions and a state estimation network which outputs estimates of the robot's states, such as the base linear velocity, foot height, and contact probability. We exploit a fast simulation environment to train the networks and the trained networks are transferred to the real robot. The trained policy and state estimator are capable of traversing diverse terrains such as a hill, slippery plate, and bumpy road. We also demonstrate that the learned policy can run at up to 3.75 m/s on normal flat ground and 3.54 m/s on a slippery plate with a coefficient of friction of 0.22.
$ python main.py
anonymous
Designing agile locomotion for quadruped robots often requires extensive expertise and tedious manual tuning. In this paper, we present a system to automate this process by leveraging deep reinforcement learning techniques. Our system can learn quadruped locomotion from scratch using simple reward signals. In addition, users can provide an open loop reference to guide the learning process when more control over the learned gait is needed. The control policies are learned in a physics simulator and then deployed on real robots. In robotics, policies trained in simulation often do not transfer to the real world. We narrow this reality gap by improving the physics simulator and learning robust policies. We improve the simulation using system identification, developing an accurate actuator model and simulating latency. We learn robust controllers by randomizing the physical environments, adding perturbations and designing a compact observation space. We evaluate our system on two agile locomotion gaits: trotting and galloping. After learning in simulation, a quadruped robot can successfully perform both gaits in the real world.
$ python main.py
anonymous
We describe an iterative procedure for optimizing policies, with guaranteed monotonic improvement. By making several approximations to the theoretically-justified procedure, we develop a practical algorithm, called Trust Region Policy Optimization (TRPO). This algorithm is similar to natural policy gradient methods and is effective for optimizing large nonlinear policies such as neural networks. Our experiments demonstrate its robust performance on a wide variety of tasks: learning simulated robotic swimming, hopping, and walking gaits; and playing Atari games using images of the screen as input. Despite its approximations that deviate from the theory, TRPO tends to give monotonic improvement, with little tuning of hyperparameters.
$ python main.py
anonymous
Exploration in environments with sparse rewards has been a persistent problem in reinforcement learning (RL). Many tasks are natural to specify with a sparse reward, and manually shaping a reward function can result in suboptimal performance. However, finding a non-zero reward is exponentially more difficult with increasing task horizon or action dimensionality. This puts many real-world tasks out of practical reach of RL methods. In this work, we use demonstrations to overcome the exploration problem and successfully learn to perform long-horizon, multi-step robotics tasks with continuous control such as stacking blocks with a robot arm. Our method, which builds on top of Deep Deterministic Policy Gradients and Hindsight Experience Replay, provides an order-of-magnitude speedup over RL on simulated robotics tasks. It is simple to implement and makes only the additional assumption that we can collect a small set of demonstrations. Furthermore, our method is able to solve tasks not solvable by either RL or behavior cloning alone, and often ends up outperforming the demonstrator policy.
$ python main.py
anonymous
Sequential prediction problems such as imitation learning, where future observations depend on previous predictions (actions), violate the common i.i.d. assumptions made in statistical learning. This leads to poor performance in theory and often in practice. Some recent approaches provide stronger guarantees in this setting, but remain somewhat unsatisfactory as they train either non-stationary or stochastic policies and require a large number of iterations. In this paper, we propose a new iterative algorithm, which trains a stationary deterministic policy, that can be seen as a no regret algorithm in an online learning setting. We show that any such no regret algorithm, combined with additional reduction assumptions, must find a policy with good performance under the distribution of observations it induces in such sequential settings. We demonstrate that this new approach outperforms previous approaches on two challenging imitation learning problems and a benchmark sequence labeling problem.
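The algorithm being described is DAgger, and its loop structure is compact. A schematic Python sketch under hypothetical interfaces: rollout(policy) returns the states that policy visits, expert(s) returns the expert's action, and fit(dataset) performs supervised learning:

def dagger(rollout, expert, fit, n_iters=10):
    # Key idea: the expert labels states induced by the *learner's* own
    # policy, so training and test state distributions match over time.
    data = [(s, expert(s)) for s in rollout(expert)]   # initial expert data
    policy = fit(data)
    for _ in range(n_iters):
        data += [(s, expert(s)) for s in rollout(policy)]
        policy = fit(data)                              # train on aggregate
    return policy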
$ python main.py
anonymous
Prompt-based learning has emerged as a successful paradigm in natural language processing, where a single general-purpose language model can be instructed to perform any task specified by input prompts. Yet task specification in robotics comes in various forms, such as imitating one-shot demonstrations, following language instructions, and reaching visual goals. They are often considered different tasks and tackled by specialized models. We show that a wide spectrum of robot manipulation tasks can be expressed with multimodal prompts, interleaving textual and visual tokens. Accordingly, we develop a new simulation benchmark that consists of thousands of procedurally-generated tabletop tasks with multimodal prompts, 600K+ expert trajectories for imitation learning, and a four-level evaluation protocol for systematic generalization. We design a transformer-based robot agent, VIMA, that processes these prompts and outputs motor actions autoregressively. VIMA features a recipe that achieves strong model scalability and data efficiency. It outperforms alternative designs in the hardest zero-shot generalization setting by up to $2.9\times$ task success rate given the same training data. With $10\times$ less training data, VIMA still performs $2.7\times$ better than the best competing variant. Code and video demos are available at https://vimalabs.github.io/
$ python main.py
anonymous
Using synthetic data for training deep neural networks for robotic manipulation holds the promise of an almost unlimited amount of pre-labeled training data, generated safely out of harm's way. One of the key challenges of synthetic data, to date, has been to bridge the so-called reality gap, so that networks trained on synthetic data operate correctly when exposed to real-world data. We explore the reality gap in the context of 6-DoF pose estimation of known objects from a single RGB image. We show that for this problem the reality gap can be successfully spanned by a simple combination of domain randomized and photorealistic data. Using synthetic data generated in this manner, we introduce a one-shot deep neural network that is able to perform competitively against a state-of-the-art network trained on a combination of real and synthetic data. To our knowledge, this is the first deep network trained only on synthetic data that is able to achieve state-of-the-art performance on 6-DoF object pose estimation. Our network also generalizes better to novel environments including extreme lighting conditions, for which we show qualitative results. Using this network we demonstrate a real-time system estimating object poses with sufficient accuracy for real-world semantic grasping of known household objects in clutter by a real robot.
$ python main.py
anonymous
To reduce data collection time for deep learning of robust robotic grasp plans, we explore training from a synthetic dataset of 6.7 million point clouds, grasps, and analytic grasp metrics generated from thousands of 3D models from Dex-Net 1.0 in randomized poses on a table. We use the resulting dataset, Dex-Net 2.0, to train a Grasp Quality Convolutional Neural Network (GQ-CNN) model that rapidly predicts the probability of success of grasps from depth images, where grasps are specified as the planar position, angle, and depth of a gripper relative to an RGB-D sensor. Experiments with over 1,000 trials on an ABB YuMi comparing grasp planning methods on singulated objects suggest that a GQ-CNN trained with only synthetic data from Dex-Net 2.0 can be used to plan grasps in 0.8sec with a success rate of 93% on eight known objects with adversarial geometry and is 3x faster than registering point clouds to a precomputed dataset of objects and indexing grasps. The Dex-Net 2.0 grasp planner also has the highest success rate on a dataset of 10 novel rigid objects and achieves 99% precision (one false positive out of 69 grasps classified as robust) on a dataset of 40 novel household objects, some of which are articulated or deformable. Code, datasets, videos, and supplementary material are available at http://berkeleyautomation.github.io/dex-net .
$ python main.py
anonymous
Incorporating in-situ force sensing capabilities in a magnetic microrobot
$ python main.py
anonymous
A monocular visual-inertial system (VINS), consisting of a camera and a low-cost inertial measurement unit (IMU), forms the minimum sensor suite for metric six degrees-of-freedom (DOF) state estimation. However, the lack of direct distance measurement poses significant challenges in terms of IMU processing, estimator initialization, extrinsic calibration, and nonlinear optimization. In this work, we present VINS-Mono: a robust and versatile monocular visual-inertial state estimator. Our approach starts with a robust procedure for estimator initialization and failure recovery. A tightly-coupled, nonlinear optimization-based method is used to obtain high accuracy visual-inertial odometry by fusing pre-integrated IMU measurements and feature observations. A loop detection module, in combination with our tightly-coupled formulation, enables relocalization with minimum computation overhead. We additionally perform four degrees-of-freedom pose graph optimization to enforce global consistency. We validate the performance of our system on public datasets and real-world experiments and compare against other state-of-the-art algorithms. We also perform onboard closed-loop autonomous flight on the MAV platform and port the algorithm to an iOS-based demonstration. We highlight that the proposed work is a reliable, complete, and versatile system that is applicable for different applications that require high accuracy localization. We open source our implementations for both PCs and iOS mobile devices.
$ python main.py
anonymous
Reduced-order models and controllers for continuous-time stochastic systems: an information theory approach
$ python main.py
anonymous
Associating Uncertainty With Three-Dimensional Poses for Use in Estimation Problems
$ python main.py
anonymous
We present an information theoretic approach to stochastic optimal control problems that can be used to derive general sampling based optimization schemes. This new mathematical method is used to develop a sampling based model predictive control algorithm. We apply this information theoretic model predictive control (IT-MPC) scheme to the task of aggressive autonomous driving around a dirt test track, and compare its performance to a model predictive control version of the cross-entropy method.
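A minimal NumPy sketch of the sampling-based update at the core of such a controller: perturb the nominal control sequence, weight each rollout by its exponentiated cost, and average. dynamics and cost are hypothetical callables, and the control-cost coupling term of the full derivation is omitted:

import numpy as np

def mppi_update(dynamics, cost, x0, U, n_samples=256, sigma=0.5, lam=1.0,
                rng=None):
    # U: (H, m) nominal controls over horizon H with m-dimensional inputs.
    rng = rng or np.random.default_rng()
    H, m = U.shape
    noise = rng.standard_normal((n_samples, H, m)) * sigma
    costs = np.zeros(n_samples)
    for k in range(n_samples):
        x = x0
        for t in range(H):
            x = dynamics(x, U[t] + noise[k, t])
            costs[k] += cost(x)
    w = np.exp(-(costs - costs.min()) / lam)   # information-theoretic weights
    w /= w.sum()
    return U + np.einsum("k,khm->hm", w, noise)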
$ python main.py
anonymous
Probabilistic roadmaps for path planning in high-dimensional configuration spaces
$ python main.py
anonymous
During the last decade, sampling-based path planning algorithms, such as probabilistic roadmaps (PRM) and rapidly exploring random trees (RRT), have been shown to work well in practice and possess theoretical guarantees such as probabilistic completeness. However, little effort has been devoted to the formal analysis of the quality of the solution returned by such algorithms, e.g. as a function of the number of samples. The purpose of this paper is to fill this gap, by rigorously analyzing the asymptotic behavior of the cost of the solution returned by stochastic sampling-based algorithms as the number of samples increases. A number of negative results are provided, characterizing existing algorithms, e.g. showing that, under mild technical conditions, the cost of the solution returned by broadly used sampling-based algorithms converges almost surely to a non-optimal value. The main contribution of the paper is the introduction of new algorithms, namely PRM* and RRT*, which are provably asymptotically optimal, i.e. such that the cost of the returned solution converges almost surely to the optimum. Moreover, it is shown that the computational complexity of the new algorithms is within a constant factor of that of their probabilistically complete (but not asymptotically optimal) counterparts. The analysis in this paper hinges on novel connections between stochastic sampling-based path planning algorithms and the theory of random geometric graphs.
$ python main.py
anonymous
We propose a novel direct sparse visual odometry formulation. It combines a fully direct probabilistic model (minimizing a photometric error) with consistent, joint optimization of all model parameters, including geometry -- represented as inverse depth in a reference frame -- and camera motion. This is achieved in real time by omitting the smoothness prior used in other direct methods and instead sampling pixels evenly throughout the images. Since our method does not depend on keypoint detectors or descriptors, it can naturally sample pixels from across all image regions that have intensity gradient, including edges or smooth intensity variations on mostly white walls. The proposed model integrates a full photometric calibration, accounting for exposure time, lens vignetting, and non-linear response functions. We thoroughly evaluate our method on three different datasets comprising several hours of video. The experiments show that the presented approach significantly outperforms state-of-the-art direct and indirect methods in a variety of real-world settings, both in terms of tracking accuracy and robustness.
$ python main.py
anonymous
Simultaneous localization and mapping: part I
$ python main.py
anonymous
Intelligent control algorithms for robotic-assisted beating heart surgery
$ python main.py
anonymous
ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras
$ python main.py
anonymous
ORB-SLAM: A Versatile and Accurate Monocular SLAM System
$ python main.py
anonymous
Deep residual networks (ResNets) have demonstrated better generalization performance than deep feedforward networks (FFNets). However, the theory behind such a phenomenon is still largely unknown. This paper studies this fundamental problem in deep learning from a so-called "neural tangent kernel" perspective. Specifically, we first show that under proper conditions, as the width goes to infinity, training deep ResNets can be viewed as learning reproducing kernel functions with some kernel function. We then compare the kernel of deep ResNets with that of deep FFNets and discover that the class of functions induced by the kernel of FFNets is asymptotically not learnable, as the depth goes to infinity. In contrast, the class of functions induced by the kernel of ResNets does not exhibit such degeneracy. Our discovery partially justifies the advantages of deep ResNets over deep FFNets in generalization abilities. Numerical results are provided to support our claim.
$ python main.py
anonymous
Suppose that $M$ is a Kähler manifold with a pole such that its holomorphic sectional curvature is bounded from below by a constant and its radial sectional curvature is also bounded from below. Suppose that $N$ is a strongly pseudoconvex complex Finsler manifold such that its holomorphic sectional curvature is bounded from above by a negative constant. In this paper, we establish a Schwarz lemma for holomorphic mappings $f$ from $M$ into $N$. As applications, we obtain a Liouville type rigidity result for holomorphic mappings $f$ from $M$ into $N$, as well as a rigidity theorem for bimeromorphic mappings from a compact complex manifold into a compact complex Finsler manifold.
$ python main.py
anonymous
Wide-Area Monitoring, Protection, and Control of Future Electric Power Networks
$ python main.py
anonymous
This paper investigates the design and performance of delayed bit-interleaved coded modulation (DBICM) with low-density parity-check (LDPC) codes. For Gray labeled square $M$-ary quadrature amplitude modulation (QAM) constellations, we investigate the optimal delay scheme with the largest spectrum efficiency of DBICM for a fixed maximum number of delayed time slots and a given signal-to-noise ratio. When analyzing the capacity of DBICM, we find two important properties: the capacity improvements due to delayed coded bits being mapped to the real and imaginary parts of the transmitted symbols are independent of each other; a pair of delay schemes with delayed coded bits having identical bit-channel capacity lead to equivalent DBICM capacity. Using these two properties, we efficiently optimize the delay scheme for any uniform Gray-QAM systems. Furthermore, these two properties enable efficient LDPC code designs regarding unequal error protection via bit-channel type classifications. Moreover, we use protograph-based extrinsic information transfer charts to jointly optimize degree distributions and channel assignments of LDPC codes and propose a constrained progressive-edge-growth-like algorithm to jointly construct LDPC codes and bit-interleavers for DBICM, taking the distinct bit-channel capacities into account. Simulation results demonstrate that the designed LDPC coded DBICM systems significantly outperform LDPC coded BICM systems.
$ python main.py
anonymous
Numerical solution of initial boundary value problems involving maxwell's equations in isotropic media
$ python main.py
anonymous
Autonomous Demand-Side Management Based on Game-Theoretic Energy Consumption Scheduling for the Future Smart Grid
$ python main.py
anonymous
Hierarchical Control of Droop-Controlled AC and DC Microgrids—A General Approach Toward Standardization
$ python main.py
anonymous
MicroGrids
$ python main.py
anonymous
Extended Kalman filtering for battery management systems of LiPB-based HEV battery packs
$ python main.py
anonymous
Modeling of Galvanostatic Charge and Discharge of the Lithium/Polymer/Insertion Cell
$ python main.py
sayonsom
Resiliency-Driven Proactive Distribution System Reconfiguration With Synchrophasor Data
$ python main.py
anonymous
Climate change impacts on wind energy: A review
$ python main.py
anonymous
Comparison of Photovoltaic Array Maximum Power Point Tracking Techniques
$ python main.py
anonymous
PerCom 2008 advertisement
$ python main.py
sayonsom
Defining and Enabling Resiliency of Electric Distribution Systems With Multiple Microgrids
$ python main.py
anonymous
False data injection attacks against state estimation in electric power grids
$ python main.py
anonymous
VSC-Based HVDC Power Transmission Systems: An Overview
$ python main.py
sayonsom
A Novel Metric to Quantify and Enable Resilient Distribution System Using Graph Theory and Choquet Integral
$ python main.py
anonymous
We construct orthonormal bases of compactly supported wavelets, with arbitrarily high regularity. The order of regularity increases linearly with the support width. We start by reviewing the concept of multiresolution analysis as well as several algorithms in vision decomposition and reconstruction. The construction then follows from a synthesis of these different approaches.
$ python main.py
anonymous
An algorithm for the machine calculation of complex Fourier series
$ python main.py
anonymous
Smoothing and Differentiation of Data by Simplified Least Squares Procedures.
$ python main.py
anonymous
The classical filtering and prediction problem is re-examined using the Bode-Shannon representation of random processes and the “state-transition” method of analysis of dynamic systems. New results are: (1) The formulation and methods of solution of the problem apply without modification to stationary and nonstationary statistics and to growing-memory and infinite-memory filters. (2) A nonlinear difference (or differential) equation is derived for the covariance matrix of the optimal estimation error. From the solution of this equation the coefficients of the difference (or differential) equation of the optimal linear filter are obtained without further calculations. (3) The filtering problem is shown to be the dual of the noise-free regulator problem. The new method developed here is applied to two well-known problems, confirming and extending earlier results. The discussion is largely self-contained and proceeds from first principles; basic concepts of the theory of random processes are reviewed in the Appendix.
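In modern notation the recursion is the Kalman filter's predict/update pair; a minimal NumPy sketch for a linear-Gaussian model x' = Fx + w, z = Hx + v (the covariance recursion below is the nonlinear difference equation mentioned in the abstract):

import numpy as np

def kalman_step(x, P, z, F, H, Q, R):
    # Predict through the state-transition model.
    x = F @ x
    P = F @ P @ F.T + Q
    # Update with the measurement z.
    S = H @ P @ H.T + R                      # innovation covariance
    K = P @ H.T @ np.linalg.inv(S)           # optimal gain
    x = x + K @ (z - H @ x)
    P = (np.eye(len(x)) - K @ H) @ P
    return x, P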
$ python main.py
anonymous
Low-dimensional embeddings of nodes in large graphs have proved extremely useful in a variety of prediction tasks, from content recommendation to identifying protein functions. However, most existing approaches require that all nodes in the graph are present during training of the embeddings; these previous approaches are inherently transductive and do not naturally generalize to unseen nodes. Here we present GraphSAGE, a general, inductive framework that leverages node feature information (e.g., text attributes) to efficiently generate node embeddings for previously unseen data. Instead of training individual embeddings for each node, we learn a function that generates embeddings by sampling and aggregating features from a node's local neighborhood. Our algorithm outperforms strong baselines on three inductive node-classification benchmarks: we classify the category of unseen nodes in evolving information graphs based on citation and Reddit post data, and we show that our algorithm generalizes to completely unseen graphs using a multi-graph dataset of protein-protein interactions.
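A minimal NumPy sketch of one layer with the mean aggregator, in a common simplified variant (separate self and neighbor projections rather than the paper's concatenation); because the layer is a function of features rather than a per-node embedding table, it applies unchanged to nodes unseen during training:

import numpy as np

def sage_mean_layer(H, neighbors, W_self, W_neigh):
    # H: (n, d) node features; neighbors[v]: list of sampled neighbor ids.
    agg = np.stack([H[nb].mean(axis=0) if len(nb) else np.zeros(H.shape[1])
                    for nb in neighbors])
    out = np.maximum(H @ W_self + agg @ W_neigh, 0.0)   # ReLU
    norm = np.linalg.norm(out, axis=1, keepdims=True)
    return out / np.maximum(norm, 1e-12)                # l2-normalize rows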
$ python main.py
anonymous
We present a scalable approach for semi-supervised learning on graph-structured data that is based on an efficient variant of convolutional neural networks which operate directly on graphs. We motivate the choice of our convolutional architecture via a localized first-order approximation of spectral graph convolutions. Our model scales linearly in the number of graph edges and learns hidden layer representations that encode both local graph structure and features of nodes. In a number of experiments on citation networks and on a knowledge graph dataset we demonstrate that our approach outperforms related methods by a significant margin.
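The propagation rule with the renormalization trick is a one-liner per layer. A minimal dense NumPy sketch (real implementations use sparse matrices to obtain the linear scaling in edges):

import numpy as np

def gcn_layer(A, H, W):
    # H' = ReLU( D^{-1/2} (A + I) D^{-1/2} H W )
    A_hat = A + np.eye(len(A))               # add self-loops
    d = A_hat.sum(axis=1)
    D_inv_sqrt = np.diag(d ** -0.5)
    return np.maximum(D_inv_sqrt @ A_hat @ D_inv_sqrt @ H @ W, 0.0)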
$ python main.py
anonymous
Many networks of interest in the sciences, including social networks, computer networks, and metabolic and regulatory networks, are found to divide naturally into communities or modules. The problem of detecting and characterizing this community structure is one of the outstanding issues in the study of networked systems. One highly effective approach is the optimization of the quality function known as “modularity” over the possible divisions of a network. Here I show that the modularity can be expressed in terms of the eigenvectors of a characteristic matrix for the network, which I call the modularity matrix, and that this expression leads to a spectral algorithm for community detection that returns results of demonstrably higher quality than competing methods in shorter running times. I illustrate the method with applications to several published network data sets.
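A minimal NumPy sketch of the resulting spectral bisection: form the modularity matrix, take its leading eigenvector, and split vertices by sign (dense linear algebra here, whereas the paper exploits the structure of B for speed):

import numpy as np

def spectral_bisection(A):
    # A: symmetric adjacency matrix. B = A - k k^T / 2m is the modularity
    # matrix; the signs of its leading eigenvector define the two communities.
    k = A.sum(axis=1)
    two_m = k.sum()
    B = A - np.outer(k, k) / two_m
    vals, vecs = np.linalg.eigh(B)           # ascending eigenvalues
    return vecs[:, -1] >= 0                  # boolean community labels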
$ python main.py
anonymous
Normalized cuts and image segmentation
$ python main.py
anonymous
The problem discussed in this paper was formulated by T. Harris as follows: “Consider a rail network connecting two cities by way of a number of intermediate cities, where each link of the network has a number assigned to it representing its capacity. Assuming a steady state condition, find a maximal flow from one given city to the other.”
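One standard way to compute such a maximal flow is to augment along shortest paths in the residual network (the Edmonds-Karp refinement of the Ford-Fulkerson idea). The sketch below assumes a small capacity matrix standing in for the rail network; the example numbers are mine.

from collections import deque

def max_flow(cap, s, t):
    """Augmenting-path max flow with BFS on a dense capacity matrix."""
    n = len(cap)
    flow = [[0] * n for _ in range(n)]
    total = 0
    while True:
        # BFS for a shortest augmenting path in the residual network.
        parent = [-1] * n
        parent[s] = s
        q = deque([s])
        while q and parent[t] == -1:
            u = q.popleft()
            for v in range(n):
                if parent[v] == -1 and cap[u][v] - flow[u][v] > 0:
                    parent[v] = u
                    q.append(v)
        if parent[t] == -1:
            return total                       # no augmenting path: flow is maximal
        # Bottleneck residual capacity along the path, then augment.
        bottleneck, v = float("inf"), t
        while v != s:
            u = parent[v]
            bottleneck = min(bottleneck, cap[u][v] - flow[u][v])
            v = u
        v = t
        while v != s:
            u = parent[v]
            flow[u][v] += bottleneck
            flow[v][u] -= bottleneck
            v = u
        total += bottleneck

# Two cities (0 = source, 4 = sink) linked through intermediate nodes.
cap = [[0, 3, 2, 0, 0], [0, 0, 1, 3, 0], [0, 0, 0, 0, 2], [0, 0, 0, 0, 2], [0, 0, 0, 0, 0]]
print(max_flow(cap, 0, 4))  # 4, matching the capacity of the cut into the sink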
$ python main.py
anonymous
A note on two problems in connexion with graphs
$ python main.py
anonymous
Causal inference in statistics: An overview
$ python main.py
anonymous
Estimating causal effects of treatments in randomized and nonrandomized studies.
$ python main.py
anonymous
The analysis of censored failure times is considered. It is assumed that on each individual are available values of one or more explanatory variables. The hazard function (age-specific failure rate) is taken to be a function of the explanatory variables and unknown regression coefficients multiplied by an arbitrary and unknown function of time. A conditional likelihood is obtained, leading to inferences about the unknown regression coefficients. Some generalizations are outlined.
$ python main.py
anonymous
Bootstrap Methods: Another Look at the Jackknife
$ python main.py
anonymous
We consider the problem of comparing complex hierarchical models in which the number of parameters is not clearly defined. Using an information theoretic argument we derive a measure pD for the effective number of parameters in a model as the difference between the posterior mean of the deviance and the deviance at the posterior means of the parameters of interest. In general pD approximately corresponds to the trace of the product of Fisher's information and the posterior covariance, which in normal models is the trace of the ‘hat’ matrix projecting observations onto fitted values. Its properties in exponential families are explored. The posterior mean deviance is suggested as a Bayesian measure of fit or adequacy, and the contributions of individual observations to the fit and complexity can give rise to a diagnostic plot of deviance residuals against leverages. Adding pD to the posterior mean deviance gives a deviance information criterion for comparing models, which is related to other information criteria and has an approximate decision theoretic justification. The procedure is illustrated in some examples, and comparisons are drawn with alternative Bayesian and classical proposals. Throughout it is emphasized that the quantities required are trivial to compute in a Markov chain Monte Carlo analysis.
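The quantities are indeed trivial to compute from MCMC output. A minimal sketch, assuming posterior draws and a user-supplied deviance function for a toy normal-mean model; the "posterior" here is an analytic stand-in for a real MCMC run, and all names are illustrative.

import numpy as np

def dic(theta_samples, deviance):
    """DIC from posterior draws. deviance(theta) = -2 * log-likelihood."""
    D = np.array([deviance(t) for t in theta_samples])
    D_bar = D.mean()                               # posterior mean deviance (fit)
    D_hat = deviance(theta_samples.mean(axis=0))   # deviance at the posterior mean
    p_D = D_bar - D_hat                            # effective number of parameters
    return D_bar + p_D, p_D

rng = np.random.default_rng(0)
y = rng.normal(1.0, 1.0, 50)                       # data, known unit variance
post = rng.normal(y.mean(), 1.0 / np.sqrt(len(y)), size=(2000, 1))
dev = lambda mu: float(np.sum((y - mu) ** 2))      # -2 log L up to a constant
print(dic(post, dev))                              # p_D should be close to 1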
$ python main.py
anonymous
Sampling-Based Approaches to Calculating Marginal Densities
$ python main.py
anonymous
We study convex optimization problems for which the data is not specified exactly and it is only known to belong to a given uncertainty set U, yet the constraints must hold for all possible values of the data from U. The ensuing optimization problem is called robust optimization. In this paper we lay the foundation of robust convex optimization. In the main part of the paper we show that if U is an ellipsoidal uncertainty set, then for some of the most important generic convex optimization problems (linear programming, quadratically constrained programming, semidefinite programming and others) the corresponding robust convex program is either exactly, or approximately, a tractable problem which lends itself to efficient algorithms such as polynomial time interior point methods.
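To make the ellipsoidal case concrete: for a single linear constraint a^T x <= b with a ranging over the ellipsoid {a_bar + P u : ||u||_2 <= 1}, the robust counterpart is the second-order cone constraint a_bar^T x + ||P^T x||_2 <= b. A small sketch using cvxpy; the problem data below are made up for illustration.

import cvxpy as cp
import numpy as np

n = 4
c = -np.ones(n)                        # maximize sum(x) via minimizing -sum(x)
a_bar = np.array([1.0, 2.0, 1.0, 0.5]) # nominal constraint coefficients
P = 0.2 * np.eye(n)                    # shape of the ellipsoidal uncertainty
b = 10.0

x = cp.Variable(n, nonneg=True)
# Robust counterpart of a^T x <= b for all a in the ellipsoid: an SOC constraint.
constraints = [a_bar @ x + cp.norm(P.T @ x, 2) <= b]
prob = cp.Problem(cp.Minimize(c @ x), constraints)
prob.solve()
print(x.value.round(3), round(prob.value, 3))

The reformulated problem is a second-order cone program, which is exactly the kind of tractable robust counterpart (solvable by interior point methods) the abstract refers to.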
$ python main.py
anonymous
State-space solutions to standard H2 and H∞ control problems
$ python main.py
anonymous
Identification and control of dynamical systems using neural networks
$ python main.py
anonymous
An adaptive controller which provides Lyapunov stability
$ python main.py
anonymous
Coordination of groups of mobile autonomous agents using nearest neighbor rules
$ python main.py
anonymous
Consensus Problems in Networks of Agents With Switching Topology and Time-Delays
$ python main.py
anonymous
Automatic tuning of simple regulators with specifications on phase and amplitude margins
$ python main.py
anonymous
In this paper, the three principal control effects found in present controllers are examined and practical names and units of measurement are proposed for each effect. Corresponding units are proposed for a classification of industrial processes in terms of the two principal characteristics affecting their controllability. Formulas are given which enable the controller settings to be determined from the experimental or calculated values of the lag and unit reaction rate of the process to be controlled. These units form the basis of a quick method for adjusting a controller on the job. The effect of varying each controller setting is shown in a series of chart records. It is believed that the conceptions of control presented in this paper will be of assistance in the adjustment of existing controller applications and in the design of new installations.
$ python main.py
anonymous
Model predictive control: Theory and practice—A survey
$ python main.py
anonymous
Constrained model predictive control: Stability and optimality
$ python main.py
anonymous
How can we perform efficient inference and learning in directed probabilistic models, in the presence of continuous latent variables with intractable posterior distributions, and large datasets? We introduce a stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case. Our contributions are two-fold. First, we show that a reparameterization of the variational lower bound yields a lower bound estimator that can be straightforwardly optimized using standard stochastic gradient methods. Second, we show that for i.i.d. datasets with continuous latent variables per datapoint, posterior inference can be made especially efficient by fitting an approximate inference model (also called a recognition model) to the intractable posterior using the proposed lower bound estimator. Theoretical advantages are reflected in experimental results.
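The reparameterization at the heart of the estimator is a one-liner: draw eps ~ N(0, I) and set z = mu + sigma * eps, so the randomness no longer depends on the variational parameters and gradients can flow through mu and log_var. A numpy sketch with the analytic KL term of the lower bound; the values are illustrative.

import numpy as np

rng = np.random.default_rng(0)

def reparameterize(mu, log_var):
    """z = mu + sigma * eps with eps ~ N(0, I): the stochasticity is moved
    into eps, making the sample differentiable w.r.t. mu and log_var."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def kl_to_standard_normal(mu, log_var):
    """Analytic KL(q(z|x) || N(0, I)) term of the variational lower bound."""
    return -0.5 * np.sum(1.0 + log_var - mu**2 - np.exp(log_var))

mu, log_var = np.array([0.5, -1.0]), np.array([0.0, -2.0])
print(reparameterize(mu, log_var), kl_to_standard_normal(mu, log_var))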
$ python main.py
anonymous
We propose a new framework for estimating generative models via an adversarial process, in which we simultaneously train two models: a generative model G that captures the data distribution, and a discriminative model D that estimates the probability that a sample came from the training data rather than G. The training procedure for G is to maximize the probability of D making a mistake. This framework corresponds to a minimax two-player game. In the space of arbitrary functions G and D, a unique solution exists, with G recovering the training data distribution and D equal to 1/2 everywhere. In the case where G and D are defined by multilayer perceptrons, the entire system can be trained with backpropagation. There is no need for any Markov chains or unrolled approximate inference networks during either training or generation of samples. Experiments demonstrate the potential of the framework through qualitative and quantitative evaluation of the generated samples.
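A minimal sketch of the two-player training loop in PyTorch, fitting a one-dimensional Gaussian; the architectures, learning rates, target distribution, and the non-saturating generator loss are standard illustrative choices, not the paper's exact experimental setup.

import torch
import torch.nn as nn

torch.manual_seed(0)
G = nn.Sequential(nn.Linear(1, 16), nn.ReLU(), nn.Linear(16, 1))
D = nn.Sequential(nn.Linear(1, 16), nn.ReLU(), nn.Linear(16, 1), nn.Sigmoid())
opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-3)
bce = nn.BCELoss()

for step in range(2000):
    real = 2.0 + 0.5 * torch.randn(64, 1)          # data distribution N(2, 0.5^2)
    fake = G(torch.randn(64, 1))                    # generator samples from noise
    # D ascends its objective: label real samples 1, generated samples 0.
    d_loss = bce(D(real), torch.ones(64, 1)) + bce(D(fake.detach()), torch.zeros(64, 1))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()
    # G is trained to make D call its samples real (non-saturating loss).
    g_loss = bce(D(fake), torch.ones(64, 1))
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()

with torch.no_grad():
    samples = G(torch.randn(5000, 1))
print(float(samples.mean()), float(samples.std()))  # should drift toward 2.0 and 0.5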
$ python main.py
anonymous
Federated learning (FL) is a machine learning setting where many clients (e.g., mobile devices or whole organizations) collaboratively train a model under the orchestration of a central server (e.g., service provider), while keeping the training data decentralized. FL embodies the principles of focused data collection and minimization, and can mitigate many of the systemic privacy risks and costs resulting from traditional, centralized machine learning and data science approaches. Motivated by the explosive growth in FL research, this monograph discusses recent advances and presents an extensive collection of open problems and challenges.
$ python main.py
anonymous
Modern mobile devices have access to a wealth of data suitable for learning models, which in turn can greatly improve the user experience on the device. For example, language models can improve speech recognition and text entry, and image models can automatically select good photos. However, this rich data is often privacy sensitive, large in quantity, or both, which may preclude logging to the data center and training there using conventional approaches. We advocate an alternative that leaves the training data distributed on the mobile devices, and learns a shared model by aggregating locally-computed updates. We term this decentralized approach Federated Learning. We present a practical method for the federated learning of deep networks based on iterative model averaging, and conduct an extensive empirical evaluation, considering five different model architectures and four datasets. These experiments demonstrate the approach is robust to the unbalanced and non-IID data distributions that are a defining characteristic of this setting. Communication costs are the principal constraint, and we show a reduction in required communication rounds by 10-100x as compared to synchronized stochastic gradient descent.
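The server-side averaging step is a weighted sum of client parameters. A sketch assuming each client ships a list of numpy arrays together with its local dataset size; the shapes and client counts are hypothetical.

import numpy as np

def fed_avg(client_weights, client_sizes):
    """Server step of Federated Averaging: weight each client's parameters
    by its share of the total number of training examples."""
    total = sum(client_sizes)
    return [
        sum(w[layer] * (n / total) for w, n in zip(client_weights, client_sizes))
        for layer in range(len(client_weights[0]))
    ]

rng = np.random.default_rng(0)
# Three hypothetical clients, each holding a two-layer linear model.
clients = [[rng.normal(size=(3, 2)), rng.normal(size=2)] for _ in range(3)]
sizes = [100, 300, 600]                            # unbalanced local datasets
global_model = fed_avg(clients, sizes)
print([p.shape for p in global_model])

In the full algorithm this average is taken over locally updated models each communication round, which is where the reported 10-100x reduction in rounds relative to synchronized SGD comes from.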
$ python main.py
anonymous
Human-level control through deep reinforcement learning
$ python main.py
anonymous
Q-learning
$ python main.py
anonymous
Large datasets are increasingly common and are often difficult to interpret. Principal component analysis (PCA) is a technique for reducing the dimensionality of such datasets, increasing interpretability but at the same time minimizing information loss. It does so by creating new uncorrelated variables that successively maximize variance. Finding such new variables, the principal components, reduces to solving an eigenvalue/eigenvector problem, and the new variables are defined by the dataset at hand, not a priori, hence making PCA an adaptive data analysis technique. It is adaptive in another sense too, since variants of the technique have been developed that are tailored to various different data types and structures. This article will begin by introducing the basic ideas of PCA, discussing what it can and cannot do. It will then describe some variants of PCA and their application.
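The eigenvalue formulation in a few lines of numpy: center the data, eigendecompose the sample covariance, and project onto the top eigenvectors. The synthetic two-factor dataset is only for illustration.

import numpy as np

def pca(X, n_components=2):
    """PCA via the eigendecomposition of the sample covariance matrix."""
    Xc = X - X.mean(axis=0)                     # center each variable
    C = np.cov(Xc, rowvar=False)
    vals, vecs = np.linalg.eigh(C)              # eigenvalues in ascending order
    order = np.argsort(vals)[::-1][:n_components]
    components = vecs[:, order]                 # directions of maximal variance
    explained = vals[order] / vals.sum()
    return Xc @ components, explained

rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 2))              # two underlying factors
X = latent @ rng.normal(size=(2, 5)) + 0.05 * rng.normal(size=(200, 5))
scores, explained = pca(X, 2)
print(scores.shape, explained.round(3))         # two components carry most variance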
$ python main.py
anonymous
Least squares quantization in PCM
$ python main.py
anonymous
Support-vector networks
$ python main.py
anonymous
Random Forests
$ python main.py
anonymous
SUNDIALS is a suite of advanced computational codes for solving large-scale problems that can be modeled as a system of nonlinear algebraic equations, or as initial-value problems in ordinary differential or differential-algebraic equations. The basic versions of these codes are called KINSOL, CVODE, and IDA, respectively. The codes are written in ANSI standard C and are suitable for either serial or parallel machine environments. Common and notable features of these codes include inexact Newton-Krylov methods for solving large-scale nonlinear systems; linear multistep methods for time-dependent problems; a highly modular structure to allow incorporation of different preconditioning and/or linear solver methods; and clear interfaces allowing for users to provide their own data structures underneath the solvers. We describe the current capabilities of the codes, along with some of the algorithms and heuristics used to achieve efficiency and robustness. We also describe how the codes stem from previous and widely used Fortran 77 solvers, and how the codes have been augmented with forward and adjoint methods for carrying out first-order sensitivity analysis with respect to model parameters or initial conditions.
$ python main.py
anonymous
A family of embedded Runge-Kutta formulae
$ python main.py
anonymous
An analysis of the academic literature on simulation and modelling in health care
$ python main.py
anonymous
Fifty years ago, the author published a paper in Operations Research with the title, “A proof for the queuing formula: L = λW” [Little, J. D. C. 1961. A proof for the queuing formula: L = λW. Oper. Res. 9(3) 383–387]. Over the years, L = λW has become widely known as “Little's Law.” Basically, it is a theorem in queuing theory. It has become well known because of its theoretical and practical importance. We report key developments in both areas with the emphasis on practice. In the latter, we collect new material and search for insights on the use of Little's Law within the fields of operations management and computer architecture.
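The law itself is a one-line, distribution-free identity, L = λW. A hypothetical worked example (the numbers are mine):

# Little's Law: L = lambda * W.
arrival_rate = 30.0            # lambda: customers arriving per hour
time_in_system = 10.0 / 60.0   # W: average 10 minutes in the system, in hours
L = arrival_rate * time_in_system
print(L)                       # 5 customers in the system on average

# Because no distributional assumptions are needed, any one quantity
# follows from the other two:
W = L / arrival_rate
print(W * 60.0)                # back out the 10-minute average time in system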
$ python main.py
anonymous
Tutorial on agent-based modelling and simulation
$ python main.py
anonymous
Agent-based modeling is a powerful simulation modeling technique that has seen a number of applications in the last few years, including applications to real-world business problems. After the basic principles of agent-based simulation are briefly introduced, its four areas of application are discussed by using real-world applications: flow simulation, organizational simulation, market simulation, and diffusion simulation. For each category, one or several business applications are described and analyzed.
$ python main.py
anonymous
An Introduction to MCMC for Machine Learning
$ python main.py
anonymous
Monte Carlo sampling methods using Markov chains and their applications
$ python main.py
anonymous
Particle swarm optimization
$ python main.py
anonymous
There is a deep and useful connection between statistical mechanics (the behavior of systems with many degrees of freedom in thermal equilibrium at a finite temperature) and multivariate or combinatorial optimization (finding the minimum of a given function depending on many parameters). A detailed analogy with annealing in solids provides a framework for optimization of the properties of very large and complex systems. This connection to statistical mechanics exposes new information and provides an unfamiliar perspective on traditional optimization problems and methods.
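A minimal Metropolis-style annealing sketch: propose a random move, always accept improvements, accept uphill moves with probability exp(-ΔE/T), and lower T geometrically. The multimodal test function, gains, and cooling schedule are arbitrary illustrative choices.

import numpy as np

def simulated_annealing(energy, x0, step=0.5, t0=1.0, cooling=0.999, iters=20000):
    """Annealing with a Metropolis acceptance rule and geometric cooling."""
    rng = np.random.default_rng(0)
    x, e, t = x0, energy(x0), t0
    best_x, best_e = x, e
    for _ in range(iters):
        cand = x + rng.normal(0.0, step, size=np.shape(x))
        de = energy(cand) - e
        # Accept downhill moves always; uphill moves with prob exp(-dE/T).
        if de < 0 or rng.random() < np.exp(-de / t):
            x, e = cand, e + de
            if e < best_e:
                best_x, best_e = x, e
        t *= cooling                           # the annealing schedule
    return best_x, best_e

# Rastrigin-like function: many local minima, global minimum at the origin.
f = lambda x: float(np.sum(x**2 + 10.0 * (1.0 - np.cos(2.0 * np.pi * x))))
print(simulated_annealing(f, np.array([4.0, -3.0])))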
$ python main.py
anonymous
On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming
$ python main.py
anonymous
A Limited Memory Algorithm for Bound Constrained Optimization
$ python main.py
anonymous
Semidefinite Programming
$ python main.py
anonymous
Many problems of recent interest in statistics and machine learning can be posed in the framework of convex optimization. Due to the explosion in size and complexity of modern datasets, it is increasingly important to be able to solve problems with a very large number of features or training examples. As a result, both the decentralized collection or storage of these datasets as well as accompanying distributed solution methods are either necessary or at least highly desirable. In this review, we argue that the alternating direction method of multipliers is well suited to distributed convex optimization, and in particular to large-scale problems arising in statistics, machine learning, and related areas. The method was developed in the 1970s, with roots in the 1950s, and is equivalent or closely related to many other algorithms, such as dual decomposition, the method of multipliers, Douglas–Rachford splitting, Spingarn's method of partial inverses, Dykstra's alternating projections, Bregman iterative algorithms for ℓ1 problems, proximal methods, and others. After briefly surveying the theory and history of the algorithm, we discuss applications to a wide variety of statistical and machine learning problems of recent interest, including the lasso, sparse logistic regression, basis pursuit, covariance selection, support vector machines, and many others. We also discuss general distributed optimization, extensions to the nonconvex setting, and efficient implementation, including some details on distributed MPI and Hadoop MapReduce implementations.
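For the lasso, the three ADMM updates are a ridge-type solve, a soft-threshold, and a dual update. A sketch on synthetic sparse-regression data; the penalty rho and iteration count are not specified in the abstract and are chosen arbitrarily here.

import numpy as np

def lasso_admm(A, b, lam, rho=1.0, iters=200):
    """ADMM for minimize 0.5*||Ax - b||^2 + lam*||z||_1 subject to x = z."""
    n = A.shape[1]
    x = z = u = np.zeros(n)
    M = np.linalg.inv(A.T @ A + rho * np.eye(n))   # cached factor for the x-update
    Atb = A.T @ b
    soft = lambda v, k: np.sign(v) * np.maximum(np.abs(v) - k, 0.0)
    for _ in range(iters):
        x = M @ (Atb + rho * (z - u))        # x-update: quadratic minimization
        z = soft(x + u, lam / rho)           # z-update: soft-thresholding
        u = u + x - z                        # dual ascent on the consensus constraint
    return z

rng = np.random.default_rng(0)
A = rng.normal(size=(50, 20))
x_true = np.zeros(20); x_true[[2, 7, 11]] = [1.5, -2.0, 1.0]
b = A @ x_true + 0.01 * rng.normal(size=50)
# Should recover the planted support [2, 7, 11].
print(np.nonzero(np.abs(lasso_admm(A, b, lam=1.0)) > 1e-3)[0])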
$ python main.py
anonymous
A Computationally Efficient Mixed-Integer Linear Formulation for the Thermal Unit Commitment Problem
$ python main.py
anonymous
Generalized Benders decomposition
$ python main.py
anonymous
MATPOWER: Steady-State Operations, Planning, and Analysis Tools for Power Systems Research and Education
$ python main.py
anonymous
A new polynomial-time algorithm for linear programming
$ python main.py
sayonsom
The distinctive feature of the adaptive immune system is its ability to generate immunological memory that can provide defense against subsequent infections. In the case of antibody-mediated immune responses, this memory comes in two cellular forms: plasma cells (PCs) and memory B cells (MBCs). PCs protect against reinfection by constitutively producing antibodies. The presence of a diverse pool of MBCs, which can expand and differentiate into PCs in secondary immune responses, is thought to be particularly important for defense against new pathogen variants. Recent studies have shown that the MBC compartment is far more heterogeneous than previously anticipated. This heterogeneity, among other factors, is shaped by their developmental pathway (germinal center (GC) vs non-GC-derived MBCs), the duration and strength of antigenic stimulation, anatomical and microanatomical localization, and the timing of generation in ontogeny. Combinations of these “layers” of MBC identities can define MBCs’ properties and their fate in recall responses. Here, we review the mechanisms underlying MBC differentiation, maintenance, and reactivation and explore how the layered identity of MBCs contributes to the functions of these cells.
$ python main.py
cross-domain-seed
Statistical downscaling of GCM outputs to regional (10 km) daily precipitation and temperature grids using a transformer-based bias-correction scheme trained on CMIP6 ensembles. Produces uncertainty-quantified local climate projections.
$ python main.py
cross-domain-seed
Computable general equilibrium (CGE) model of labor markets with heterogeneous skill types, frictional search, and exogenous productivity shocks. Solves for wages, unemployment rates, and cross-sector flows under counterfactual policy shocks.
$ python main.py
cross-domain-seed
Stochastic SIR epidemic simulator on a heterogeneous contact network, supporting targeted intervention policies (lockdown, vaccination, contact tracing) with variable compliance rates.
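A sketch of a discrete-time version under simple assumptions: each infected node transmits along each S-I edge with probability beta per step and recovers with probability gamma per step. The ring-plus-shortcuts contact network and all parameter values are placeholders; interventions such as vaccination could be modeled by moving nodes to R before seeding, and lockdown by deleting edges.

import numpy as np

def stochastic_sir(adj, beta=0.1, gamma=0.05, seeds=(0,), steps=200, rng=None):
    """Discrete-time stochastic SIR on a contact network (adjacency dict)."""
    rng = rng or np.random.default_rng(0)
    n = len(adj)
    state = np.zeros(n, dtype=int)             # 0 = S, 1 = I, 2 = R
    state[list(seeds)] = 1
    history = []
    for _ in range(steps):
        infected = np.where(state == 1)[0]
        if len(infected) == 0:
            break                              # epidemic has died out
        for i in infected:
            for j in adj[i]:                   # transmission along S-I edges
                if state[j] == 0 and rng.random() < beta:
                    state[j] = 1
            if rng.random() < gamma:           # recovery
                state[i] = 2
        history.append(int((state == 1).sum()))
    return history, int((state == 2).sum())

# Heterogeneous contacts: a ring lattice plus random shortcut edges.
rng = np.random.default_rng(1)
n = 300
adj = {i: {(i - 1) % n, (i + 1) % n} for i in range(n)}
for _ in range(100):
    a, b = rng.integers(0, n, size=2)
    if a != b:
        adj[int(a)].add(int(b)); adj[int(b)].add(int(a))
print(stochastic_sir(adj)[1], "nodes eventually recovered")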
$ python main.py
anonymous
Review of Small-Signal Converter-Driven Stability Issues in Power Systems
$ python main.py
anonymous
Coordinated Planning of Electric Vehicle Charging Infrastructure and Renewables in Power Grids
$ python main.py
anonymous
Reconfigurable Real-Time Power Grid Emulator for Systems With High Penetration of Renewables
$ python main.py
anonymous
On Modeling Depths of Power Electronic Circuits for Real-Time Simulation – A Comparative Analysis for Power Systems
$ python main.py
anonymous
Optimal Energy Dispatch of Distributed PVs for the Next Generation of Distribution Management Systems
$ python main.py
anonymous
100% Sustainable Electricity in the Faroe Islands: Expansion Planning Through Economic Optimization
$ python main.py
anonymous
Proposes an adaptive droop control method to suppress circulating currents among parallel DC-DC converters in DC microgrids. The method dynamically adjusts droop resistance based on measured circulating currents to achieve proportional load sharing and reduce bus voltage deviation.
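To make the idea concrete, here is an illustrative adaptive loop (not the paper's exact control law) for two parallel converters feeding a common bus through unequal cable resistances r: each unit follows V_out = V0 - Rd*I, and Rd is nudged against the unit's deviation from the mean current until the current mismatch (the source of circulating current) vanishes. All parameter values are invented.

import numpy as np

V0, I_load = 48.0, 20.0                        # nominal bus voltage, load current
r = np.array([0.10, 0.30])                     # unequal cable resistances (ohm)
Rd = np.array([0.50, 0.50])                    # initial, equal droop resistances

def bus_solution(Rd):
    """Steady-state bus voltage and branch currents for V_out = V0 - Rd*I
    with cable resistance r between each converter and the common bus."""
    g = 1.0 / (r + Rd)                         # effective branch conductances
    Vb = V0 - I_load / g.sum()
    return Vb, (V0 - Vb) * g

Vb, I = bus_solution(Rd)
print("fixed droop:   I =", I.round(2), " Vb =", round(Vb, 2))

# Adaptive loop: raise Rd on the over-loaded unit, lower it on the
# under-loaded one, so the currents equalize for identical ratings.
for _ in range(200):
    Vb, I = bus_solution(Rd)
    Rd = np.clip(Rd + 0.05 * (I - I.mean()), 0.05, 2.0)
print("adaptive droop: I =", I.round(2), " Vb =", round(Vb, 2))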
$ python main.py
anonymous
This paper presents a systematic review of the evolution of research in microgrids control, covering publications from 2000 to 2019. The review analyzes trends in publication counts, research themes, control methodologies, and collaboration patterns to map the landscape of microgrid control research and identify emerging directions.
$ python main.py
anonymous
This paper presents a paradigm shift in power systems planning and operation by integrating wildfire risk into operational decision-making. A composite wildfire risk index is defined for transmission lines based on vegetation, weather, and topographic factors. A risk-constrained optimal power flow formulation is proposed that allows selective de-energization of high-risk lines to reduce wildfire ignition probability while minimizing load curtailment and operational cost.
$ python main.py
anonymous
This paper proposes a fast frequency support (FFS) control strategy for wind turbine systems to arrest the frequency nadir close to the settling frequency following a generation-load imbalance. The strategy coordinates derivative and proportional power injection from wind turbines to minimize the frequency deviation between nadir and settling point, thereby improving frequency stability.
$ python main.py
anonymous
This paper investigates federated learning for short-term residential load forecasting, addressing privacy concerns in smart grid data. Local LSTM models are trained on individual household data and aggregated using FedAvg, demonstrating competitive accuracy compared to centralised training while preserving data privacy.
$ python main.py
anonymous
This paper presents a methodology for building highly detailed synthetic electric grid datasets that combine transmission and distribution systems. The approach starts with a synthetic transmission system and attaches synthetic distribution feeders at each load bus, scaling loads to match transmission-level demand. The resulting datasets preserve statistical properties of real grids while containing no confidential information, enabling open sharing for research and education.
$ python main.py
anonymous
This paper presents a comprehensive catalogue of test distribution systems including network parameters and diagrams. The paper provides standardized data for radial and meshed distribution test feeders commonly used in power systems research, enabling reproducible benchmarking of distribution system analysis and optimization algorithms.
$ python main.py
anonymous
This paper proposes a unified definition of energy quality that encompasses voltage quality, frequency quality, and waveform quality into a composite Energy Quality Index (EQI). The index quantifies the degree to which delivered electrical energy meets ideal standards, enabling comparison across systems and time periods.
$ python main.py
anonymous
This paper presents model-based and data-driven HVAC control strategies for residential demand response. A first-order RC thermal model captures building thermal dynamics. Model predictive control (MPC) optimizes HVAC scheduling to minimize energy cost while satisfying thermal comfort constraints. A data-driven rule-based strategy provides a computationally lighter alternative. Both strategies are evaluated under time-of-use pricing and direct load control scenarios.
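A sketch of the model-based piece under stated assumptions: a first-order RC discretization T[k+1] = T[k] + dt/C * ((T_out - T[k])/R + Q), and a brute-force MPC that enumerates on/off plans over a short horizon. The building parameters and time-of-use prices are invented, and exhaustive enumeration stands in for a proper solver.

import itertools

R, C, dt = 2.0, 5.0, 1.0          # K/kW, kWh/K, hours (hypothetical building)
Q_cool = -5.0                     # kW of cooling when the HVAC runs

def step(T, T_out, on):
    """One step of the first-order RC thermal model."""
    return T + dt / C * ((T_out - T) / R + (Q_cool if on else 0.0))

def mpc(T, T_out, price, horizon=6, comfort=(21.0, 25.0)):
    """Brute-force MPC: enumerate on/off plans over the horizon and keep the
    cheapest one whose temperature trajectory stays in the comfort band."""
    best, best_cost = None, float("inf")
    for plan in itertools.product((0, 1), repeat=horizon):
        t, cost, ok = T, 0.0, True
        for k, on in enumerate(plan):
            t = step(t, T_out[k], on)
            cost += price[k] * abs(Q_cool) * dt * on
            if not (comfort[0] <= t <= comfort[1]):
                ok = False
                break
        if ok and cost < best_cost:
            best, best_cost = plan, cost
    return best, best_cost

T_out = [32.0] * 6                               # hot afternoon
price = [0.10, 0.10, 0.40, 0.40, 0.10, 0.10]     # time-of-use tariff, $/kWh
print(mpc(24.0, T_out, price))                   # cheapest comfort-feasible plan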
$ python main.py
anonymous
This paper proposes a distributed secondary control scheme for islanded microgrids using a generalized PI finite-time controller. The controller guarantees finite-time convergence for both frequency and voltage restoration, as well as proportional active power sharing among distributed generators. The communication topology is modeled as a directed graph and the consensus protocol is designed based on Lyapunov-based finite-time stability theory.
$ python main.py
anonymous
This paper presents a comprehensive review of space microgrids for future manned lunar bases. It covers power generation technologies including solar photovoltaic arrays, nuclear power (RTG and fission), and fuel cells. Energy storage options including batteries and regenerative fuel cells are discussed. The paper reviews microgrid architectures (DC, AC, and hybrid), power management and control strategies, and reliability considerations specific to the lunar environment. Key challenges such as the 14-day lunar night, radiation, thermal extremes, and dust contamination are addressed.
$ python main.py
anonymous
This paper argues that the natural language of electricity markets is complementarity, not optimization. While market clearing is often formulated as a social welfare maximization problem, the underlying equilibrium conditions are complementarity conditions derived from KKT optimality. The paper presents both energy-only and network-constrained market models, derives their KKT/complementarity conditions, and solves them as linear complementarity problems (LCPs) to find market equilibria including locational marginal prices.
$ python main.py
anonymous
This paper presents a comprehensive statistical analysis of faults occurring on 220-kV and above transmission lines in a southern coastal provincial power grid of China over the period 2009 to 2018. The study analyzes fault frequency rates, fault types, fault causes, reclosing success rates, seasonal distributions, and voltage-level characteristics to identify key risk factors and guide grid maintenance strategies.
$ python main.py
anonymous
This paper presents a system-level design framework for reliability and maintenance scheduling in modern power electronic-based power systems. Mission-profile-based stress analysis is combined with component lifetime models to obtain failure rates. Markov chain models compute system reliability indices. Maintenance scheduling optimization minimizes lifecycle cost subject to availability constraints.
$ python main.py
anonymous
This paper presents the results of a day-ahead electricity demand forecasting competition that was motivated by the unprecedented changes in electricity consumption patterns caused by the COVID-19 pandemic. Participants were challenged to forecast 24-hour-ahead electricity demand for a large North American utility using data spanning the COVID-19 period. The paper describes the competition design, the evaluation framework, and the methods employed by top-performing teams, highlighting that ensemble and machine-learning approaches outperformed classical statistical baselines.
$ python main.py
anonymous
Optimal dispatch of a battery energy storage system participating in energy and frequency regulation markets while peak shaving an EV fast charging station load using mixed-integer linear programming.
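As a simplified sketch of the optimization, here is an LP for the energy-arbitrage piece only (regulation revenues, peak shaving, and integer variables omitted), using scipy's linprog with made-up prices and battery parameters.

import numpy as np
from scipy.optimize import linprog

T = 6
price = np.array([0.05, 0.04, 0.10, 0.30, 0.35, 0.08])  # $/kWh, hypothetical
p_max, e_max, e0, eta = 2.0, 4.0, 2.0, 0.95             # kW, kWh, initial kWh, eff.

# Decision variables x = [charge(0..T-1), discharge(0..T-1), soc(0..T-1)].
n = 3 * T
cost = np.concatenate([price, -price, np.zeros(T)])     # minimize = -revenue

# SoC recursion: soc[t] - soc[t-1] - eta*charge[t] + discharge[t]/eta = 0.
A_eq, b_eq = np.zeros((T, n)), np.zeros(T)
for t in range(T):
    A_eq[t, t] = -eta                 # charge adds energy (with losses)
    A_eq[t, T + t] = 1.0 / eta        # discharge drains more than it delivers
    A_eq[t, 2 * T + t] = 1.0
    if t == 0:
        b_eq[t] = e0                  # soc[-1] is the initial energy
    else:
        A_eq[t, 2 * T + t - 1] = -1.0

bounds = [(0, p_max)] * T + [(0, p_max)] * T + [(0, e_max)] * T
res = linprog(cost, A_eq=A_eq, b_eq=b_eq, bounds=bounds, method="highs")
charge, discharge = res.x[:T], res.x[T:2 * T]
print("profit: $", round(-res.fun, 3))
print("charge:   ", charge.round(2))              # buys in the cheap early hours
print("discharge:", discharge.round(2))           # sells into the price peak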
$ python main.py
anonymous
This paper presents an optimal scheduling framework for merchant-owned energy storage systems (ESS) participating in multiple ancillary service markets alongside energy arbitrage. The proposed mixed-integer linear program maximises expected daily profit by co-optimising bids for energy, spinning reserve, regulation up, and regulation down over a 24-hour horizon while respecting physical ESS constraints including state-of-charge limits, power limits, round-trip efficiency, and battery degradation costs.
$ python main.py
anonymous
This paper presents a software-defined microgrid (SDM) control framework that decouples the cyber and physical layers of microgrids. By abstracting physical resources into software-defined virtual resources, the SDM enables flexible, programmable, and resilient microgrid control. The proposed architecture supports decoupled cyber-physical operation and demonstrates improved power sharing, frequency restoration, and voltage regulation compared to conventional droop control.
$ python main.py
sayonsom
Text-to-video diffusion models have enabled open-ended video synthesis, but often struggle with generating the correct number of objects specified in a prompt.
$ python main.py
sayonsom
We introduce TurboDiffusion, a video generation acceleration framework that can speed up end-to-end diffusion generation by 100-200x while maintaining video quality.
$ python setup.py