DeepMind Technologies Limited

United Kingdom

Back to Profile

1-100 of 292 for DeepMind Technologies Limited Sort by
Query
Patent
World - WIPO
Aggregations Reset Report
Date
New (last 4 weeks) 14
2024 April (MTD) 13
2024 March 7
2024 February 3
2024 January 2
See more
IPC Class
G06N 3/04 - Architecture, e.g. interconnection topology 164
G06N 3/08 - Learning methods 151
G06N 3/00 - Computing arrangements based on biological models 77
G06N 3/045 - Combinations of networks 28
G06N 3/092 - Reinforcement learning 26
See more
Found results for  patents
  1     2     3        Next Page

1.

SELECTIVE ACQUISITION FOR MULTI-MODAL TEMPORAL DATA

      
Application Number EP2023079389
Publication Number 2024/084097
Status In Force
Filing Date 2023-10-21
Publication Date 2024-04-25
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Kossen, Jannik Lukas
  • Belgrave, Danielle Charlotte Mary
  • Tomasev, Nenad
  • Cangea, Catalina-Codruta
  • Ktena, Sofia Ira
  • Vértes, Eszter
  • Patraucean, Viorica
  • Jaegle, Andrew Coulter

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a prediction characterizing an environment. In one aspect, a method includes obtaining a respective observation characterizing a state of an environment for each time step in a sequence of multiple time steps, comprising, for each time step after a first time step in the sequence of time steps: processing a network input that comprises observations obtained for one or more preceding time steps to generate a plurality of acquisition decisions; obtaining an observation for the time step, wherein the observation includes data corresponding to modalities that are selected for acquisition at the time step, does not include data corresponding to modalities that are not selected for acquisition at the time step; and processing a model input that includes the observation for each time step in the sequence of time steps to generate the prediction.

IPC Classes  ?

2.

PATHOGENICITY PREDICTION FOR PROTEIN MUTATIONS USING AMINO ACID SCORE DISTRIBUTIONS

      
Application Number EP2023078227
Publication Number 2024/079204
Status In Force
Filing Date 2023-10-11
Publication Date 2024-04-18
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Avsec, Ziga
  • Novati, Guido
  • Cheng, Jun

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a pathogenicity score characterizing a likelihood that a mutation to a protein is a pathogenic mutation, wherein the mutation modifies an amino acid sequence of the protein by replacing an original amino acid by a substitute amino acid at a mutation position in the amino acid sequence of the protein. In one aspect, a method comprises: generating a network input to a pathogenicity prediction neural network, wherein the network input comprises a multiple sequence alignment (MSA) representation that represents an MSA for the protein; processing the network input using the pathogenicity prediction neural network to generate a score distribution over a set of amino acids; and generating the pathogenicity score using the score distribution over the set of amino acids.

IPC Classes  ?

3.

OPTIMIZING ALGORITHMS FOR HARDWARE DEVICES

      
Application Number EP2023077237
Publication Number 2024/074452
Status In Force
Filing Date 2023-10-02
Publication Date 2024-04-11
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Hubert, Thomas Keisuke
  • Huang, Shih-Chieh
  • Novikov, Alexander
  • Fawzi, Alhussein
  • Romera-Paredes, Bernardino
  • Silver, David
  • Hassabis, Demis
  • Swirszcz, Grzegorz Michal
  • Schrittwieser, Julian
  • Kohli, Pushmeet
  • Barekatain, Mohammadamin
  • Balog, Matej
  • Rodriguez Ruiz, Francisco Jesus

Abstract

A method performed by one or more computers for obtaining an optimized algorithm that (i) is functionally equivalent to a target algorithm and (ii) optimizes one or more target properties when executed on a target set of one or more hardware devices. The method includes: initializing a target tensor representing the target algorithm; generating, using a neural network having a plurality of network parameters, a tensor decomposition of the target tensor that parametrizes a candidate algorithm; generating target property values for each of the target properties when executing the candidate algorithm on the target set of hardware devices; determining a benchmarking score for the tensor decomposition based on the target property values of the candidate algorithm; generating a training example from the tensor decomposition and the benchmarking score; and storing, in a training data store, the training example for use in updating the network parameters of the neural network.

IPC Classes  ?

  • G06F 16/901 - Indexing; Data structures therefor; Storage structures
  • G06N 3/092 - Reinforcement learning
  • G06N 3/0985 - Hyperparameter optimisation; Meta-learning; Learning-to-learn
  • G06N 5/01 - Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound

4.

CONTROLLING AGENTS USING REPORTER NEURAL NETWORKS

      
Application Number EP2023076516
Publication Number 2024/068610
Status In Force
Filing Date 2023-09-26
Publication Date 2024-04-04
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Dasgupta, Ishita
  • Chen, Shiqi
  • Marino, Kenneth Daniel
  • Shang, Wenling
  • Ahuja, Arun

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents using reporter neural networks.

IPC Classes  ?

5.

SCORE MODELLING FOR SIMULATION-BASED INFERENCE

      
Application Number EP2023076529
Publication Number 2024/068622
Status In Force
Filing Date 2023-09-26
Publication Date 2024-04-04
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Mnih, Andriy
  • Geffner, Tomas

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for using simulation-based inference to inferring a set of parameters such as measurements, from observations, e.g. real world observations. The method uses a score generation neural network to determine scores for individual observations or for groups of observations that are combined and used to iteratively adjust values of the parameters.

IPC Classes  ?

  • G06F 16/906 - Clustering; Classification
  • G06N 3/00 - Computing arrangements based on biological models

6.

GRAPH NEURAL NETWORKS THAT MODEL FACE-FACE INTERACTIONS BETWEEN MESHES

      
Application Number EP2023076797
Publication Number 2024/068788
Status In Force
Filing Date 2023-09-27
Publication Date 2024-04-04
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Allen, Kelsey Rebecca
  • Rubanova, Yulia
  • Lopez Guevara, Tatiana
  • Whitney, William Fairclough
  • Sanchez, Alvaro
  • Battaglia, Peter William
  • Pfaff, Tobias

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for simulating a state of an environment over a sequence of time steps. In one aspect, a method comprises, at each of one or more time steps: obtaining an environment mesh representing the state of the environment at the time step; generating a graph representing the state of the environment at the time step, comprising: determining that a first face of a first object mesh is within a collision distance of a second face of a second object mesh; and in response, instantiating a face-face edge in the graph that connects: (i) a first set of graph nodes in the graph that represent the first face in the first object mesh, and (ii) a second set of graph nodes in the graph that represent the second face in the second object mesh.

IPC Classes  ?

  • G06F 30/20 - Design optimisation, verification or simulation
  • B25J 9/00 - Programme-controlled manipulators

7.

LEARNING TASKS USING SKILL SEQUENCING FOR TEMPORALLY-EXTENDED EXPLORATION

      
Application Number EP2023076798
Publication Number 2024/068789
Status In Force
Filing Date 2023-09-27
Publication Date 2024-04-04
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Vezzani, Giulia
  • Tirumala Bukkapatnam, Dhruva
  • Wulfmeier, Markus
  • Riedmiller, Martin
  • Heess, Nicolas Manfred Otto

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling an agent that is interacting with an environment. Implementations of the system use previously learned skills to explore states of the environment to collect and store training data, which is then used to train an action selection system. The system includes a set of skill action selection subsystems, each configured to select actions for the agent to perform for a respective skill. The set of skill action selection subsystems is used to explore states of the environment to collect the training data, keeping their individual action selection policies unchanged. A scheduler neural network selects the skill neural networks to use. The action selection system is trained on the stored training data.

IPC Classes  ?

8.

REINFORCEMENT LEARNING USING DENSITY ESTIMATION WITH ONLINE CLUSTERING FOR EXPLORATION

      
Application Number EP2023076893
Publication Number 2024/068841
Status In Force
Filing Date 2023-09-28
Publication Date 2024-04-04
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Saade, Alaa
  • Kapturowski, Steven James
  • Calandriello, Daniele
  • Blundell, Charles
  • Valko, Michal
  • Sprechmann, Pablo
  • Piot, Bilal

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network used to select actions to be performed by an agent interacting with an environment. Implementations of the described techniques can learn to explore the environment efficiently by storing and updating state embedding cluster centers based on observations characterizing states of the environment.

IPC Classes  ?

9.

AGENT CONTROL THROUGH IN-CONTEXT REINFORCEMENT LEARNING

      
Application Number EP2023076897
Publication Number 2024/068843
Status In Force
Filing Date 2023-09-28
Publication Date 2024-04-04
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Laskin, Michael
  • Mnih, Volodymyr
  • Wang, Luyu
  • Baveja, Satinder Singh

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using an action selection neural network that performs in-context reinforcement learning when controlling an agent on a new task.

IPC Classes  ?

  • G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
  • G06N 3/0442 - Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
  • G06N 3/092 - Reinforcement learning

10.

DISCRETE TOKEN PROCESSING USING DIFFUSION MODELS

      
Application Number EP2023076788
Publication Number 2024/068781
Status In Force
Filing Date 2023-09-27
Publication Date 2024-04-04
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Strudel, Robin
  • Leblond, Rémi
  • Sifre, Laurent
  • Dieleman, Sander Etienne Lea
  • Savinov, Nikolay
  • Grathwohl, Will S.
  • Tallec, Corentin
  • Altché, Florent
  • Ganin, Iaroslav
  • Mensch, Arthur
  • Du, Yilun

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an output sequence of discrete tokens using a diffusion model. In one aspect, a method includes generating, by using the diffusion model, a final latent representation of the sequence of discrete tokens that includes a determined value for each of a plurality of latent variables; applying a de-embedding matrix to the final latent representation of the output sequence of discrete tokens to generate a de-embedded final latent representation that includes, for each of the plurality of latent variables, a respective numeric score for each discrete token in a vocabulary of multiple discrete tokens; selecting, for each of the plurality of latent variables, a discrete token from among the multiple discrete tokens in the vocabulary that has a highest numeric score; and generating the output sequence of discrete tokens that includes the selected discrete tokens.

IPC Classes  ?

11.

REWARD-MODEL BASED REINFORCEMENT LEARNING FOR PERFORMING REASONING TASKS

      
Application Number EP2023076792
Publication Number 2024/068784
Status In Force
Filing Date 2023-09-27
Publication Date 2024-04-04
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Higgins, Irina
  • Uesato, Jonathan Ken
  • Kushman, Nathaniel Arthur
  • Kumar, Ramana

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for A training a language model for performing a reasoning task. The system obtains a plurality of training examples. Each training example includes a respective sample query text sequence characterizing a respective sample query and a respective reference response text sequence that includes a reference final answer to the respective sample query. The system trains a reward model on the plurality of training examples. The reward model is configured to receive an input including a query text sequence characterizing a query and one or more reasoning steps that have been generated in response to the query and process the input to compute a reward score indicating how successful the one or more reasoning steps are in yielding a correct final answer to the query. The system trains the language model using the trained reward model.

IPC Classes  ?

12.

SYSTEM AND METHOD FOR REINFORCEMENT LEARNING BASED ON PRIOR TRAJECTORIES

      
Application Number EP2023076793
Publication Number 2024/068785
Status In Force
Filing Date 2023-09-27
Publication Date 2024-04-04
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Bruce, Jacob
  • Anand, Ankit
  • Fergus, Robert David

Abstract

A reinforcement learning system is proposed in which a policy model neural network is trained to control an agent to perform a task in successive time steps, by training a control system including the policy model neural network to select a respective action for each time step which gives a high value for a reward function based on the action, and which indicates the contribution of the action to solving the task. The reward function includes a term based on a progress value output by a progress model. The progress model generates the progress value upon receiving a first observation of the state of the environment at a time step before the performance of the action, and a second observation of the state of the environment at a time step following the performance of the action. The progress value is an estimate of the average time which an ensemble of experts who produced the demonstrations would have taken to transform the environment from how it appears in the first observation to how it appears in the second observation.

IPC Classes  ?

13.

NEURAL NETWORKS WITH REGULARIZED ATTENTION LAYERS

      
Application Number EP2023076794
Publication Number 2024/068786
Status In Force
Filing Date 2023-09-27
Publication Date 2024-04-04
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • He, Bobby Boyi
  • Martens, James

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing a network input using a neural network that includes one or more regularized attention layers. In one aspect, a method comprises: receiving a layer input to a regularized attention layer, wherein the layer input to the regularized attention layer comprises a set of input embeddings; and applying a regularized attention operation over the set of input embeddings to generate a set of output embeddings, comprising: transforming intermediate attention scores using a set of shaping constants to generate a set of transformed attention scores, wherein: values of the shaping constants are initialized prior to training of the neural network and are not adjusted during the training of the neural network; and the values of the shaping constants are selected to regularize the set of output embeddings.

IPC Classes  ?

14.

GUIDED DIALOGUE USING LANGUAGE GENERATION NEURAL NETWORKS AND SEARCH

      
Application Number EP2023075931
Publication Number 2024/061963
Status In Force
Filing Date 2023-09-20
Publication Date 2024-03-28
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Irving, Geoffrey
  • Glaese, Amelia Marita Claudia
  • Mcaleese-Park, Nathaniel John
  • Hendricks, Lisa Anne Marie

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enabling a user to conduct a dialogue. Implementations of the system learn when to rely on supporting evidence, obtained from an external search system via a search system interface, and are also able to generate replies for the user that align with the preferences of a previously trained response selection neural network. Implementations of the system can also use a previously trained rule violation detection neural network to generate replies that take account of previously learnt rules.

IPC Classes  ?

  • G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
  • G06N 3/092 - Reinforcement learning
  • G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
  • G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
  • G06N 3/084 - Backpropagation, e.g. using gradient descent
  • G06N 3/094 - Adversarial learning
  • G06N 5/01 - Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound

15.

CONTROLLING INDUSTRIAL FACILITIES USING HIERARCHICAL REINFORCEMENT LEARNING

      
Application Number EP2023075295
Publication Number 2024/056800
Status In Force
Filing Date 2023-09-14
Publication Date 2024-03-21
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Wong, William
  • Dutta, Praneet
  • Luo, Jerry Jiayu

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling a facility through hierarchical reinforcement learning. In particular, the facility is controlled using a high-level controller neural network that makes high-level decisions and a low-level controller neural network that makes low-level controller decisions.

IPC Classes  ?

  • G05B 13/02 - Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric

16.

DATA-EFFICIENT REINFORCEMENT LEARNING WITH ADAPTIVE RETURN COMPUTATION SCHEMES

      
Application Number EP2023075512
Publication Number 2024/056891
Status In Force
Filing Date 2023-09-15
Publication Date 2024-03-21
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Jiang, Ray
  • Puigdomènech Badia, Adrià
  • Campos Camúñez, Víctor
  • Kapturowski, Steven James
  • Rakicevic, Nemanja

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for data-efficient reinforcement learning with adaptive return computation schemes.

IPC Classes  ?

  • G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
  • G06N 3/045 - Combinations of networks
  • G06N 3/092 - Reinforcement learning
  • G06N 3/096 - Transfer learning
  • G06N 3/0464 - Convolutional networks [CNN, ConvNet]
  • G06N 3/084 - Backpropagation, e.g. using gradient descent
  • G06N 3/0985 - Hyperparameter optimisation; Meta-learning; Learning-to-learn

17.

TRAINING POLICY NEURAL NETWORKS IN SIMULATION USING SCENE SYNTHESIS MACHINE LEARNING MODELS

      
Application Number EP2023075514
Publication Number 2024/056892
Status In Force
Filing Date 2023-09-15
Publication Date 2024-03-21
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Byravan, Arunkumar
  • Humplik, Jan
  • Hasenclever, Leonard
  • Brussee, Arthur Karl
  • Nori, Francesco

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a policy neural network for use in controlling a robot. In particular, the policy neural network can be trained in simulation using images generated by a scene synthesis machine learning model.

IPC Classes  ?

  • G06N 3/0895 - Weakly supervised learning, e.g. semi-supervised or self-supervised learning
  • G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
  • G06N 3/092 - Reinforcement learning
  • G06N 3/096 - Transfer learning
  • G06N 3/045 - Combinations of networks

18.

ACTION ABSTRACTION CONTROLLER FOR FULLY ACTUATED ROBOTIC MANIPULATORS

      
Application Number EP2023067028
Publication Number 2024/051978
Status In Force
Filing Date 2023-06-22
Publication Date 2024-03-14
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Chen, Jose Enrique
  • Laurens, Antoine Marin Alix
  • Romano, Francesco
  • Scholz, Jonathan Karl
  • Fernandes Martins, Murilo
  • Nori, Francesco

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling a robot manipulator that has a plurality of joints. One of the methods includes obtaining a control input that comprises one or more velocity values that specify a target velocity of a reference point in a given coordinate frame; determining a respective joint velocity for each of the plurality of joints by generating a solution to an optimization problem formulated from the control input; and controlling the robot manipulator, including causing the plurality of joints of the robot manipulator to move in accordance with the respective joint velocities to approximate the control input.

IPC Classes  ?

  • B25J 9/16 - Programme controls
  • G05B 19/427 - Teaching successive positions by tracking the position of a joystick or handle to control the positioning servo of the tool head, master-slave control

19.

CONTROLLING AGENTS USING AMBIGUITY-SENSITIVE NEURAL NETWORKS AND RISK-SENSITIVE NEURAL NETWORKS

      
Application Number EP2023074759
Publication Number 2024/052544
Status In Force
Filing Date 2023-09-08
Publication Date 2024-03-14
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Grau Moya, Jordi
  • Delétang, Grégoire
  • Kunesch, Markus
  • Ortega Caballero, Pedro Alejandro

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using an action selection system that is risk-sensitive, ambiguity-sensitive, or both.

IPC Classes  ?

  • G06N 3/044 - Recurrent networks, e.g. Hopfield networks
  • G06N 3/045 - Combinations of networks
  • G06N 3/092 - Reinforcement learning
  • G06N 3/0985 - Hyperparameter optimisation; Meta-learning; Learning-to-learn
  • G06N 3/0464 - Convolutional networks [CNN, ConvNet]
  • G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
  • G06N 7/01 - Probabilistic graphical models, e.g. probabilistic networks

20.

SELECTION-INFERENCE NEURAL NETWORK SYSTEMS

      
Application Number EP2023073796
Publication Number 2024/047108
Status In Force
Filing Date 2023-08-30
Publication Date 2024-03-07
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Creswell, Antonia Phoebe Nina
  • Shanahan, Murray

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a response to a query input using a selection-inference neural network.

IPC Classes  ?

  • G06N 5/046 - Forward inferencing; Production systems
  • G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
  • G06N 5/025 - Extracting rules from data
  • G06N 5/04 - Inference or reasoning models
  • G06N 5/045 - Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence

21.

DETERMINING FAILURE CASES IN TRAINED NEURAL NETWORKS USING GENERATIVE NEURAL NETWORKS

      
Application Number EP2023072617
Publication Number 2024/038114
Status In Force
Filing Date 2023-08-16
Publication Date 2024-02-22
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Gowal, Sven Adrian
  • Wiles, Olivia Anne
  • Carneiro De Albuquerque, Isabela Maria

Abstract

Methods, systems, and computer readable storage media for performing operations comprising: obtaining a plurality of initial network inputs that have been classified as belonging to a corresponding ground truth class; processing each of the plurality of initial network inputs using a trained target neural network to generate a respective predicted network output for each initial network input, the respective predicted network output comprising a respective score for each of a plurality of classes, the plurality of classes comprising the ground truth class; identifying, based on the respective predicted network outputs and the ground truth class, a subset of the initial network inputs as having been misclassified by the trained target neural network; and determining, based on the subset of initial network inputs, one or more failure case latent representations, wherein each failure case latent representation is a latent representation that characterizes network inputs that belong to the ground truth class but that are likely to be misclassified by the trained target neural network.

IPC Classes  ?

  • G06N 5/045 - Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
  • G06N 3/045 - Combinations of networks
  • G06N 3/0475 - Generative networks
  • G06N 3/09 - Supervised learning

22.

AUTOMATED DISCOVERY OF AGENTS IN SYSTEMS

      
Application Number EP2023071987
Publication Number 2024/033387
Status In Force
Filing Date 2023-08-08
Publication Date 2024-02-15
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Jebreel, Zachary Alex Kenton
  • Kumar, Ramana
  • Richens, Jonathan George
  • Everitt, Tom Åke Helmer
  • Farquhar, Aiken Sebastian
  • Macdermott, Matthew Joseph Tilley

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for identifying agents in a system. According to one aspect, a method comprises: generating data defining a causal model of the system, comprising transmitting instructions to cause a plurality of interventions to be applied to the system, wherein each intervention modifies one or more variable elements in the system; processing the model of the system to identify one or more of the variable elements in the system as being decision elements, wherein each decision element represents an action selected by a respective agent in the system; and identifying one or more agents in the system based on the decision elements; and outputting data that identifies the agents in the system.

IPC Classes  ?

  • G06N 3/092 - Reinforcement learning
  • G06N 5/045 - Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
  • G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks

23.

FINDING A STATIONARY POINT OF A LOSS FUNCTION BY AN ITERATIVE ALGORITHM USING A VARIABLE LEARNING RATE VALUE

      
Application Number EP2023072108
Publication Number 2024/033445
Status In Force
Filing Date 2023-08-09
Publication Date 2024-02-15
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Rosca, Mihaela
  • Dherin, Benoit Richard Umbert
  • Wu, Yan
  • Qin, Chongli

Abstract

A computer-implemented method for determining, for a loss function which is a function of a parameter vector comprising a plurality of parameters, values for the parameters for which the parameter vector is a stationary point of the loss function. The method comprises determining initial values for the parameters; and repeatedly updating the parameters by: (a) determining at least one drift value indicative of discretization drift for a discrete update to the parameters based on the loss function; (b) determining at least one learning rate value by evaluating a learning rate function based on, and having an inverse relationship with, the at least one drift value; (c) determining respective updates to the parameters based upon a product of the at least one learning rate value and a gradient of the loss function with respect to the respective parameter for current values of the parameters; and (d) updating the parameters based upon the determined respective updates.

IPC Classes  ?

24.

OPTIMIZING ALGORITHMS FOR TARGET PROCESSORS USING REPRESENTATION NEURAL NETWORKS

      
Application Number EP2023070308
Publication Number 2024/018065
Status In Force
Filing Date 2023-07-21
Publication Date 2024-01-25
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Michi, Andrea
  • Mankowitz, Daniel J.
  • Zhernov, Anton
  • Gelmi, Marco Oreste
  • Selvi, Marco
  • Paduraru, Cosmin
  • Leurent, Edouard
  • Mandhane, Amol Balkishan
  • Iqbal, Shariq Nadeem
  • Silver, David
  • Riedmiller, Martin
  • Kohli, Pushmeet
  • Vinyals, Oriol

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for optimizing a target algorithm using a state representation neural network.

IPC Classes  ?

25.

MODEL-FREE REINFORCEMENT LEARNING WITH REGULARIZED NASH DYNAMICS

      
Application Number EP2023067491
Publication Number 2024/003058
Status In Force
Filing Date 2023-06-27
Publication Date 2024-01-04
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Perolat, Julien
  • De Vylder, Bart
  • Tuyls, Karl Paul

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a policy neural network that is used to control an agent. In particular, the policy neural network can be trained through model-free reinforcement learning with regularized Nash dynamics.

IPC Classes  ?

26.

SIMULATING INDUSTRIAL FACILITIES FOR CONTROL

      
Application Number EP2023067148
Publication Number 2023/247767
Status In Force
Filing Date 2023-06-23
Publication Date 2023-12-28
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Dutta, Praneet
  • Chervonyi, Iurii
  • Voicu, Octavian
  • Luo, Jerry Jiayu
  • Trochim, Piotr

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for simulating industrial facilities for control. One of the methods includes. at each of a plurality of time steps during a task episode: receiving, from a computer simulator of an industrial facility, measurements representing a current state of the facility; generating, from the measurements, an observation; providing the observation as input to a control policy for controlling the facility; receiving, as output, an action for controlling one or more setpoints of the facility; generating, from the action, one or more control inputs for the one or more setpoints of the facility; and providing, as input to the simulator, (i) the control inputs and (ii) current values for one or more configuration parameters of the simulator to cause the simulator to generate, as output, new measurements representing a new state of the facility.

IPC Classes  ?

  • G05B 13/02 - Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
  • G05B 19/418 - Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control (DNC), flexible manufacturing systems (FMS), integrated manufacturing systems (IMS), computer integrated manufacturing (CIM)

27.

Simulating Physical Environments with Discontinuous Dynamics Using Graph Neural Networks

      
Application Number EP2023066187
Publication Number 2023/242378
Status In Force
Filing Date 2023-06-15
Publication Date 2023-12-21
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Allen, Kelsey Rebecca
  • Lopez Guevara, Tatiana
  • Pfaff, Tobias
  • Sanchez, Alvaro
  • Rubanova, Yulia
  • Stachenfeld, Kimberly
  • Battaglia, Peter William

Abstract

This specification describes a simulation system that performs simulations of physical environments using a graph neural network. At each of one or more time steps in a sequence of time steps in a given time interval, the system can process a representation of a current state of the physical environment at the current time step using the graph neural network to generate a prediction of a next state of the physical environment at the next time step. Generally, the environment has discontinuous dynamics at one or more time points during the time interval.

IPC Classes  ?

  • G06F 30/20 - Design optimisation, verification or simulation
  • G06F 30/27 - Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
  • G06F 119/12 - Timing analysis or timing optimisation

28.

TRAINING CAMERA POLICY NEURAL NETWORKS THROUGH SELF PREDICTION

      
Application Number EP2023066186
Publication Number 2023/242377
Status In Force
Filing Date 2023-06-15
Publication Date 2023-12-21
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Grimes, Matthew Koichi
  • Mirowski, Piotr Wojciech
  • Modayil, Joseph Varughese

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a camera policy neural network.

IPC Classes  ?

  • G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
  • G06N 3/045 - Combinations of networks
  • G06N 3/0895 - Weakly supervised learning, e.g. semi-supervised or self-supervised learning
  • G06N 3/092 - Reinforcement learning

29.

HIERARCHICAL REINFORCEMENT LEARNING AT SCALE

      
Application Number EP2023065305
Publication Number 2023/237635
Status In Force
Filing Date 2023-06-07
Publication Date 2023-12-14
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Soyer, Hubert Josef
  • Behbahani, Feryal
  • Keck, Thomas Albert
  • Nikiforou, Kyriacos
  • Pires, Bernardo Avila
  • Baveja, Satinder Singh

Abstract

The invention describes a system and a method for controlling an agent interacting with an environment to perform a task, the method comprising, at each of a plurality of first time steps from a plurality of time steps: receiving an observation characterizing a state of the environment at the first time step; determining a goal representation for the first time step that characterizes a goal state of the environment to be reached by the agent; processing the observation and the goal representation using a low-level controller neural network to generate a low-level policy output that defines an action to be performed by the agent in response to the observation, wherein the low-level controller neural network comprises: a representation neural network configured to process the observation to generate an internal state representation of the observation, and a low-level policy head configured to process the state observation representation and the goal representation to generate the low-level policy output; and controlling the agent using the low-level policy output.

IPC Classes  ?

  • G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
  • G06N 3/045 - Combinations of networks
  • G06N 3/092 - Reinforcement learning
  • G06N 7/01 - Probabilistic graphical models, e.g. probabilistic networks
  • G06N 3/044 - Recurrent networks, e.g. Hopfield networks

30.

REINFORCEMENT LEARNING TO EXPLORE ENVIRONMENTS USING META POLICIES

      
Application Number EP2023065306
Publication Number 2023/237636
Status In Force
Filing Date 2023-06-07
Publication Date 2023-12-14
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Zintgraf, Luisa Maria
  • Magalhaes Marinho, Zita Alexandra
  • Kemaev, Iurii
  • Kirsch, Louis Michel
  • Oh, Junhyuk
  • Schaul, Tom

Abstract

The invention describes the method performed by one or more computers and for training a base policy neural network that is configured to receive a base policy input comprising an observation of a state of an environment and to process the policy input to generate a base policy output that defines an action to be performed by an agent in response to the observation, the method comprising: generating training data for training the base policy neural network by controlling an agent using (i) the base policy neural network and (ii) an exploration strategy that maps, in accordance with a set of one or more parameters, base policy outputs generated by the base policy neural network to actions performed by the agent to interact with an environment, the generating comprising, at each of a plurality of time points: determining that criteria for updating the exploration strategy are satisfied at the time point; and in response to determining that the criteria are satisfied: generating a meta policy input that comprises data characterizing a performance of the base policy neural network in controlling the agent at the time point; processing the meta policy input using a meta policy to generate a meta policy output that specifies respective values for each of the set of one or more parameters that define the exploration strategy; and controlling the agent using the base policy neural network and in accordance with the exploration strategy defined by the respective values for the set of one or more parameters specified by the meta policy output.

IPC Classes  ?

  • G06N 3/008 - Artificial life, i.e. computing arrangements simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. based on robots replicating pets or humans in their appearance or behaviour
  • G06N 3/045 - Combinations of networks
  • G06N 3/092 - Reinforcement learning
  • G06N 3/0985 - Hyperparameter optimisation; Meta-learning; Learning-to-learn

31.

SIMULATING PHYSICAL ENVIRONMENTS USING FINE-RESOLUTION AND COARSE-RESOLUTION MESHES

      
Application Number EP2023063755
Publication Number 2023/227586
Status In Force
Filing Date 2023-05-23
Publication Date 2023-11-30
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Fortunato, Meire
  • Pfaff, Tobias
  • Wirnsberger, Peter
  • Pritzel, Alexander
  • Battaglia, Peter William

Abstract

47 ABSTRACT Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for simulating a state of a physical environment. In one aspect, a method performed by one or more computers for simulating the state of the physical environment is provided. The method includes, for each of multiple time steps: obtaining data defining a fine-resolution mesh and a coarse-resolution mesh that each characterize the state of the physical environment at the current time step, where the fine-resolution mesh has a higher resolution than the coarse-resolution mesh; processing data defining the fine- resolution mesh and the coarse-resolution mesh using a graph neural network that includes: (i) one or more fine-resolution update blocks, (ii) one or more coarse-resolution update blocks, and (iii) one or more up-sampling update blocks; and determining the state of the physical environment at a next time step using updated node embeddings for nodes in the fine-resolution mesh. DeepMind Technologies Limited F&R Ref.: 45288-0255WO1 PCT Application

IPC Classes  ?

  • G06F 30/23 - Design optimisation, verification or simulation using finite element methods [FEM] or finite difference methods [FDM]
  • G06F 30/27 - Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
  • G06N 3/02 - Neural networks
  • G06T 17/20 - Wire-frame description, e.g. polygonalisation or tessellation
  • G06F 111/10 - Numerical modelling
  • G06F 113/08 - Fluids

32.

EXPLORATION BY BOOTSTEPPED PREDICTION

      
Application Number EP2023063282
Publication Number 2023/222772
Status In Force
Filing Date 2023-05-17
Publication Date 2023-11-23
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Guo, Zhaohan
  • Altché, Florent
  • Tallec, Corentin
  • Pires, Bernardo Avila
  • Pîslar, Miruna
  • Thakoor, Shantanu Yogeshraj
  • Azar, Mohammad Gheshlaghi
  • Piot, Bilal

Abstract

An iterative method is proposed to train an action selection system of a reinforcement learning system, based on a reward function which defines a reward value for each action. The reward value includes an intrinsic reward term generated based on the outputs of two encoder models: an online encoder model and a target encoder model. The online encoder model is iteratively trained based on a loss function, and the target encoder model is updated to bring it closer to the online encoder model.

IPC Classes  ?

33.

MACHINE LEARNING SYSTEMS WITH COUNTERFACTUAL INTERVENTIONS

      
Application Number EP2023063488
Publication Number 2023/222884
Status In Force
Filing Date 2023-05-19
Publication Date 2023-11-23
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Rabinowitz, Neil Charles
  • Roy, Nicholas Andrew
  • Kim, Junkyung

Abstract

Systems, methods, and computer programs, for training and using a machine learning system to control an agent to perform a task. The machine learning system is trained using counterfactual internal states so that it can provide an output that explains the behavior of the system in causal terms, e.g. in terms of aspects of its environment that cause the system to select particular actions for the agent.

IPC Classes  ?

  • G06N 3/008 - Artificial life, i.e. computing arrangements simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. based on robots replicating pets or humans in their appearance or behaviour
  • G06N 3/044 - Recurrent networks, e.g. Hopfield networks
  • G06N 3/045 - Combinations of networks
  • G06N 3/084 - Backpropagation, e.g. using gradient descent

34.

LARGE-SCALE RETRIEVAL AUGMENTED REINFORCEMENT LEARNING

      
Application Number EP2023063492
Publication Number 2023/222885
Status In Force
Filing Date 2023-05-19
Publication Date 2023-11-23
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Humphreys, Peter Conway
  • Guez, Arthur Clement

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling a reinforcement learning agent in an environment to perform a task. In one aspect, a method comprises: maintaining a retrieval dataset that stores a plurality of history observations and, for each history observation, a respective associated context; receiving a current observation characterizing a current state of the environment; selecting one or more history observations from the plurality of history observations; processing, using an encoder neural network and in accordance with current values of encoder network parameters, an encoder network input comprising (i) the current observation and (ii) the one or more selected history observations and their respective associated context to generate a latent state representation for the current state of the environment; and using the latent state representation to determine an action to be performed by the agent in response to the current observation.

IPC Classes  ?

  • G06F 16/908 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
  • G06N 3/02 - Neural networks
  • G06N 20/00 - Machine learning

35.

CONTRASTIVE LEARNING USING POSITIVE PSEUDO LABELS

      
Application Number EP2023063496
Publication Number 2023/222889
Status In Force
Filing Date 2023-05-19
Publication Date 2023-11-23
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Mitrovic, Jovana
  • Bosnjak, Matko
  • Richemond, Pierre
  • Tomasev, Nenad
  • Strub, Florian
  • Walker, Jacob Charles
  • Hill, Felix George
  • Buesing, Lars
  • Pascanu, Razvan
  • Blundell, Charles

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network to perform a machine learning task on one or more received inputs by using a hybrid training dataset with a semi-supervised learning technique. The hybrid training dataset includes multiple unlabeled training inputs and multiple labeled training inputs and, in some cases, more unlabeled training inputs than labeled training inputs.

IPC Classes  ?

  • G06N 3/045 - Combinations of networks
  • G06N 3/0464 - Convolutional networks [CNN, ConvNet]
  • G06N 3/084 - Backpropagation, e.g. using gradient descent
  • G06N 3/0895 - Weakly supervised learning, e.g. semi-supervised or self-supervised learning

36.

RESOURCE NAVIGATION USING NEURAL NETWORKS

      
Application Number EP2023063486
Publication Number 2023/222882
Status In Force
Filing Date 2023-05-19
Publication Date 2023-11-23
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Marino, Kenneth Daniel
  • Zaheer, Manzil
  • Fergus, Robert David
  • Grathwohl, Will S.

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for resource navigation using neural networks.

IPC Classes  ?

37.

DETERMINING GENERALIZED EIGENVECTORS USING MULTI-AGENT INTERACTIONS

      
Application Number EP2023063487
Publication Number 2023/222883
Status In Force
Filing Date 2023-05-19
Publication Date 2023-11-23
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Gemp, Ian Michael
  • Mcwilliams, Brian
  • Chen, Charlie Xiangyu

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining generalized eigenvectors that characterize a data set.

IPC Classes  ?

38.

INTRA-AGENT SPEECH TO FACILITATE TASK LEARNING

      
Application Number EP2023063494
Publication Number 2023/222887
Status In Force
Filing Date 2023-05-19
Publication Date 2023-11-23
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Yan, Chen
  • Carnevale, Federico Javier
  • Georgiev, Petko Ivanov
  • Santoro, Adam Anthony
  • Guy, Aurelia Adrianna
  • Muldal, Alistair Michael
  • Hung, Chia-Chun
  • Abramson, Joshua Simon
  • Lillicrap, Timothy Paul
  • Wayne, Gregory Duncan

Abstract

Systems, methods, and computer programs for learning to control an embodied agent to perform tasks. The techniques use internal, "intra-agent" speech when learning, and are thus able to perform tasks involving new objects without any direct experience of interacting with those objects, i.e. zero-shot. Implementations of the techniques use an image captioning neural network system to generate natural language captions used when training an action selection neural network system.

IPC Classes  ?

  • G06N 3/008 - Artificial life, i.e. computing arrangements simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. based on robots replicating pets or humans in their appearance or behaviour
  • G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
  • G06N 3/044 - Recurrent networks, e.g. Hopfield networks
  • G06N 5/045 - Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
  • G06N 3/096 - Transfer learning

39.

VARIABLE RESOLUTION VARIABLE FRAME RATE VIDEO CODING USING NEURAL NETWORKS

      
Application Number EP2023062431
Publication Number 2023/217867
Status In Force
Filing Date 2023-05-10
Publication Date 2023-11-16
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Assael, Ioannis Alexandros
  • Shillingford, Brendan

Abstract

Systems and methods for encoding video, and for decoding video at an arbitrary temporal and/or spatial resolution. The techniques use a scene representation neural network that, in implementations, is configured to represent frames of a 2D or 3D video as a 3D model encoded in the parameters of the neural network.

IPC Classes  ?

  • G06N 3/04 - Architecture, e.g. interconnection topology
  • H04N 19/31 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the temporal domain
  • H04N 19/33 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the spatial domain

40.

NEGOTIATING CONTRACTS FOR AGENT COOPERATION IN MULTI-AGENT SYSTEMS

      
Application Number EP2023062432
Publication Number 2023/217868
Status In Force
Filing Date 2023-05-10
Publication Date 2023-11-16
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Bachrach, Yoram
  • Tacchetti, Andrea
  • Gemp, Ian Michael
  • Kramár, János
  • Malinowski, Mateusz
  • Mckee, Kevin Robert

Abstract

Methods, systems and apparatus, including computer programs encoded on computer storage media, for enabling agents to cooperate with one another in a way that improves their collective efficiency. The agents can modify their behavior by taking into account the behavior of other agents, so that a better overall result can be achieved than if each agent acted independently. This is done by enabling the agents to negotiate contracts with one another that restrict their respective actions.

IPC Classes  ?

  • G06Q 10/04 - Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
  • G06N 3/02 - Neural networks

41.

SELECTION-INFERENCE NEURAL NETWORK SYSTEMS

      
Application Number EP2023062781
Publication Number 2023/218040
Status In Force
Filing Date 2023-05-12
Publication Date 2023-11-16
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor Creswell, Antonia Phoebe Nina

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a response to a query input using a selection- inference neural network.

IPC Classes  ?

  • G06N 3/045 - Combinations of networks
  • G06F 40/20 - Natural language analysis
  • G06N 5/02 - Knowledge representation; Symbolic representation
  • G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
  • G06F 40/35 - Discourse or dialogue representation
  • G06N 5/045 - Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
  • G06N 3/096 - Transfer learning
  • G06N 5/046 - Forward inferencing; Production systems

42.

DATA COMPRESSION AND RECONSTRUCTION USING SPARSE META-LEARNED NEURAL NETWORKS

      
Application Number EP2023061711
Publication Number 2023/213903
Status In Force
Filing Date 2023-05-03
Publication Date 2023-11-09
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor Schwarz, Jonathan

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for compressing and decompressing data signals using sparse, meta-learned neural networks.

IPC Classes  ?

  • G06N 3/045 - Combinations of networks
  • G06N 3/0495 - Quantised networks; Sparse networks; Compressed networks
  • G06N 3/082 - Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
  • G06N 3/084 - Backpropagation, e.g. using gradient descent
  • G06N 3/0985 - Hyperparameter optimisation; Meta-learning; Learning-to-learn

43.

PRIVACY-SENSITIVE NEURAL NETWORK TRAINING USING DATA AUGMENTATION

      
Application Number EP2023061322
Publication Number 2023/209192
Status In Force
Filing Date 2023-04-28
Publication Date 2023-11-02
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • De, Soham
  • De Balle Pigem, Borja
  • Hayes, Jamie
  • Smith, Samuel Laurence
  • Berrada Lancrey Javal, Leonard Alix Jean Eric

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for privacy-sensitive training of a neural network. In one aspect, a method includes training a set of neural network parameters of the neural network on a set of training data over multiple training iterations to optimize an objective function. Each training iteration includes: sampling a batch of network inputs from the set of training data; determining a clipped gradient for each network input in the batch of network inputs; and updating the neural network parameters using the clipped gradients for the network inputs in the batch of network inputs.

IPC Classes  ?

  • G06N 3/084 - Backpropagation, e.g. using gradient descent

44.

LANGUAGE MODEL FOR PROCESSING A MULTI-MODE QUERY INPUT

      
Application Number EP2023061330
Publication Number 2023/209198
Status In Force
Filing Date 2023-04-28
Publication Date 2023-11-02
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Alayrac, Jean-Baptiste
  • Donahue, Jeffrey
  • Lenc, Karel
  • Simonyan, Karen
  • Reynolds, Malcolm Kevin Campbell
  • Luc, Pauline
  • Mensch, Arthur
  • Barr, Iain
  • Miech, Antoine
  • Hasson, Yana Elizabeth
  • Millican, Katherine Elizabeth
  • Ring, Roman

Abstract

A query processing system is described which receives a query input comprising an input token string and also at least one data item having a second, different modality, and generates a corresponding output token string.

IPC Classes  ?

45.

EPISTEMIC MACHINE LEARNING MODELS

      
Application Number EP2023059619
Publication Number 2023/198807
Status In Force
Filing Date 2023-04-13
Publication Date 2023-10-19
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Osband, Ian David Moffat
  • Wen, Zheng
  • Asgharipari, Seyedmohammad
  • Dwaracherla, Vikranth Reddy
  • Lu, Xiuyuan
  • Van Roy, Benjamin

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing inputs using an epistemic machine learning model that improves the quality of outputs generated by a base machine learning model.

IPC Classes  ?

46.

ALLOCATING COMPUTING RESOURCES BETWEEN MODEL SIZE AND TRAINING DATA DURING TRAINING OF A MACHINE LEARNING MODEL

      
Application Number EP2023058150
Publication Number 2023/186987
Status In Force
Filing Date 2023-03-29
Publication Date 2023-10-05
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Hoffmann, Jordan
  • Borgeaud Dit Avocat, Sebastian
  • Mensch, Arthur
  • Sifre, Laurent

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a machine learning model to perform a machine learning task. In one aspect, a method performed by one or more computer is described. The method includes: obtaining data defining a compute budget that characterizes an amount of computing resources allocated for training a machine learning model to perform a machine learning task; processing the data defining the compute budget using an allocation mapping, in accordance with a set of allocation mapping parameters, to generate an allocation tuple defining: (i) a target model size for the machine learning model, and (ii) a target amount of training data for training the machine learning model; instantiating the machine learning model, where the machine learning model has the target model size; and obtaining the target amount of training data for training the machine learning model.

IPC Classes  ?

  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]
  • G06N 20/00 - Machine learning

47.

ENHANCING IMAGES FROM A MOBILE DEVICE TO GIVE A PROFESSIONAL CAMERA EFFECT

      
Application Number EP2023054669
Publication Number 2023/186417
Status In Force
Filing Date 2023-02-24
Publication Date 2023-10-05
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Assael, Ioannis Alexandros
  • Shillingford, Brendan

Abstract

Systems and methods for processing an image from a mobile device so that it appears to have been captured by a camera with particular characteristics, for example a digital SLR camera with particular settings. The system uses a trained image enhancement neural network. The image enhancement neural network can be trained without needing to rely on pairs of images of the same scene; some training methods are described.

IPC Classes  ?

48.

CONTROLLING ROBOTS USING LATENT ACTION VECTOR CONDITIONED CONTROLLER NEURAL NETWORKS

      
Application Number EP2023057855
Publication Number 2023/180585
Status In Force
Filing Date 2023-03-27
Publication Date 2023-09-28
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Bohez, Steven
  • Tunyasuvunakool, Saran

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using a hierarchical controller that includes a task policy neural network and a low-level controller neural network.

IPC Classes  ?

49.

GENERATING OUTPUT SEQUENCES WITH INLINE EVIDENCE USING LANGUAGE MODEL NEURAL NETWORKS

      
Application Number EP2023056778
Publication Number 2023/175089
Status In Force
Filing Date 2023-03-16
Publication Date 2023-09-21
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Menick, Jacob Lee
  • Mikulik, Vladmir
  • Trebacz, Maja Maria
  • Mcaleese-Park, Nathaniel John
  • Irving, Geoffrey

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating output sequences using language model neural networks. In particular, the output sequences include a response to an input query and inline evidence that includes a quote from a context document that supports the response.

IPC Classes  ?

50.

TRAINING OBJECT DISCOVERY NEURAL NETWORKS AND FEATURE REPRESENTATION NEURAL NETWORKS USING SELF-SUPERVISED LEARNING

      
Application Number EP2022056866
Publication Number 2023/169696
Status In Force
Filing Date 2022-03-16
Publication Date 2023-09-14
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Zisserman, Andrew
  • Zoran, Daniel
  • Jaegle, Andrew Coulter
  • Shelhamer, Evan Gerard
  • Carreira, Joao
  • Hénaff, Olivier Jean
  • Arandjelovic, Relja
  • Koppula, Skanda Kumar

Abstract

A neural network system that is configured to learn a representation of data item, such as an image, audio, or text data item, through a self-supervised learning process. Implementations of the system couple two learning processes, an object discovery learning process and an object feature representation learning process. In implementations the object discovery learning process assists the object feature representation learning process in self-supervised learning of object feature representations, and the object feature representation learning process is used to improve the object discovery learning process.

IPC Classes  ?

51.

GENERATING COMPRESSED REPRESENTATIONS OF VIDEO FOR EFFICIENT LEARNING OF VIDEO TASKS

      
Application Number EP2023055757
Publication Number 2023/170069
Status In Force
Filing Date 2023-03-07
Publication Date 2023-09-14
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Wiles, Olivia Anne
  • Malinowski, Mateusz
  • Zisserman, Andrew
  • Barr, Iain
  • Carreira, Joao

Abstract

A method is proposed to train an adaptive system to perform a video processing task, based on a database of compressed representations of video data items. The compressed representations were generated by a trained adaptive compressor unit.

IPC Classes  ?

  • H04N 19/146 - Data rate or code amount at the encoder output
  • G06N 3/045 - Combinations of networks
  • G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
  • G06N 3/0464 - Convolutional networks [CNN, ConvNet]
  • G06N 3/08 - Learning methods
  • G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/177 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]

52.

TRACKING QUERY POINTS IN VIDEOS USING NEURAL NETWORKS

      
Application Number EP2023055758
Publication Number 2023/170070
Status In Force
Filing Date 2023-03-07
Publication Date 2023-09-14
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Doersch, Carl
  • Gupta, Ankush
  • Markeeva, Larisa
  • Greff, Klaus
  • Tagliasacchi, Andrea
  • Recasens Continente, Adrià
  • Aytar, Yusuf
  • Carreira, Joao
  • Zisserman, Andrew
  • Yang, Yi

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for tracking query points in videos using a point tracking neural network.

IPC Classes  ?

  • G06V 10/44 - Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
  • G06N 3/045 - Combinations of networks
  • G06N 3/0464 - Convolutional networks [CNN, ConvNet]
  • G06N 3/084 - Backpropagation, e.g. using gradient descent
  • G06T 7/246 - Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
  • G06V 10/50 - Extraction of image or video features by summing image-intensity values; Projection analysis
  • G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
  • G06V 20/40 - Scenes; Scene-specific elements in video content

53.

PROCESSING NETWORK INPUTS USING PARTITIONED ATTENTION

      
Application Number EP2023055753
Publication Number 2023/170067
Status In Force
Filing Date 2023-03-07
Publication Date 2023-09-14
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Recasens Continente, Adrià
  • Lin, Jason Jiachen
  • Wang, Luyu
  • Alayrac, Jean-Baptiste
  • Jaegle, Andrew Coulter
  • Carreira, Joao
  • Luc, Pauline
  • Miech, Antoine
  • De Freitas Smaira, Lucas
  • Hemsley, Ross
  • Zisserman, Andrew

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing network inputs using a neural network that implements partitioned attention.

IPC Classes  ?

  • G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
  • G06N 3/096 - Transfer learning
  • G06N 3/0895 - Weakly supervised learning, e.g. semi-supervised or self-supervised learning

54.

AGENT CONTROL THROUGH CULTURAL TRANSMISSION

      
Application Number EP2023055474
Publication Number 2023/166195
Status In Force
Filing Date 2023-03-03
Publication Date 2023-09-07
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Bhoopchand, Avishkar Ajay
  • Collister, Adrian Ashley
  • Edwards, Ashley Deloris
  • Everett, Richard
  • Hughes, Edward Fauchon
  • Mathewson, Kory Wallace
  • Pîslar, Miruna
  • Zacherl, Alexander
  • Zhang, Lei

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using a policy neural network that has been trained to allow the agent to achieve cultural transmission after training.

IPC Classes  ?

55.

COMPUTER CODE GENERATION FROM TASK DESCRIPTIONS USING NEURAL NETWORKS

      
Application Number EP2023052595
Publication Number 2023/148286
Status In Force
Filing Date 2023-02-02
Publication Date 2023-08-10
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Li, Yujia
  • Choi, David Hugo
  • Chung, Junyoung
  • Kushman, Nathaniel Arthur
  • Schrittwieser, Julian
  • Leblond, Rémi
  • Eccles, Thomas Edward
  • Keeling, James Thomas
  • Gimeno Gil, Felix Axel
  • Dal Lago, Agustín Matías
  • Hubert, Thomas Keisuke
  • Choy, Peter
  • De Masson D'Autume, Cyprien
  • Sutherland Robson, Esme
  • Vinyals, Oriol

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating computer code using neural networks. One of the methods includes receiving description data describing a computer programming task; receiving a first set of inputs for the computer programming task; generating a plurality of candidate computer programs by sampling a plurality of output sequences from a set of one or more generative neural networks; for each candidate computer program in a subset of the candidate computer programs and for each input in the first set: executing the candidate computer program on the input to generate an output; and selecting, from the candidate computer programs, one or more computer programs as synthesized computer programs for performing the computer programming task based at least in part on the outputs generated by executing the candidate computer programs in the subset on the inputs in the first set of inputs.

IPC Classes  ?

56.

GENERATING SEQUENCES OF DATA ELEMENTS USING CROSS-ATTENTION OPERATIONS

      
Application Number EP2023052026
Publication Number 2023/144316
Status In Force
Filing Date 2023-01-27
Publication Date 2023-08-03
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Hawthorne, Curtis Glenn-Macway
  • Jaegle, Andrew Coulter
  • Cangea, Catalina-Codruta
  • Borgeaud Dit Avocat, Sebastian
  • Nash, Charlie Thomas Curtis
  • Malinowski, Mateusz
  • Dieleman, Sander Etienne Lea
  • Vinyals, Oriol
  • Botvinick, Matthew
  • Simon, Ian Stuart
  • Sheahan, Hannah Rachel
  • Zeghidour, Neil
  • Alayrac, Jean-Baptiste
  • Carreira, Joao
  • Engel, Jesse

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a sequence of data elements that includes a respective data element at each position in a sequence of positions. In one aspect, a method includes: for each position after a first position in the sequence of positions: obtaining a current sequence of data element embeddings that includes a respective data element embedding of each data element at a position that precedes the current position, obtaining a sequence of latent embeddings, and processing: (i) the current sequence of data element embeddings, and (ii) the sequence of latent embeddings, using a neural network to generate the data element at the current position. The neural network includes a sequence of neural network blocks including: (i) a cross-attention block, (ii) one or more self-attention blocks, and (iii) an output block.

IPC Classes  ?

57.

LOCAL CROSS-ATTENTION OPERATIONS IN NEURAL NETWORKS

      
Application Number EP2023052183
Publication Number 2023/144385
Status In Force
Filing Date 2023-01-30
Publication Date 2023-08-03
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Carreira, Joao
  • Jaegle, Andrew Coulter
  • Koppula, Skanda Kumar
  • Zoran, Daniel
  • Recasens Continente, Adrià
  • Ionescu, Catalin-Dumitru
  • Hénaff, Olivier Jean
  • Shelhamer, Evan Gerard
  • Arandjelovic, Relja
  • Botvinick, Matthew
  • Vinyals, Oriol
  • Simonyan, Karen
  • Zisserman, Andrew

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for using a neural network to generate a network output that characterizes an entity. In one aspect, a method includes: obtaining a representation of the entity as a set of data element embeddings, obtaining a set of latent embeddings, and processing: (i) the set of data element embeddings, and (ii) the set of latent embeddings, using the neural network to generate the network output. The neural network includes a sequence of neural network blocks including: (i) one or more local cross-attention blocks, and (ii) an output block. Each local cross-attention block partitions the set of latent embeddings and the set of data element embeddings into proper subsets, and updates each proper subset of the set of latent embeddings using attention over only the corresponding proper subset of the set of data element embeddings.

IPC Classes  ?

  • G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
  • G06N 3/0495 - Quantised networks; Sparse networks; Compressed networks
  • G06N 3/084 - Backpropagation, e.g. using gradient descent

58.

CONTROLLING REINFORCEMENT LEARNING AGENTS USING GEOMETRIC POLICY COMPOSITION

      
Application Number EP2023052205
Publication Number 2023/144395
Status In Force
Filing Date 2023-01-30
Publication Date 2023-08-03
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Rowland, Mark Daniel
  • Thakoor, Shantanu Yogeshraj
  • Da Motta Salles Barreto, Andre
  • Borsa, Diana Luiza
  • Dabney, William Clinton
  • Munos, Remi

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling a reinforcement learning agent in an environment. One of the methods may include maintaining data specifying a base policy set comprising a plurality of base policies for controlling the agent; receiving a current observation characterizing a current state of the environment; generating, for each of the plurality of base policies, one or more predicted future observations characterizing respective future states of the environment that are subsequent to the current state of the environment; using the predicted future observations generated for the plurality of base policies to determine a respective estimated value for each composite policy in a composite policy set with respect to the current state of the environment; and selecting an action using the respective estimated values for the composite policies.

IPC Classes  ?

59.

LEARNED COMPUTER CONTROL USING POINTING DEVICE AND KEYBOARD ACTIONS

      
Application Number EP2023052210
Publication Number 2023/144397
Status In Force
Filing Date 2023-01-30
Publication Date 2023-08-03
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Humphreys, Peter Conway
  • Lillicrap, Timothy Paul
  • Pohlen, Tobias Markus
  • Santoro, Adam Anthony

Abstract

A computer-implemented method for controlling a particular computer to execute a task is described. The method includes receiving a control input comprising a visual input, the visual input including one or more screen frames of a computer display that represent at least a current state of the particular computer; processing the control input using a neural network to generate one or more control outputs that are used to control the particular computer to execute the task, in which the one or more control outputs include an action type output that specifies at least one of a pointing device action or a keyboard action to be performed to control the particular computer; determining one or more actions from the one or more control outputs; and executing the one or more actions to control the particular computer.

IPC Classes  ?

  • G06F 3/01 - Input arrangements or combined input and output arrangements for interaction between user and computer
  • G06F 3/038 - Control and interface arrangements therefor, e.g. drivers or device-embedded control circuitry
  • G06N 3/0442 - Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
  • G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
  • G06N 3/0464 - Convolutional networks [CNN, ConvNet]
  • G06F 40/00 - Handling natural language data
  • G06F 3/048 - Interaction techniques based on graphical user interfaces [GUI]
  • G06F 3/16 - Sound input; Sound output
  • G06N 3/0475 - Generative networks
  • G06N 3/092 - Reinforcement learning

60.

AUGMENTING MACHINE LEARNING LANGUAGE MODELS USING SEARCH ENGINE RESULTS

      
Application Number EP2023052286
Publication Number 2023/144413
Status In Force
Filing Date 2023-01-31
Publication Date 2023-08-03
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Lazaridou, Angeliki
  • Gribovskaya, Elena
  • Grigorev, Nikolai
  • Stokowiec, Wojciech Jan

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for augmenting machine learning language models using search engine results. One of the methods includes obtaining question data representing a question; generating, from the question data, a search engine query for a search engine; obtaining a plurality of documents identified by the search engine in response to processing the search engine query; generating, from the plurality of documents, a plurality of conditioning inputs each representing at least a portion of one or more of the obtained documents; for each of a plurality of the generated conditioning inputs, processing a network input generated from (i) the question data and (ii) the conditioning input using a neural network to generate a network output representing a candidate answer to the question; and generating, from the network outputs representing respective candidate answers, answer data representing a final answer to the question.

IPC Classes  ?

61.

OPTIMIZING DESIGN PARAMETERS USING A SIMULATION NEURAL NETWORK

      
Application Number EP2023052177
Publication Number 2023/144383
Status In Force
Filing Date 2023-01-30
Publication Date 2023-08-03
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Allen, Kelsey Rebecca
  • Lopez Guevara, Tatiana
  • Stachenfeld, Kimberly
  • Hamrick, Jessica Blake Chandler
  • Sanchez, Alvaro
  • Battaglia, Peter William
  • Pfaff, Tobias

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for optimizing a set of design parameters. In one aspect, a method includes: obtaining a respective initial value for each design parameter, and iteratively optimizing current values of the design parameters over a sequence of optimization iterations. The method further includes, each optimization iteration: generating a representation of an initial state of an environment using the current values of the design parameters, processing an input including the representation of the initial state of the environment using a simulation neural network to generate an output that defines a simulation of the state of the environment over a sequence of one or more time steps, determining a reward, determining gradients of the reward with respect to the current values of the design parameters, and updating the current values of the design parameters using the gradients.

IPC Classes  ?

  • G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
  • G06N 3/045 - Combinations of networks
  • G06N 3/084 - Backpropagation, e.g. using gradient descent
  • G06N 3/09 - Supervised learning

62.

GENERATING DATA ITEMS USING OFF-THE-SHELF GUIDED GENERATIVE DIFFUSION PROCESSES

      
Application Number EP2023052186
Publication Number 2023/144386
Status In Force
Filing Date 2023-01-30
Publication Date 2023-08-03
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Durkan, Conor Michael
  • Dieleman, Sander Etienne Lea
  • Binkowski, Mikolaj
  • Shang, Wenling

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a data item using a diffusion neural network. In particular, the data item is generated by guiding a reverse diffusion process using a time-independent guidance neural network.

IPC Classes  ?

63.

LARGE SCALE RETRIEVAL FOR SEQUENCE GENERATION

      
Application Number EP2022084592
Publication Number 2023/104793
Status In Force
Filing Date 2022-12-06
Publication Date 2023-06-15
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Borgeaud Dit Avocat, Sebastian
  • Sifre, Laurent
  • Mensch, Arthur
  • Hoffmann, Jordan

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a final output sequence. In one aspect, a method comprises: receiving a current output sequence comprising one or more current output segments; receiving a set of reference segments and a respective reference segment embedding of each reference segment that has been generated using an embedding neural network; for each current output segment: processing the current output segment using the embedding neural network to generate a current output segment embedding of the current output segment; and selecting k most similar reference segments to the current output segment using the reference segment embeddings and the current output segment embedding; and processing the current output sequence and the k most similar reference segments for each current output segment to generate an additional output segment that follows the current output sequence in the final output sequence.

IPC Classes  ?

64.

TRAINING CONDITIONAL COMPUTATION NEURAL NETWORKS USING REINFORCEMENT LEARNING

      
Application Number EP2022084779
Publication Number 2023/104879
Status In Force
Filing Date 2022-12-07
Publication Date 2023-06-15
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Clark, Aidan
  • Mensch, Arthur

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network having one or more conditional computation layers, where each conditional computation layer includes a gating sub-layer having multiple gating parameters and an expert sub-layer having multiple expert neural networks. In one aspect, a method comprises: sampling a batch of target output sequences that comprises a respective ground truth output token at each of multiple output positions; for each target output sequence, processing the target output sequence using the neural network to generate a network output that includes respective score distributions over the vocabulary of output tokens for the output positions in the target output sequence; and training each gating sub-layer using respective rewards for the gating sub-layer for the output positions through reinforcement learning to optimize a reinforcement learning objective function that measures an expected reward received by the gating sub-layer.

IPC Classes  ?

65.

CONTROLLING INTERACTIVE AGENTS USING MULTI-MODAL INPUTS

      
Application Number EP2022084780
Publication Number 2023/104880
Status In Force
Filing Date 2022-12-07
Publication Date 2023-06-15
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Abramson, Joshua Simon
  • Ahuja, Arun
  • Carnevale, Federico Javier
  • Georgiev, Petko Ivanov
  • Hung, Chia-Chun
  • Lillicrap, Timothy Paul
  • Muldal, Alistair Michael
  • Santoro, Adam Anthony
  • Von Glehn, Tamara Louise
  • Landon, Jessica Paige
  • Wayne, Gregory Duncan
  • Yan, Chen
  • Zhu, Rui

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an interactive agent can be controlled based on multi-modal inputs that include both an observation image and a natural language text sequence.

IPC Classes  ?

  • G06N 3/045 - Combinations of networks
  • G06N 3/0442 - Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
  • G06N 3/092 - Reinforcement learning
  • G06N 3/084 - Backpropagation, e.g. using gradient descent
  • G06N 3/0895 - Weakly supervised learning, e.g. semi-supervised or self-supervised learning
  • G06N 3/008 - Artificial life, i.e. computing arrangements simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. based on robots replicating pets or humans in their appearance or behaviour
  • G05D 1/00 - Control of position, course, altitude, or attitude of land, water, air, or space vehicles, e.g. automatic pilot

66.

DESIGNING PROTEINS BY JOINTLY MODELING SEQUENCE AND STRUCTURE

      
Application Number EP2022082678
Publication Number 2023/094335
Status In Force
Filing Date 2022-11-21
Publication Date 2023-06-01
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Kohl, Simon
  • Jumper, John
  • Senior, Andrew W.
  • Zambaldi, Vinicius
  • Galiazzi Schneider, Rosalia
  • Bates, Russell James
  • Stanton, Gabriella Hayley
  • Fergus, Robert David
  • Ionescu, Catalin-Dumitru
  • La, David

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designing a protein by jointly generating an amino acid sequence and a structure of the protein. In one aspect, a method comprises: generating data defining the amino acid sequence and the structure of the protein using a protein design neural network, comprising, for a plurality of positions in the amino acid sequence: receiving the current representation of the protein as of the current position; processing the current representation of the protein using the protein design neural network to generate design data for the current position that comprises: (i) data identifying an amino acid at the current position, and (ii) a set of structure parameters for the current position; and updating the current representation of the protein using the design data for the current position.

IPC Classes  ?

  • G16B 35/00 - ICT specially adapted for in silico combinatorial libraries of nucleic acids, proteins or peptides

67.

DISTRIBUTED TOP K COMPUTATION

      
Application Number EP2022082276
Publication Number 2023/089030
Status In Force
Filing Date 2022-11-17
Publication Date 2023-05-25
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Elsen, Erich Konrad
  • Abercrombie, Stuart Christopher Benedict

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for distributing a top k computation across multiple computing units of an integrated circuit. One of the methods includes computing, by each of the plurality of computing units and for each candidate vector in a respective subset of the candidate vectors assigned to the computing unit, a respective distance between the query vector and the candidate vector; initializing, by the integrated circuit, a cut-off distance value; determining, by the integrated circuit, a final cut-off distance value; and providing, by the integrated circuit and as an output of a top k computation for the query vector and the set of candidate vectors, the candidate vectors that have respective distances that satisfy the final cut-off distance value.

IPC Classes  ?

  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

68.

COORDINATION OF MULTIPLE ROBOTS USING GRAPH NEURAL NETWORKS

      
Application Number EP2022075656
Publication Number 2023/057185
Status In Force
Filing Date 2022-09-15
Publication Date 2023-04-13
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Lai, Matthew
  • Scholz, Jonathan Karl
  • Chen, Jose Enrique

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling a plurality of robots. One of the methods includes: obtaining state data representing a current state of the environment; generating, from the state data, graph data representing a graph of the current state of the environment; processing the graph data using a graph neural network to generate a graph output that comprises a respective updated feature representation for each of the robot nodes in the graph; and selecting, based on the graph output, a respective action to be performed by each of the robots.

IPC Classes  ?

69.

TRAINING A NEURAL NETWORK TO PREDICT MULTI-CHAIN PROTEIN STRUCTURES

      
Application Number EP2022077595
Publication Number 2023/057455
Status In Force
Filing Date 2022-10-04
Publication Date 2023-04-13
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Evans, Richard Andrew
  • O'Neill, Michael James
  • Pritzel, Alexander
  • Antropova, Natasha Olegovna
  • Green, Timothy Frederick Goldie
  • Jumper, John

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for predicting a structure of a protein that comprises a plurality of amino acid chains using a protein structure prediction neural network, where each chain comprises a respective sequence of amino acids. In one aspect, a method comprises: receiving a network input for the protein structure prediction neural network, wherein the network input characterizes the protein; processing the network input characterizing the protein using the protein structure prediction neural network to generate a network output that characterizes a predicted structure of the protein; and determining the predicted structure of the protein based on the network output.

IPC Classes  ?

70.

HIERARCHICAL LATENT MIXTURE POLICIES FOR AGENT CONTROL

      
Application Number EP2022077694
Publication Number 2023/057511
Status In Force
Filing Date 2022-10-05
Publication Date 2023-04-13
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Rao, Dushyant
  • Sadeghi, Fereshteh
  • Hasenclever, Leonard
  • Wulfmeier, Markus
  • Zambelli, Martina
  • Vezzani, Giulia
  • Tirumala Bukkapatnam, Dhruva
  • Aytar, Yusuf
  • Merel, Joshua
  • Heess, Nicolas Manfred Otto
  • Hadsell, Raia Thais

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using a hierarchical controller that includes a high-level controller neural network, a mid-level controller neural network, and a low-level controller neural network.

IPC Classes  ?

  • G06N 3/092 - Reinforcement learning
  • G06N 3/0464 - Convolutional networks [CNN, ConvNet]
  • G06N 3/096 - Transfer learning
  • G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
  • G06N 3/045 - Combinations of networks

71.

RETRIEVAL AUGMENTED REINFORCEMENT LEARNING

      
Application Number EP2022077696
Publication Number 2023/057512
Status In Force
Filing Date 2022-10-05
Publication Date 2023-04-13
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Goyal, Anirudh
  • Banino, Andrea
  • Friesen, Abram Luke
  • Weber, Theophane Guillaume
  • Badia, Adrià Puigdomènech
  • Ke, Nan
  • Osindero, Simon
  • Lillicrap, Timothy Paul
  • Blundell, Charles

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling a reinforcement learning agent in an environment to perform a task using a retrieval-augmented action selection process. One of the methods includes receiving a current observation characterizing a current state of the environment; processing an encoder network input comprising the current observation to determine a policy neural network hidden state that corresponds to the current observation; maintaining a plurality of trajectories generated as a result of the reinforcement learning agent interacting with the environment; selecting one or more trajectories from the plurality of trajectories; updating the policy neural network hidden state using update data determined from the one or more selected trajectories; and processing the updated hidden state using a policy neural network to generate a policy output that specifies an action to be performed by the agent in response to the current observation.

IPC Classes  ?

  • G06N 3/092 - Reinforcement learning
  • G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
  • G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
  • G06N 3/0442 - Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
  • G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
  • G06N 3/0464 - Convolutional networks [CNN, ConvNet]
  • G06N 3/084 - Backpropagation, e.g. using gradient descent

72.

CONFORMAL TRAINING OF MACHINE-LEARNING MODELS

      
Application Number EP2022077703
Publication Number 2023/057516
Status In Force
Filing Date 2022-10-05
Publication Date 2023-04-13
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Cemgil, Ali Taylan
  • Doucet, Arnaud
  • Dvijotham, Krishnamurthy
  • Stutz, David

Abstract

Methods, computer systems, and apparatus, including computer programs encoded on computer storage media, for training a classification machine-learning model. The system obtains calibration training examples and prediction training examples, determines a threshold value based on the calibration training examples, generates data characterizing predicted confidence sets based on the threshold value and the prediction training examples, and update model parameters based at least on the predicted confidence sets.

IPC Classes  ?

  • G06N 3/084 - Backpropagation, e.g. using gradient descent
  • G06N 3/09 - Supervised learning
  • G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
  • G16H 50/20 - ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems

73.

DEMONSTRATION-DRIVEN REINFORCEMENT LEARNING

      
Application Number EP2022077706
Publication Number 2023/057518
Status In Force
Filing Date 2022-10-05
Publication Date 2023-04-13
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Sushkov, Oleg O.
  • Davchev, Todor Bozhinov
  • Scholz, Jonathan Karl

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a reinforcement learning system to select actions to be performed by an agent interacting with an environment to perform a particular task. In one aspect, one of the methods includes obtaining a training sequence comprising a respective training observations at each of a plurality of time steps; obtaining demonstration data comprising one or more demonstration sequences; generating a new training sequence from the training sequence and the demonstration data; and training the goal-conditioned policy neural network on the new training sequence through reinforcement learning.

IPC Classes  ?

  • G06N 3/08 - Learning methods
  • G06N 3/00 - Computing arrangements based on biological models
  • G06N 3/04 - Architecture, e.g. interconnection topology

74.

STEP-UNROLLED DENOISING NEURAL NETWORKS

      
Application Number EP2022077806
Publication Number 2023/057565
Status In Force
Filing Date 2022-10-06
Publication Date 2023-04-13
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Savinov, Nikolay
  • Chung, Junyoung
  • Binkowski, Mikolaj
  • Van Den Oord, Aaron Gerard Antonius
  • Elsen, Erich Konrad

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating output sequences using a non-auto-regressive neural network.

IPC Classes  ?

  • G06N 3/088 - Non-supervised learning, e.g. competitive learning
  • G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
  • G06N 3/047 - Probabilistic or stochastic networks
  • G06N 3/084 - Backpropagation, e.g. using gradient descent

75.

RATING TASKS AND POLICIES USING CONDITIONAL PROBABILITY DISTRIBUTIONS DERIVED FROM EQUILIBRIUM-BASED SOLUTIONS OF GAMES

      
Application Number EP2022078056
Publication Number 2023/057651
Status In Force
Filing Date 2022-10-10
Publication Date 2023-04-13
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor Marris, Luke Christopher

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for rating tasks and policies using conditional probability distributions derived from equilibrium-based solutions of games. One of the methods includes: determining, for each action selection policy in a pool of action selection policies, a respective performance measure of the action selection policy on each task in a pool of tasks, processing the performance measures of the action selection policies on the tasks to generate data defining a joint probability distribution over a set of action selection policy - task pairs, and processing the joint probability distribution over the set of action selection policy - task pairs to generate a respective rating for each action selection policy in the pool of action selection policies, where the respective rating for each action selection policy characterizes a utility of the action selection policy in performing tasks from the pool of tasks.

IPC Classes  ?

  • G06N 3/00 - Computing arrangements based on biological models
  • G06N 5/00 - Computing arrangements using knowledge-based models
  • G06N 3/08 - Learning methods
  • G06N 7/00 - Computing arrangements based on specific mathematical models

76.

NEURAL NETWORKS WITH TRANSFORMED ACTIVATION FUNCTION LAYERS

      
Application Number EP2022077600
Publication Number 2023/052653
Status In Force
Filing Date 2022-10-04
Publication Date 2023-04-06
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Martens, James
  • Zhang, Guodong
  • Swirszcz, Grzegorz Michael
  • Ballard, Andrew James
  • Desjardins, Guillaume

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing inputs using a neural network system that includes one or more transformed activation function layers.

IPC Classes  ?

77.

AUTOREGRESSIVELY GENERATING SEQUENCES OF DATA ELEMENTS DEFINING ACTIONS TO BE PERFORMED BY AN AGENT

      
Application Number EP2022072731
Publication Number 2023/025607
Status In Force
Filing Date 2022-08-12
Publication Date 2023-03-02
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Reed, Scott Ellison
  • Zolna, Konrad
  • Parisotto, Emilio
  • Erez, Tom
  • Novikov, Alexander
  • Rae, Jack William
  • Denil, Misha Man Ray
  • Gomes De Freitas, Joao Ferdinando
  • Vinyals, Oriol
  • Gomez, Sergio
  • Edwards, Ashley Deloris
  • Bruce, Jacob
  • Barth-Maron, Gabriel

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for selecting actions to be performed by an agent to interact with an environment using an action selection neural network. In one aspect, a method comprises, at each time step in a sequence of time steps: generating a current representation of a state of a task being performed by the agent in the environment as of the current time step as a sequence of data elements; autoregressively generating a sequence of data elements representing a current action to be performed by the agent at the current time step; and after autoregressively generating the sequence of data elements representing the current action, causing the agent to perform the current action at the current time step.

IPC Classes  ?

78.

CONTROLLING AGENTS BY SWITCHING BETWEEN CONTROL POLICIES DURING TASK EPISODES

      
Application Number EP2022071864
Publication Number 2023/012234
Status In Force
Filing Date 2022-08-03
Publication Date 2023-02-09
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Schaul, Tom
  • Pîslar, Miruna

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled to perform a task episode by switching the control policy that is used to control the agent at one or more time steps during the task episode.

IPC Classes  ?

  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G06N 3/08 - Learning methods
  • G06N 3/00 - Computing arrangements based on biological models

79.

TRAINING AGENT NEURAL NETWORKS THROUGH OPEN-ENDED LEARNING

      
Application Number EP2022071137
Publication Number 2023/006848
Status In Force
Filing Date 2022-07-27
Publication Date 2023-02-02
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Jaderberg, Maxwell Elliot
  • Czarnecki, Wojciech

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for raining an agent neural network for use in controlling an agent to perform a plurality of tasks. One of the methods includes maintaining population data specifying a population of one or more candidate agent neural networks; and training each candidate agent neural network on a respective set of one or more tasks to update the parameter values of the parameters of the candidate agent neural networks in the population data, the training comprising, for each candidate agent neural network: obtaining data identifying a candidate task; obtaining data specifying a control policy for the candidate task; determining whether to train the candidate agent neural network on the candidate task; and in response to determining to train the candidate agent neural network on the candidate task, training the candidate agent neural network on the candidate task.

IPC Classes  ?

  • G06N 3/00 - Computing arrangements based on biological models
  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G06N 3/08 - Learning methods

80.

CONTROLLING A MAGNETIC FIELD OF A MAGNETIC CONFINEMENT DEVICE USING A NEURAL NETWORK

      
Application Number EP2022069047
Publication Number 2023/281048
Status In Force
Filing Date 2022-07-08
Publication Date 2023-01-12
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Degrave, Jonas
  • Felici, Federico Alberto Alfredo
  • Buchli, Jonas
  • Neunert, Michael Peter
  • Tracey, Brendan Daniel
  • Carpanese, Francesco
  • Ewalds, Timo Victor
  • Hafner, Roland
  • Riedmiller, Martin

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating control signals for controlling a magnetic field for confining plasma in a chamber of a magnetic confinement device. One of the methods includes, for each of a plurality of time steps, obtaining an observation characterizing a current state of the plasma in the chamber of the magnetic confinement device, processing an input including the observation using a plasma confinement neural network to generate a magnetic control output that characterizes control signals for controlling the magnetic field of the magnetic confinement device, and generating the control signals for controlling the magnetic field of the magnetic confinement device based on the magnetic control output.

IPC Classes  ?

  • G21B 1/05 - Thermonuclear fusion reactors with magnetic or electric plasma confinement
  • G21D 3/00 - Control of nuclear power plant

81.

DETERMINING A DISTRIBUTION OF ATOM COORDINATES OF A MACROMOLECULE FROM IMAGES USING AUTO-ENCODERS

      
Application Number EP2022067541
Publication Number 2022/269100
Status In Force
Filing Date 2022-06-27
Publication Date 2022-12-29
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Ronneberger, Olaf
  • Abellanas, Marta Garnelo
  • Rosenbaum, Dan
  • Eslami, Seyed Mohammadali
  • Adler, Jonas Anders

Abstract

Methods, systems and apparatus, including computer programs encoded on computer storage media. One of the methods includes obtaining a plurality of images of a macromolecule having a plurality of atoms, training a decoder neural network on the plurality of images, and after the training, generating a plurality of conformations for at least a portion of the macromolecule that each include respective three-dimensional coordinates of each of the plurality of atoms, wherein generating each conformation includes sampling a conformation latent representation from a prior distribution over conformation latent representations, processing a respective input including the sampled conformation latent representation using the decoder neural network to generate a conformation output that specifies three-dimensional coordinates of each of the plurality of atoms for the conformation, and generating the conformation from the conformation output.

IPC Classes  ?

  • G16B 15/30 - Drug targeting using structural data; Docking or binding prediction
  • G16B 40/20 - Supervised data analysis

82.

ADAPTIVE VISUAL SPEECH RECOGNITION

      
Application Number EP2022066419
Publication Number 2022/263570
Status In Force
Filing Date 2022-06-15
Publication Date 2022-12-22
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Assael, Ioannis Alexandros
  • Shillingford, Brendan
  • Gomes De Freitas, Joao Ferdinando

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing video data using an adaptive visual speech recognition model. One of the methods includes receiving a video that includes a plurality of video frames that depict a first speaker; obtaining a first embedding characterizing the first speaker; and processing a first input comprising (i) the video and (ii) the first embedding using a visual speech recognition neural network having a plurality of parameters, wherein the visual speech recognition neural network is configured to process the video and the first embedding in accordance with trained values of the parameters to generate a speech recognition output that defines a sequence of one or more words being spoken by the first speaker in the video.

IPC Classes  ?

  • G10L 15/16 - Speech classification or search using artificial neural networks
  • G10L 15/26 - Speech to text systems
  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G06N 3/08 - Learning methods
  • G06V 40/16 - Human faces, e.g. facial parts, sketches or expressions
  • G10L 15/25 - Speech recognition using non-acoustical features using position of the lips, movement of the lips or face analysis

83.

MULTIMODAL FEW-SHOT LEARNING WITH FROZEN LANGUAGE MODELS

      
Application Number EP2022065502
Publication Number 2022/258666
Status In Force
Filing Date 2022-06-08
Publication Date 2022-12-15
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Tsimpoukelli, Maria Rafailia
  • Menick, Jacob Lee
  • Cabi, Serkan
  • Hill, Felix George
  • Eslami, Seyed Mohammadali
  • Vinyals, Oriol

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing multi-modal inputs using language models. In particular, the inputs include an image, and the image is encoded by an image encoder neural network to generate a sequence of image embeddings representing the image. The sequence of image embeddings is provided as at least part of an input sequence to that is processed by a language model neural network.

IPC Classes  ?

  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G06N 3/08 - Learning methods
  • G06V 10/80 - Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
  • G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks

84.

PREDICTING SPECTRAL REPRESENTATIONS FOR TRAINING SPEECH SYNTHESIS NEURAL NETWORKS

      
Application Number EP2022064925
Publication Number 2022/253901
Status In Force
Filing Date 2022-06-01
Publication Date 2022-12-08
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor Casagrande, Norman

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform speech synthesis. One of the methods includes obtaining a training data set for training a first neural network to process a spectral representation of an audio sample and to generate a prediction of the audio sample, wherein, after training, the first neural network obtains spectral representations of audio samples from a second neural network; for a plurality of audio samples in the training data set: generating a ground-truth spectral representation of the audio sample; and processing the ground-truth spectral representation using a third neural network to generate an updated spectral representation of the audio sample; and training the first neural network using the updated spectral representations, wherein the third neural network is configured to generate updated spectral representations that resemble spectral representations generated by the second neural network.

IPC Classes  ?

  • G10L 13/047 - Architecture of speech synthesisers
  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G10L 13/10 - Prosody rules derived from text; Stress or intonation
  • G10L 13/033 - Voice editing, e.g. manipulating the voice of the synthesiser

85.

CONTINUAL LEARNING NEURAL NETWORK SYSTEM TRAINING FOR CLASSIFICATION TYPE TASKS

      
Application Number EP2022064431
Publication Number 2022/248676
Status In Force
Filing Date 2022-05-27
Publication Date 2022-12-01
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor Shanahan, Murray

Abstract

There is disclosed a computer-implemented method for training a neural network-based system. The method comprises receiving a training data item and target data associated with the training data item. The training data item is processed using an encoder to generate an encoding of the training data item. A subset of neural networks is selected from a plurality of neural networks stored in a memory based upon the encoding; wherein the plurality of neural networks are configured to process the encoding to generate output data indicative of a classification of an aspect of the training data item. The encoding is processed using the selected subset of neural networks to generate the output data. An update to the parameters of the selected subset of neural networks is determined based upon a loss function comprising a relationship between the generated output data and the target data associated with the training data item. The parameters of the selected subset of neural networks are updated based upon the determined update.

IPC Classes  ?

86.

NEURAL NETWORKS WITH HIERARCHICAL ATTENTION MEMORY

      
Application Number EP2022064497
Publication Number 2022/248723
Status In Force
Filing Date 2022-05-27
Publication Date 2022-12-01
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor Lampinen, Andrew Kyle

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing a machine learning task on a network input to generate a network output. One of the systems includes an attention neural network comprising one or more hierarchical attention blocks, each hierarchical attention block configured to: receive an input sequence for the hierarchical attention block; maintain a 5 plurality of memory summary keys, each memory summary key corresponding to a respective one of a plurality of partitions of a sequence of memory block inputs; determine a proper subset of the plurality of memory summary keys; and generate an attended input sequence for the hierarchical attention block including applying an attention mechanism over the respective memory block inputs at the memory positions within the partitions of 10 the sequence of memory block inputs that correspond to the proper subset of the plurality of memory summary keys.

IPC Classes  ?

  • G06N 3/00 - Computing arrangements based on biological models
  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G06N 3/08 - Learning methods

87.

REINFORCEMENT LEARNING USING AN ENSEMBLE OF DISCRIMINATOR MODELS

      
Application Number EP2022064499
Publication Number 2022/248725
Status In Force
Filing Date 2022-05-27
Publication Date 2022-12-01
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Hansen, Steven Stenberg
  • Strouse, Daniel Joseph

Abstract

This specification describes a method performed by one or more data processing apparatus that includes: sampling a latent from a set of possible latents, selecting actions to be performed by an agent to interact with an environment over a sequence of time steps using an action selection neural network that is conditioned on the sampled latent, determining a respective reward received for each time step in the sequence of time steps using an ensemble of discriminator models, and training the action selection neural network based on the rewards using a reinforcement learning technique. Each discriminator model can process an observation to generate a respective prediction output that predicts which latent the action selection neural network was conditioned on to cause the environment to enter the state characterized by the observation.

IPC Classes  ?

88.

TRAINING GRAPH NEURAL NETWORKS USING A DE-NOISING OBJECTIVE

      
Application Number EP2022064565
Publication Number 2022/248735
Status In Force
Filing Date 2022-05-30
Publication Date 2022-12-01
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Godwin, Jonathan William
  • Battaglia, Peter William
  • Schaarschmidt, Kevin Michael
  • Sanchez, Alvaro

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network that includes one or more graph neural network layers. In one aspect, a method comprises: generating data defining a graph, comprising: generating a respective final feature representation for each node, wherein, for each of one or more of the nodes, the respective final feature representation is a modified feature representation that is generated from a respective feature representation for the node using respective noise; processing the data defining the graph using one or more of the graph neural network layers of the neural network to generate a respective updated node embedding of each node; and processing, for each of one or more of the nodes having modified feature representations, the updated node embedding of the node to generate a respective de-noising prediction for the node.

IPC Classes  ?

  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G06N 3/08 - Learning methods
  • G16C 20/70 - Machine learning, data mining or chemometrics
  • G16C 20/30 - Prediction of properties of chemical compounds, compositions or mixtures
  • G16C 20/50 - Molecular design, e.g. of drugs

89.

UNSUPERVISED LEARNING OF OBJECT REPRESENTATIONS FROM VIDEO SEQUENCES USING ATTENTION OVER SPACE AND TIME

      
Application Number EP2022064484
Publication Number 2022/248712
Status In Force
Filing Date 2022-05-27
Publication Date 2022-12-01
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Kabra, Rishabh
  • Zoran, Daniel
  • Erdogan, Goker
  • Creswell, Antonia Phoebe Nina
  • Matthey-De-L'Endroit, Loic
  • Botvinick, Matthew
  • Lerchner, Alexander
  • Burgess, Christopher Paul

Abstract

A computer-implemented video generation neural network system, configured to determine a value for each of a set of object latent variables by sampling from a respective prior object latent distribution for the object latent variable. The system comprises a trained image frame decoder neural network configured to, for each pixel of each generated image frame and for each generated image frame time step process determined values of the object latent variables to determine parameters of a pixel distribution for each of the object latent variables, combine the pixel distributions for each of the object latent variables to determine a combined pixel distribution, and sample from the combined pixel distribution to determine a value for the pixel and for the time step.

IPC Classes  ?

90.

LEARNING DIVERSE SKILLS FOR TASKS USING SEQUENTIAL LATENT VARIABLES FOR ENVIRONMENT DYNAMICS

      
Application Number EP2022064491
Publication Number 2022/248718
Status In Force
Filing Date 2022-05-27
Publication Date 2022-12-01
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor Hansen, Steven Stenberg

Abstract

This specification relates to methods for controlling agents to perform actions according to a goal (or option) comprising a sequence of local goals (or local options) and corresponding methods for training. As discussed herein, environment dynamics may be modelled sequentially by sampling latent variables, each latent variable relating to a local goal and being dependent on a previous latent variable. These latent variables are used to condition an action-selection policy neural network to select actions according to the local goal. This allows the agents to reach more diverse states than would be possible through a fixed latent variable or goal, thereby encouraging exploratory behavior. In addition, specific methods described herein model the sequence of latent variables through a simple linear and recurrent relationship that allows the system to be trained more efficiently. This avoids the need to learn a state-dependent higher level policy for selecting the latent variables which can be difficult to train in practice.

IPC Classes  ?

  • G06N 3/08 - Learning methods
  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G06N 7/00 - Computing arrangements based on specific mathematical models

91.

MULTI-OBJECTIVE REINFORCEMENT LEARNING USING WEIGHTED POLICY PROJECTION

      
Application Number EP2022064493
Publication Number 2022/248720
Status In Force
Filing Date 2022-05-27
Publication Date 2022-12-01
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Abdolmaleki, Abbas
  • Huang, Sandy Han
  • Riedmiller, Martin

Abstract

Computer implemented systems and methods for training an action selection policy neural network to select actions to be performed by an agent to control the agent to perform a task. The techniques are able to optimize multiple objectives one of which may be to stay close to a behavioral policy of a teacher. The behavioral policy of the teacher may be defined by a predetermined dataset of behaviors and the systems and methods may then learn offline. The described techniques provide a mechanism for explicitly defining a trade-off between the multiple objectives.

IPC Classes  ?

  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G06N 3/08 - Learning methods
  • G06N 7/00 - Computing arrangements based on specific mathematical models
  • G06N 5/00 - Computing arrangements using knowledge-based models

92.

REINFORCEMENT LEARNING BY SOLUTION OF A CONVEX MARKOV DECISION PROCESS

      
Application Number EP2022064495
Publication Number 2022/248722
Status In Force
Filing Date 2022-05-27
Publication Date 2022-12-01
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Zahavy, Tom Ben Zion
  • O'Donoghue, Brendan Timothy
  • Desjardins, Guillaume
  • Baveja, Satinder Singh

Abstract

The actions of an agent in an environment are selected using a policy model neural network which implements a policy model defining, for any observed state of the environment characterized by an observation received by the policy model neural network, a state-action distribution over the set of possible actions the agent can perform. The policy model neural network is jointly trained with a cost model neural network which, upon receiving an observation characterizing the environment, outputs a reward vector. The reward vector comprises a corresponding reward value for every possible action. The training involves a sequence of iterations, in each of which (a) a cost model is derived based on the state-action distribution of a candidate policy model defined in one or more previous iterations, and subsequently (b) a candidate policy model is obtained based on reward vector(s) defined by the cost model obtained in the iteration.

IPC Classes  ?

  • G06N 3/00 - Computing arrangements based on biological models

93.

GENERATING NEURAL NETWORK OUTPUTS BY CROSS ATTENTION OF QUERY EMBEDDINGS OVER A SET OF LATENT EMBEDDINGS

      
Application Number EP2022064501
Publication Number 2022/248727
Status In Force
Filing Date 2022-05-27
Publication Date 2022-12-01
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Jaegle, Andrew Coulter
  • Alayrac, Jean-Baptiste
  • Borgeaud Dit Avocat, Sebastian
  • Ionescu, Catalin-Dumitru
  • Doersch, Carl
  • Ding, Fengning
  • Vinyals, Oriol
  • Hénaff, Olivier Jean
  • Koppula, Skanda Kumar
  • Zoran, Daniel
  • Brock, Andrew
  • Shelhamer, Evan Gerard
  • Zisserman, Andrew
  • Carreira, Joao

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a network output using a neural network. In one aspect, a method comprises: obtaining: (i) a network input to a neural network, and (ii) a set of query embeddings; processing the network input using the neural network to generate a network output that comprises a respective dimension corresponding to each query embedding in the set of query embeddings, comprising: processing the network input using an encoder block of the neural network to generate a representation of the network input as a set of latent embeddings; and processing: (i) the set of latent embeddings, and (ii) the set of query embeddings, using a cross-attention block that generates each dimension of the network output by cross-attention of a corresponding query embedding over the set of latent embeddings.

IPC Classes  ?

94.

ENHANCING POPULATION-BASED TRAINING OF NEURAL NETWORKS

      
Application Number EP2022064563
Publication Number 2022/248734
Status In Force
Filing Date 2022-05-30
Publication Date 2022-12-01
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Dalibard, Valentin Clement
  • Jaderberg, Maxwell Elliot

Abstract

Methods, computer systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network for performing a task. The system maintains data specifying (i) a plurality of candidate neural networks and (ii) a partitioning of the plurality of candidate neural networks into a plurality of partitions. The system repeatedly performs operations, including: training each of the candidate neural networks; evaluating each candidate neural network using a respective fitness function for the partition; and for each partition, updating the respective values of the one or more hyperparameters for at least one of the candidate neural networks in the partition based on the respective fitness metrics of the candidate neural networks in the partition. After repeatedly performing the operations, the system selects, from the maintained data, the respective values of the network parameters of one of the candidate neural networks.

IPC Classes  ?

  • G06N 3/00 - Computing arrangements based on biological models
  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G06N 3/08 - Learning methods

95.

TRAINING RATE CONTROL NEURAL NETWORKS THROUGH REINFORCEMENT LEARNING

      
Application Number EP2022064566
Publication Number 2022/248736
Status In Force
Filing Date 2022-05-30
Publication Date 2022-12-01
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Zhernov, Anton
  • Gu, Chenjie
  • Mankowitz, Daniel J.
  • Schrittwieser, Julian
  • Mandhane, Amol Balkishan
  • Rauh, Mary Elizabeth
  • Wang, Miaosen
  • Hubert, Thomas Keisuke

Abstract

Systems and methods for training rate control neural networks through reinforcement learning. During training, reward values for training examples are generated from the current performance of the rate control neural network in encoding the video in the training example and the historical performance of the rate control neural network in encoding the video in the training example.

IPC Classes  ?

96.

INTERACTIVE DECODING OF WORDS FROM PHONEME SCORE DISTRIBUTIONS

      
Application Number EP2022059331
Publication Number 2022/214623
Status In Force
Filing Date 2022-04-07
Publication Date 2022-10-13
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Assael, Ioannis Alexandros
  • Shillingford, Brendan
  • Denil, Misha Man Ray

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for interactive decoding of a word sequence.

IPC Classes  ?

  • G10L 15/16 - Speech classification or search using artificial neural networks
  • G06V 40/20 - Movements or behaviour, e.g. gesture recognition
  • G10L 15/183 - Speech classification or search using natural language modelling using context dependencies, e.g. language models
  • G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
  • G10L 15/25 - Speech recognition using non-acoustical features using position of the lips, movement of the lips or face analysis

97.

PREDICTING COMPLETE PROTEIN REPRESENTATIONS FROM MASKED PROTEIN REPRESENTATIONS

      
Application Number EP2022051943
Publication Number 2022/194434
Status In Force
Filing Date 2022-01-27
Publication Date 2022-09-22
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Pritzel, Alexander
  • Ionescu, Catalin-Dumitru
  • Kohl, Simon

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for unmasking a masked representation of a protein using a protein reconstruction neural network. In one aspect, a method comprises: receiving the masked representation of the protein; and processing the masked representation of the protein using the protein reconstruction neural network to generate a respective predicted embedding corresponding to one or more masked embeddings that are included in the masked representation of the protein, wherein a predicted embedding corresponding to a masked embedding in a representation of the amino acid sequence of the protein defines a prediction for an identity of an amino acid at a corresponding position in the amino acid sequence, wherein a predicted embedding corresponding to a masked embedding in a representation of the structure of the protein defines a prediction for a corresponding structural feature of the protein.

IPC Classes  ?

98.

GENERATING OUTPUT SIGNALS USING VARIABLE-RATE DISCRETE REPRESENTATIONS

      
Application Number EP2022056014
Publication Number 2022/189493
Status In Force
Filing Date 2022-03-09
Publication Date 2022-09-15
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor Dieleman, Sander Etienne Lea

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating output signals using variable-rate discrete representations. One of the methods includes generating, using a generative neural network, an event sequence representing a run-length encoding of a discrete representation of the audio signal, the event sequence comprising a respective event at each of a plurality of event sequence time steps; generating the discrete representation of the audio signal from the event sequence using run-length decoding; and processing the discrete representation using a decoder neural network, wherein the decoder neural network is configured to process the discrete representation of the audio signal to generate the prediction of the audio signal.

IPC Classes  ?

  • G10L 19/16 - Vocoder architecture
  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks

99.

NOWCASTING USING GENERATIVE NEURAL NETWORKS

      
Application Number EP2022053834
Publication Number 2022/175337
Status In Force
Filing Date 2022-02-16
Publication Date 2022-08-25
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Ravuri, Suman
  • Lenc, Karel
  • Mirowski, Piotr Wojciech
  • Lam, Remi Roger Alain Paul
  • Willson, Matthew James
  • Brock, Andrew

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for precipitation nowcasting using generative neural networks. One of the methods includes obtaining a context temporal sequence of a plurality of context radar fields characterizing a real-world location, each context radar field characterizing the weather in the real-world location at a corresponding preceding time point; sampling a set of one or more latent inputs by sampling values from a specified distribution; and for each sampled latent input, processing the context temporal sequence of radar fields and the sampled latent input using a generative neural network that has been configured through training to process the temporal sequence of radar fields to generate as output a predicted temporal sequence comprising a plurality of predicted radar fields, each predicted radar field in the predicted temporal sequence characterizing the predicted weather in the real-world location at a corresponding future time point.

IPC Classes  ?

  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G01W 1/10 - Devices for predicting weather conditions
  • G05D 1/00 - Control of position, course, altitude, or attitude of land, water, air, or space vehicles, e.g. automatic pilot
  • G06N 3/08 - Learning methods
  • G08G 3/00 - Traffic control systems for marine craft
  • G08G 5/00 - Traffic control systems for aircraft

100.

SEQUENCE-TO SEQUENCE NEURAL NETWORK SYSTEMS USING LOOK AHEAD TREE SEARCH

      
Application Number EP2022053035
Publication Number 2022/171635
Status In Force
Filing Date 2022-02-08
Publication Date 2022-08-18
Owner DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor
  • Leblond, Rémi Bertrand Francis
  • Alayrac, Jean-Baptiste
  • Sifre, Laurent
  • Pîslar, Miruna
  • Lespiau, Jean-Baptiste
  • Antonoglou, Ioannis
  • Simonyan, Karen
  • Silver, David
  • Vinyals, Oriol

Abstract

A computer-implemented method for generating an output token sequence from an input token sequence. The method combines a look ahead tree search, such as a Monte Carlo tree search, with a sequence-to-sequence neural network system. The sequence-to-sequence neural network system has a policy output defining a next token probability distribution, and may include a value neural network providing a value output to evaluate a sequence. An initial partial output sequence is extended using the look ahead tree search guided by the policy output and, in implementations, the value output, of the sequence-to-sequence neural network system until a complete output sequence is obtained.

IPC Classes  ?

  • G06N 3/08 - Learning methods
  • G06N 5/00 - Computing arrangements using knowledge-based models
  • G06N 3/04 - Architecture, e.g. interconnection topology
  1     2     3        Next Page