DeepMind Technologies Limited

United Kingdom

1-100 of 292 for DeepMind Technologies Limited

Sort by

Query

Patent
World - WIPO

Aggregations

Reset Report

Date

IPC Class

164

151

Found results for

patents

1 2 3 Next Page

1. SELECTIVE ACQUISITION FOR MULTI-MODAL TEMPORAL DATA

Application Number	EP2023079389
Publication Number	2024/084097
Status	In Force
Filing Date	2023-10-21
Publication Date	2024-04-25
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Kossen, Jannik Lukas Belgrave, Danielle Charlotte Mary Tomasev, Nenad Cangea, Catalina-Codruta Ktena, Sofia Ira Vértes, Eszter Patraucean, Viorica Jaegle, Andrew Coulter

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a prediction characterizing an environment. In one aspect, a method includes obtaining a respective observation characterizing a state of an environment for each time step in a sequence of multiple time steps, comprising, for each time step after a first time step in the sequence of time steps: processing a network input that comprises observations obtained for one or more preceding time steps to generate a plurality of acquisition decisions; obtaining an observation for the time step, wherein the observation includes data corresponding to modalities that are selected for acquisition at the time step, does not include data corresponding to modalities that are not selected for acquisition at the time step; and processing a model input that includes the observation for each time step in the sequence of time steps to generate the prediction.

IPC Classes ?

G06N 3/045 - Combinations of networks
G06N 3/092 - Reinforcement learning
G06N 5/022 - Knowledge engineering; Knowledge acquisition

2. PATHOGENICITY PREDICTION FOR PROTEIN MUTATIONS USING AMINO ACID SCORE DISTRIBUTIONS

Application Number	EP2023078227
Publication Number	2024/079204
Status	In Force
Filing Date	2023-10-11
Publication Date	2024-04-18
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Avsec, Ziga Novati, Guido Cheng, Jun

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a pathogenicity score characterizing a likelihood that a mutation to a protein is a pathogenic mutation, wherein the mutation modifies an amino acid sequence of the protein by replacing an original amino acid by a substitute amino acid at a mutation position in the amino acid sequence of the protein. In one aspect, a method comprises: generating a network input to a pathogenicity prediction neural network, wherein the network input comprises a multiple sequence alignment (MSA) representation that represents an MSA for the protein; processing the network input using the pathogenicity prediction neural network to generate a score distribution over a set of amino acids; and generating the pathogenicity score using the score distribution over the set of amino acids.

IPC Classes ?

G16B 15/20 - Protein or domain folding
G16B 20/50 - Mutagenesis
G16B 30/10 - Sequence alignment; Homology search
G16B 40/20 - Supervised data analysis

3. OPTIMIZING ALGORITHMS FOR HARDWARE DEVICES

Application Number	EP2023077237
Publication Number	2024/074452
Status	In Force
Filing Date	2023-10-02
Publication Date	2024-04-11
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Hubert, Thomas Keisuke Huang, Shih-Chieh Novikov, Alexander Fawzi, Alhussein Romera-Paredes, Bernardino Silver, David Hassabis, Demis Swirszcz, Grzegorz Michal Schrittwieser, Julian Kohli, Pushmeet Barekatain, Mohammadamin Balog, Matej Rodriguez Ruiz, Francisco Jesus

Abstract

A method performed by one or more computers for obtaining an optimized algorithm that (i) is functionally equivalent to a target algorithm and (ii) optimizes one or more target properties when executed on a target set of one or more hardware devices. The method includes: initializing a target tensor representing the target algorithm; generating, using a neural network having a plurality of network parameters, a tensor decomposition of the target tensor that parametrizes a candidate algorithm; generating target property values for each of the target properties when executing the candidate algorithm on the target set of hardware devices; determining a benchmarking score for the tensor decomposition based on the target property values of the candidate algorithm; generating a training example from the tensor decomposition and the benchmarking score; and storing, in a training data store, the training example for use in updating the network parameters of the neural network.

IPC Classes ?

G06F 16/901 - Indexing; Data structures therefor; Storage structures
G06N 3/092 - Reinforcement learning
G06N 3/0985 - Hyperparameter optimisation; Meta-learning; Learning-to-learn
G06N 5/01 - Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound

4. CONTROLLING AGENTS USING REPORTER NEURAL NETWORKS

Application Number	EP2023076516
Publication Number	2024/068610
Status	In Force
Filing Date	2023-09-26
Publication Date	2024-04-04
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Dasgupta, Ishita Chen, Shiqi Marino, Kenneth Daniel Shang, Wenling Ahuja, Arun

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents using reporter neural networks.

IPC Classes ?

G06F 16/9032 - Query formulation
G06F 16/903 - Querying
G06N 3/00 - Computing arrangements based on biological models

5. SCORE MODELLING FOR SIMULATION-BASED INFERENCE

Application Number	EP2023076529
Publication Number	2024/068622
Status	In Force
Filing Date	2023-09-26
Publication Date	2024-04-04
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Mnih, Andriy Geffner, Tomas

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for using simulation-based inference to inferring a set of parameters such as measurements, from observations, e.g. real world observations. The method uses a score generation neural network to determine scores for individual observations or for groups of observations that are combined and used to iteratively adjust values of the parameters.

IPC Classes ?

G06F 16/906 - Clustering; Classification
G06N 3/00 - Computing arrangements based on biological models

6. GRAPH NEURAL NETWORKS THAT MODEL FACE-FACE INTERACTIONS BETWEEN MESHES

Application Number	EP2023076797
Publication Number	2024/068788
Status	In Force
Filing Date	2023-09-27
Publication Date	2024-04-04
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Allen, Kelsey Rebecca Rubanova, Yulia Lopez Guevara, Tatiana Whitney, William Fairclough Sanchez, Alvaro Battaglia, Peter William Pfaff, Tobias

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for simulating a state of an environment over a sequence of time steps. In one aspect, a method comprises, at each of one or more time steps: obtaining an environment mesh representing the state of the environment at the time step; generating a graph representing the state of the environment at the time step, comprising: determining that a first face of a first object mesh is within a collision distance of a second face of a second object mesh; and in response, instantiating a face-face edge in the graph that connects: (i) a first set of graph nodes in the graph that represent the first face in the first object mesh, and (ii) a second set of graph nodes in the graph that represent the second face in the second object mesh.

IPC Classes ?

G06F 30/20 - Design optimisation, verification or simulation
B25J 9/00 - Programme-controlled manipulators

7. LEARNING TASKS USING SKILL SEQUENCING FOR TEMPORALLY-EXTENDED EXPLORATION

Application Number	EP2023076798
Publication Number	2024/068789
Status	In Force
Filing Date	2023-09-27
Publication Date	2024-04-04
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Vezzani, Giulia Tirumala Bukkapatnam, Dhruva Wulfmeier, Markus Riedmiller, Martin Heess, Nicolas Manfred Otto

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling an agent that is interacting with an environment. Implementations of the system use previously learned skills to explore states of the environment to collect and store training data, which is then used to train an action selection system. The system includes a set of skill action selection subsystems, each configured to select actions for the agent to perform for a respective skill. The set of skill action selection subsystems is used to explore states of the environment to collect the training data, keeping their individual action selection policies unchanged. A scheduler neural network selects the skill neural networks to use. The action selection system is trained on the stored training data.

IPC Classes ?

G06N 3/092 - Reinforcement learning
G06N 3/045 - Combinations of networks
G06N 3/084 - Backpropagation, e.g. using gradient descent

8. REINFORCEMENT LEARNING USING DENSITY ESTIMATION WITH ONLINE CLUSTERING FOR EXPLORATION

Application Number	EP2023076893
Publication Number	2024/068841
Status	In Force
Filing Date	2023-09-28
Publication Date	2024-04-04
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Saade, Alaa Kapturowski, Steven James Calandriello, Daniele Blundell, Charles Valko, Michal Sprechmann, Pablo Piot, Bilal

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network used to select actions to be performed by an agent interacting with an environment. Implementations of the described techniques can learn to explore the environment efficiently by storing and updating state embedding cluster centers based on observations characterizing states of the environment.

IPC Classes ?

G06F 16/901 - Indexing; Data structures therefor; Storage structures
G06F 16/906 - Clustering; Classification
G06N 3/092 - Reinforcement learning

9. AGENT CONTROL THROUGH IN-CONTEXT REINFORCEMENT LEARNING

Application Number	EP2023076897
Publication Number	2024/068843
Status	In Force
Filing Date	2023-09-28
Publication Date	2024-04-04
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Laskin, Michael Mnih, Volodymyr Wang, Luyu Baveja, Satinder Singh

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using an action selection neural network that performs in-context reinforcement learning when controlling an agent on a new task.

IPC Classes ?

G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
G06N 3/0442 - Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
G06N 3/092 - Reinforcement learning

10. DISCRETE TOKEN PROCESSING USING DIFFUSION MODELS

Application Number	EP2023076788
Publication Number	2024/068781
Status	In Force
Filing Date	2023-09-27
Publication Date	2024-04-04
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Strudel, Robin Leblond, Rémi Sifre, Laurent Dieleman, Sander Etienne Lea Savinov, Nikolay Grathwohl, Will S. Tallec, Corentin Altché, Florent Ganin, Iaroslav Mensch, Arthur Du, Yilun

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an output sequence of discrete tokens using a diffusion model. In one aspect, a method includes generating, by using the diffusion model, a final latent representation of the sequence of discrete tokens that includes a determined value for each of a plurality of latent variables; applying a de-embedding matrix to the final latent representation of the output sequence of discrete tokens to generate a de-embedded final latent representation that includes, for each of the plurality of latent variables, a respective numeric score for each discrete token in a vocabulary of multiple discrete tokens; selecting, for each of the plurality of latent variables, a discrete token from among the multiple discrete tokens in the vocabulary that has a highest numeric score; and generating the output sequence of discrete tokens that includes the selected discrete tokens.

IPC Classes ?

G06N 3/0475 - Generative networks
G06N 3/088 - Non-supervised learning, e.g. competitive learning

11. REWARD-MODEL BASED REINFORCEMENT LEARNING FOR PERFORMING REASONING TASKS

Application Number	EP2023076792
Publication Number	2024/068784
Status	In Force
Filing Date	2023-09-27
Publication Date	2024-04-04
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Higgins, Irina Uesato, Jonathan Ken Kushman, Nathaniel Arthur Kumar, Ramana

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for A training a language model for performing a reasoning task. The system obtains a plurality of training examples. Each training example includes a respective sample query text sequence characterizing a respective sample query and a respective reference response text sequence that includes a reference final answer to the respective sample query. The system trains a reward model on the plurality of training examples. The reward model is configured to receive an input including a query text sequence characterizing a query and one or more reasoning steps that have been generated in response to the query and process the input to compute a reward score indicating how successful the one or more reasoning steps are in yielding a correct final answer to the query. The system trains the language model using the trained reward model.

IPC Classes ?

G06F 16/332 - Query formulation
G06F 16/33 - Querying
G06F 16/338 - Presentation of query results
G06N 3/02 - Neural networks
G06N 3/092 - Reinforcement learning
G06N 5/04 - Inference or reasoning models

12. SYSTEM AND METHOD FOR REINFORCEMENT LEARNING BASED ON PRIOR TRAJECTORIES

Application Number	EP2023076793
Publication Number	2024/068785
Status	In Force
Filing Date	2023-09-27
Publication Date	2024-04-04
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Bruce, Jacob Anand, Ankit Fergus, Robert David

Abstract

A reinforcement learning system is proposed in which a policy model neural network is trained to control an agent to perform a task in successive time steps, by training a control system including the policy model neural network to select a respective action for each time step which gives a high value for a reward function based on the action, and which indicates the contribution of the action to solving the task. The reward function includes a term based on a progress value output by a progress model. The progress model generates the progress value upon receiving a first observation of the state of the environment at a time step before the performance of the action, and a second observation of the state of the environment at a time step following the performance of the action. The progress value is an estimate of the average time which an ensemble of experts who produced the demonstrations would have taken to transform the environment from how it appears in the first observation to how it appears in the second observation.

IPC Classes ?

G06N 3/02 - Neural networks
G06N 3/08 - Learning methods
G06N 3/092 - Reinforcement learning

13. NEURAL NETWORKS WITH REGULARIZED ATTENTION LAYERS

Application Number	EP2023076794
Publication Number	2024/068786
Status	In Force
Filing Date	2023-09-27
Publication Date	2024-04-04
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	He, Bobby Boyi Martens, James

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing a network input using a neural network that includes one or more regularized attention layers. In one aspect, a method comprises: receiving a layer input to a regularized attention layer, wherein the layer input to the regularized attention layer comprises a set of input embeddings; and applying a regularized attention operation over the set of input embeddings to generate a set of output embeddings, comprising: transforming intermediate attention scores using a set of shaping constants to generate a set of transformed attention scores, wherein: values of the shaping constants are initialized prior to training of the neural network and are not adjusted during the training of the neural network; and the values of the shaping constants are selected to regularize the set of output embeddings.

IPC Classes ?

G06N 3/045 - Combinations of networks
G06N 3/08 - Learning methods

14. GUIDED DIALOGUE USING LANGUAGE GENERATION NEURAL NETWORKS AND SEARCH

Application Number	EP2023075931
Publication Number	2024/061963
Status	In Force
Filing Date	2023-09-20
Publication Date	2024-03-28
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Irving, Geoffrey Glaese, Amelia Marita Claudia Mcaleese-Park, Nathaniel John Hendricks, Lisa Anne Marie

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enabling a user to conduct a dialogue. Implementations of the system learn when to rely on supporting evidence, obtained from an external search system via a search system interface, and are also able to generate replies for the user that align with the preferences of a previously trained response selection neural network. Implementations of the system can also use a previously trained rule violation detection neural network to generate replies that take account of previously learnt rules.

IPC Classes ?

G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
G06N 3/092 - Reinforcement learning
G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
G06N 3/084 - Backpropagation, e.g. using gradient descent
G06N 3/094 - Adversarial learning
G06N 5/01 - Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound

15. CONTROLLING INDUSTRIAL FACILITIES USING HIERARCHICAL REINFORCEMENT LEARNING

Application Number	EP2023075295
Publication Number	2024/056800
Status	In Force
Filing Date	2023-09-14
Publication Date	2024-03-21
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Wong, William Dutta, Praneet Luo, Jerry Jiayu

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling a facility through hierarchical reinforcement learning. In particular, the facility is controlled using a high-level controller neural network that makes high-level decisions and a low-level controller neural network that makes low-level controller decisions.

IPC Classes ?

G05B 13/02 - Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric

16. DATA-EFFICIENT REINFORCEMENT LEARNING WITH ADAPTIVE RETURN COMPUTATION SCHEMES

Application Number	EP2023075512
Publication Number	2024/056891
Status	In Force
Filing Date	2023-09-15
Publication Date	2024-03-21
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Jiang, Ray Puigdomènech Badia, Adrià Campos Camúñez, Víctor Kapturowski, Steven James Rakicevic, Nemanja

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for data-efficient reinforcement learning with adaptive return computation schemes.

IPC Classes ?

G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
G06N 3/045 - Combinations of networks
G06N 3/092 - Reinforcement learning
G06N 3/096 - Transfer learning
G06N 3/0464 - Convolutional networks [CNN, ConvNet]
G06N 3/084 - Backpropagation, e.g. using gradient descent
G06N 3/0985 - Hyperparameter optimisation; Meta-learning; Learning-to-learn

17. TRAINING POLICY NEURAL NETWORKS IN SIMULATION USING SCENE SYNTHESIS MACHINE LEARNING MODELS

Application Number	EP2023075514
Publication Number	2024/056892
Status	In Force
Filing Date	2023-09-15
Publication Date	2024-03-21
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Byravan, Arunkumar Humplik, Jan Hasenclever, Leonard Brussee, Arthur Karl Nori, Francesco

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a policy neural network for use in controlling a robot. In particular, the policy neural network can be trained in simulation using images generated by a scene synthesis machine learning model.

IPC Classes ?

G06N 3/0895 - Weakly supervised learning, e.g. semi-supervised or self-supervised learning
G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
G06N 3/092 - Reinforcement learning
G06N 3/096 - Transfer learning
G06N 3/045 - Combinations of networks

18. ACTION ABSTRACTION CONTROLLER FOR FULLY ACTUATED ROBOTIC MANIPULATORS

Application Number	EP2023067028
Publication Number	2024/051978
Status	In Force
Filing Date	2023-06-22
Publication Date	2024-03-14
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Chen, Jose Enrique Laurens, Antoine Marin Alix Romano, Francesco Scholz, Jonathan Karl Fernandes Martins, Murilo Nori, Francesco

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling a robot manipulator that has a plurality of joints. One of the methods includes obtaining a control input that comprises one or more velocity values that specify a target velocity of a reference point in a given coordinate frame; determining a respective joint velocity for each of the plurality of joints by generating a solution to an optimization problem formulated from the control input; and controlling the robot manipulator, including causing the plurality of joints of the robot manipulator to move in accordance with the respective joint velocities to approximate the control input.

IPC Classes ?

B25J 9/16 - Programme controls
G05B 19/427 - Teaching successive positions by tracking the position of a joystick or handle to control the positioning servo of the tool head, master-slave control

19. CONTROLLING AGENTS USING AMBIGUITY-SENSITIVE NEURAL NETWORKS AND RISK-SENSITIVE NEURAL NETWORKS

Application Number	EP2023074759
Publication Number	2024/052544
Status	In Force
Filing Date	2023-09-08
Publication Date	2024-03-14
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Grau Moya, Jordi Delétang, Grégoire Kunesch, Markus Ortega Caballero, Pedro Alejandro

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using an action selection system that is risk-sensitive, ambiguity-sensitive, or both.

IPC Classes ?

G06N 3/044 - Recurrent networks, e.g. Hopfield networks
G06N 3/045 - Combinations of networks
G06N 3/092 - Reinforcement learning
G06N 3/0985 - Hyperparameter optimisation; Meta-learning; Learning-to-learn
G06N 3/0464 - Convolutional networks [CNN, ConvNet]
G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
G06N 7/01 - Probabilistic graphical models, e.g. probabilistic networks

20. SELECTION-INFERENCE NEURAL NETWORK SYSTEMS

Application Number	EP2023073796
Publication Number	2024/047108
Status	In Force
Filing Date	2023-08-30
Publication Date	2024-03-07
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Creswell, Antonia Phoebe Nina Shanahan, Murray

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a response to a query input using a selection-inference neural network.

IPC Classes ?

G06N 5/046 - Forward inferencing; Production systems
G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
G06N 5/025 - Extracting rules from data
G06N 5/04 - Inference or reasoning models
G06N 5/045 - Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence

21. DETERMINING FAILURE CASES IN TRAINED NEURAL NETWORKS USING GENERATIVE NEURAL NETWORKS

Application Number	EP2023072617
Publication Number	2024/038114
Status	In Force
Filing Date	2023-08-16
Publication Date	2024-02-22
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Gowal, Sven Adrian Wiles, Olivia Anne Carneiro De Albuquerque, Isabela Maria

Abstract

Methods, systems, and computer readable storage media for performing operations comprising: obtaining a plurality of initial network inputs that have been classified as belonging to a corresponding ground truth class; processing each of the plurality of initial network inputs using a trained target neural network to generate a respective predicted network output for each initial network input, the respective predicted network output comprising a respective score for each of a plurality of classes, the plurality of classes comprising the ground truth class; identifying, based on the respective predicted network outputs and the ground truth class, a subset of the initial network inputs as having been misclassified by the trained target neural network; and determining, based on the subset of initial network inputs, one or more failure case latent representations, wherein each failure case latent representation is a latent representation that characterizes network inputs that belong to the ground truth class but that are likely to be misclassified by the trained target neural network.

IPC Classes ?

G06N 5/045 - Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
G06N 3/045 - Combinations of networks
G06N 3/0475 - Generative networks
G06N 3/09 - Supervised learning

22. AUTOMATED DISCOVERY OF AGENTS IN SYSTEMS

Application Number	EP2023071987
Publication Number	2024/033387
Status	In Force
Filing Date	2023-08-08
Publication Date	2024-02-15
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Jebreel, Zachary Alex Kenton Kumar, Ramana Richens, Jonathan George Everitt, Tom Åke Helmer Farquhar, Aiken Sebastian Macdermott, Matthew Joseph Tilley

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for identifying agents in a system. According to one aspect, a method comprises: generating data defining a causal model of the system, comprising transmitting instructions to cause a plurality of interventions to be applied to the system, wherein each intervention modifies one or more variable elements in the system; processing the model of the system to identify one or more of the variable elements in the system as being decision elements, wherein each decision element represents an action selected by a respective agent in the system; and identifying one or more agents in the system based on the decision elements; and outputting data that identifies the agents in the system.

IPC Classes ?

G06N 3/092 - Reinforcement learning
G06N 5/045 - Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks

23. FINDING A STATIONARY POINT OF A LOSS FUNCTION BY AN ITERATIVE ALGORITHM USING A VARIABLE LEARNING RATE VALUE

Application Number	EP2023072108
Publication Number	2024/033445
Status	In Force
Filing Date	2023-08-09
Publication Date	2024-02-15
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Rosca, Mihaela Dherin, Benoit Richard Umbert Wu, Yan Qin, Chongli

Abstract

A computer-implemented method for determining, for a loss function which is a function of a parameter vector comprising a plurality of parameters, values for the parameters for which the parameter vector is a stationary point of the loss function. The method comprises determining initial values for the parameters; and repeatedly updating the parameters by: (a) determining at least one drift value indicative of discretization drift for a discrete update to the parameters based on the loss function; (b) determining at least one learning rate value by evaluating a learning rate function based on, and having an inverse relationship with, the at least one drift value; (c) determining respective updates to the parameters based upon a product of the at least one learning rate value and a gradient of the loss function with respect to the respective parameter for current values of the parameters; and (d) updating the parameters based upon the determined respective updates.

IPC Classes ?

G06N 3/09 - Supervised learning
G06F 17/11 - Complex mathematical operations for solving equations
G06N 3/0464 - Convolutional networks [CNN, ConvNet]
G06N 3/045 - Combinations of networks
G06N 3/092 - Reinforcement learning
G06N 3/094 - Adversarial learning

24. OPTIMIZING ALGORITHMS FOR TARGET PROCESSORS USING REPRESENTATION NEURAL NETWORKS

Application Number	EP2023070308
Publication Number	2024/018065
Status	In Force
Filing Date	2023-07-21
Publication Date	2024-01-25
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Michi, Andrea Mankowitz, Daniel J. Zhernov, Anton Gelmi, Marco Oreste Selvi, Marco Paduraru, Cosmin Leurent, Edouard Mandhane, Amol Balkishan Iqbal, Shariq Nadeem Silver, David Riedmiller, Martin Kohli, Pushmeet Vinyals, Oriol

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for optimizing a target algorithm using a state representation neural network.

IPC Classes ?

G06F 8/41 - Compilation
G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
G06N 3/02 - Neural networks

25. MODEL-FREE REINFORCEMENT LEARNING WITH REGULARIZED NASH DYNAMICS

Application Number	EP2023067491
Publication Number	2024/003058
Status	In Force
Filing Date	2023-06-27
Publication Date	2024-01-04
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Perolat, Julien De Vylder, Bart Tuyls, Karl Paul

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a policy neural network that is used to control an agent. In particular, the policy neural network can be trained through model-free reinforcement learning with regularized Nash dynamics.

IPC Classes ?

G06N 3/092 - Reinforcement learning
G06N 3/045 - Combinations of networks

26. SIMULATING INDUSTRIAL FACILITIES FOR CONTROL

Application Number	EP2023067148
Publication Number	2023/247767
Status	In Force
Filing Date	2023-06-23
Publication Date	2023-12-28
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Dutta, Praneet Chervonyi, Iurii Voicu, Octavian Luo, Jerry Jiayu Trochim, Piotr

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for simulating industrial facilities for control. One of the methods includes. at each of a plurality of time steps during a task episode: receiving, from a computer simulator of an industrial facility, measurements representing a current state of the facility; generating, from the measurements, an observation; providing the observation as input to a control policy for controlling the facility; receiving, as output, an action for controlling one or more setpoints of the facility; generating, from the action, one or more control inputs for the one or more setpoints of the facility; and providing, as input to the simulator, (i) the control inputs and (ii) current values for one or more configuration parameters of the simulator to cause the simulator to generate, as output, new measurements representing a new state of the facility.

IPC Classes ?

G05B 13/02 - Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
G05B 19/418 - Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control (DNC), flexible manufacturing systems (FMS), integrated manufacturing systems (IMS), computer integrated manufacturing (CIM)

27. Simulating Physical Environments with Discontinuous Dynamics Using Graph Neural Networks

Application Number	EP2023066187
Publication Number	2023/242378
Status	In Force
Filing Date	2023-06-15
Publication Date	2023-12-21
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Allen, Kelsey Rebecca Lopez Guevara, Tatiana Pfaff, Tobias Sanchez, Alvaro Rubanova, Yulia Stachenfeld, Kimberly Battaglia, Peter William

Abstract

This specification describes a simulation system that performs simulations of physical environments using a graph neural network. At each of one or more time steps in a sequence of time steps in a given time interval, the system can process a representation of a current state of the physical environment at the current time step using the graph neural network to generate a prediction of a next state of the physical environment at the next time step. Generally, the environment has discontinuous dynamics at one or more time points during the time interval.

IPC Classes ?

G06F 30/20 - Design optimisation, verification or simulation
G06F 30/27 - Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
G06F 119/12 - Timing analysis or timing optimisation

28. TRAINING CAMERA POLICY NEURAL NETWORKS THROUGH SELF PREDICTION

Application Number	EP2023066186
Publication Number	2023/242377
Status	In Force
Filing Date	2023-06-15
Publication Date	2023-12-21
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Grimes, Matthew Koichi Mirowski, Piotr Wojciech Modayil, Joseph Varughese

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a camera policy neural network.

IPC Classes ?

G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
G06N 3/045 - Combinations of networks
G06N 3/0895 - Weakly supervised learning, e.g. semi-supervised or self-supervised learning
G06N 3/092 - Reinforcement learning

29. HIERARCHICAL REINFORCEMENT LEARNING AT SCALE

Application Number	EP2023065305
Publication Number	2023/237635
Status	In Force
Filing Date	2023-06-07
Publication Date	2023-12-14
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Soyer, Hubert Josef Behbahani, Feryal Keck, Thomas Albert Nikiforou, Kyriacos Pires, Bernardo Avila Baveja, Satinder Singh

Abstract

The invention describes a system and a method for controlling an agent interacting with an environment to perform a task, the method comprising, at each of a plurality of first time steps from a plurality of time steps: receiving an observation characterizing a state of the environment at the first time step; determining a goal representation for the first time step that characterizes a goal state of the environment to be reached by the agent; processing the observation and the goal representation using a low-level controller neural network to generate a low-level policy output that defines an action to be performed by the agent in response to the observation, wherein the low-level controller neural network comprises: a representation neural network configured to process the observation to generate an internal state representation of the observation, and a low-level policy head configured to process the state observation representation and the goal representation to generate the low-level policy output; and controlling the agent using the low-level policy output.

IPC Classes ?

G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
G06N 3/045 - Combinations of networks
G06N 3/092 - Reinforcement learning
G06N 7/01 - Probabilistic graphical models, e.g. probabilistic networks
G06N 3/044 - Recurrent networks, e.g. Hopfield networks

30. REINFORCEMENT LEARNING TO EXPLORE ENVIRONMENTS USING META POLICIES

Application Number	EP2023065306
Publication Number	2023/237636
Status	In Force
Filing Date	2023-06-07
Publication Date	2023-12-14
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Zintgraf, Luisa Maria Magalhaes Marinho, Zita Alexandra Kemaev, Iurii Kirsch, Louis Michel Oh, Junhyuk Schaul, Tom

Abstract

The invention describes the method performed by one or more computers and for training a base policy neural network that is configured to receive a base policy input comprising an observation of a state of an environment and to process the policy input to generate a base policy output that defines an action to be performed by an agent in response to the observation, the method comprising: generating training data for training the base policy neural network by controlling an agent using (i) the base policy neural network and (ii) an exploration strategy that maps, in accordance with a set of one or more parameters, base policy outputs generated by the base policy neural network to actions performed by the agent to interact with an environment, the generating comprising, at each of a plurality of time points: determining that criteria for updating the exploration strategy are satisfied at the time point; and in response to determining that the criteria are satisfied: generating a meta policy input that comprises data characterizing a performance of the base policy neural network in controlling the agent at the time point; processing the meta policy input using a meta policy to generate a meta policy output that specifies respective values for each of the set of one or more parameters that define the exploration strategy; and controlling the agent using the base policy neural network and in accordance with the exploration strategy defined by the respective values for the set of one or more parameters specified by the meta policy output.

IPC Classes ?

G06N 3/008 - Artificial life, i.e. computing arrangements simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. based on robots replicating pets or humans in their appearance or behaviour
G06N 3/045 - Combinations of networks
G06N 3/092 - Reinforcement learning
G06N 3/0985 - Hyperparameter optimisation; Meta-learning; Learning-to-learn

31. SIMULATING PHYSICAL ENVIRONMENTS USING FINE-RESOLUTION AND COARSE-RESOLUTION MESHES

Application Number	EP2023063755
Publication Number	2023/227586
Status	In Force
Filing Date	2023-05-23
Publication Date	2023-11-30
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Fortunato, Meire Pfaff, Tobias Wirnsberger, Peter Pritzel, Alexander Battaglia, Peter William

Abstract

47 ABSTRACT Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for simulating a state of a physical environment. In one aspect, a method performed by one or more computers for simulating the state of the physical environment is provided. The method includes, for each of multiple time steps: obtaining data defining a fine-resolution mesh and a coarse-resolution mesh that each characterize the state of the physical environment at the current time step, where the fine-resolution mesh has a higher resolution than the coarse-resolution mesh; processing data defining the fine- resolution mesh and the coarse-resolution mesh using a graph neural network that includes: (i) one or more fine-resolution update blocks, (ii) one or more coarse-resolution update blocks, and (iii) one or more up-sampling update blocks; and determining the state of the physical environment at a next time step using updated node embeddings for nodes in the fine-resolution mesh. DeepMind Technologies Limited F&R Ref.: 45288-0255WO1 PCT Application

IPC Classes ?

G06F 30/23 - Design optimisation, verification or simulation using finite element methods [FEM] or finite difference methods [FDM]
G06F 30/27 - Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
G06N 3/02 - Neural networks
G06T 17/20 - Wire-frame description, e.g. polygonalisation or tessellation
G06F 111/10 - Numerical modelling
G06F 113/08 - Fluids

32. EXPLORATION BY BOOTSTEPPED PREDICTION

Application Number	EP2023063282
Publication Number	2023/222772
Status	In Force
Filing Date	2023-05-17
Publication Date	2023-11-23
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Guo, Zhaohan Altché, Florent Tallec, Corentin Pires, Bernardo Avila Pîslar, Miruna Thakoor, Shantanu Yogeshraj Azar, Mohammad Gheshlaghi Piot, Bilal

Abstract

An iterative method is proposed to train an action selection system of a reinforcement learning system, based on a reward function which defines a reward value for each action. The reward value includes an intrinsic reward term generated based on the outputs of two encoder models: an online encoder model and a target encoder model. The online encoder model is iteratively trained based on a loss function, and the target encoder model is updated to bring it closer to the online encoder model.

IPC Classes ?

G06N 3/044 - Recurrent networks, e.g. Hopfield networks
G06N 3/045 - Combinations of networks
G06N 3/092 - Reinforcement learning

33. MACHINE LEARNING SYSTEMS WITH COUNTERFACTUAL INTERVENTIONS

Application Number	EP2023063488
Publication Number	2023/222884
Status	In Force
Filing Date	2023-05-19
Publication Date	2023-11-23
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Rabinowitz, Neil Charles Roy, Nicholas Andrew Kim, Junkyung

Abstract

Systems, methods, and computer programs, for training and using a machine learning system to control an agent to perform a task. The machine learning system is trained using counterfactual internal states so that it can provide an output that explains the behavior of the system in causal terms, e.g. in terms of aspects of its environment that cause the system to select particular actions for the agent.

IPC Classes ?

G06N 3/008 - Artificial life, i.e. computing arrangements simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. based on robots replicating pets or humans in their appearance or behaviour
G06N 3/044 - Recurrent networks, e.g. Hopfield networks
G06N 3/045 - Combinations of networks
G06N 3/084 - Backpropagation, e.g. using gradient descent

34. LARGE-SCALE RETRIEVAL AUGMENTED REINFORCEMENT LEARNING

Application Number	EP2023063492
Publication Number	2023/222885
Status	In Force
Filing Date	2023-05-19
Publication Date	2023-11-23
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Humphreys, Peter Conway Guez, Arthur Clement

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling a reinforcement learning agent in an environment to perform a task. In one aspect, a method comprises: maintaining a retrieval dataset that stores a plurality of history observations and, for each history observation, a respective associated context; receiving a current observation characterizing a current state of the environment; selecting one or more history observations from the plurality of history observations; processing, using an encoder neural network and in accordance with current values of encoder network parameters, an encoder network input comprising (i) the current observation and (ii) the one or more selected history observations and their respective associated context to generate a latent state representation for the current state of the environment; and using the latent state representation to determine an action to be performed by the agent in response to the current observation.

IPC Classes ?

G06F 16/908 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
G06N 3/02 - Neural networks
G06N 20/00 - Machine learning

35. CONTRASTIVE LEARNING USING POSITIVE PSEUDO LABELS

Application Number	EP2023063496
Publication Number	2023/222889
Status	In Force
Filing Date	2023-05-19
Publication Date	2023-11-23
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Mitrovic, Jovana Bosnjak, Matko Richemond, Pierre Tomasev, Nenad Strub, Florian Walker, Jacob Charles Hill, Felix George Buesing, Lars Pascanu, Razvan Blundell, Charles

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network to perform a machine learning task on one or more received inputs by using a hybrid training dataset with a semi-supervised learning technique. The hybrid training dataset includes multiple unlabeled training inputs and multiple labeled training inputs and, in some cases, more unlabeled training inputs than labeled training inputs.

IPC Classes ?

G06N 3/045 - Combinations of networks
G06N 3/0464 - Convolutional networks [CNN, ConvNet]
G06N 3/084 - Backpropagation, e.g. using gradient descent
G06N 3/0895 - Weakly supervised learning, e.g. semi-supervised or self-supervised learning

36. RESOURCE NAVIGATION USING NEURAL NETWORKS

Application Number	EP2023063486
Publication Number	2023/222882
Status	In Force
Filing Date	2023-05-19
Publication Date	2023-11-23
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Marino, Kenneth Daniel Zaheer, Manzil Fergus, Robert David Grathwohl, Will S.

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for resource navigation using neural networks.

IPC Classes ?

G06F 16/33 - Querying
G06N 3/02 - Neural networks
G06F 16/953 - Querying, e.g. by the use of web search engines

37. DETERMINING GENERALIZED EIGENVECTORS USING MULTI-AGENT INTERACTIONS

Application Number	EP2023063487
Publication Number	2023/222883
Status	In Force
Filing Date	2023-05-19
Publication Date	2023-11-23
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Gemp, Ian Michael Mcwilliams, Brian Chen, Charlie Xiangyu

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining generalized eigenvectors that characterize a data set.

IPC Classes ?

G06F 17/16 - Matrix or vector computation

38. INTRA-AGENT SPEECH TO FACILITATE TASK LEARNING

Application Number	EP2023063494
Publication Number	2023/222887
Status	In Force
Filing Date	2023-05-19
Publication Date	2023-11-23
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Yan, Chen Carnevale, Federico Javier Georgiev, Petko Ivanov Santoro, Adam Anthony Guy, Aurelia Adrianna Muldal, Alistair Michael Hung, Chia-Chun Abramson, Joshua Simon Lillicrap, Timothy Paul Wayne, Gregory Duncan

Abstract

Systems, methods, and computer programs for learning to control an embodied agent to perform tasks. The techniques use internal, "intra-agent" speech when learning, and are thus able to perform tasks involving new objects without any direct experience of interacting with those objects, i.e. zero-shot. Implementations of the techniques use an image captioning neural network system to generate natural language captions used when training an action selection neural network system.

IPC Classes ?

G06N 3/008 - Artificial life, i.e. computing arrangements simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. based on robots replicating pets or humans in their appearance or behaviour
G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
G06N 3/044 - Recurrent networks, e.g. Hopfield networks
G06N 5/045 - Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
G06N 3/096 - Transfer learning

39. VARIABLE RESOLUTION VARIABLE FRAME RATE VIDEO CODING USING NEURAL NETWORKS

Application Number	EP2023062431
Publication Number	2023/217867
Status	In Force
Filing Date	2023-05-10
Publication Date	2023-11-16
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Assael, Ioannis Alexandros Shillingford, Brendan

Abstract

Systems and methods for encoding video, and for decoding video at an arbitrary temporal and/or spatial resolution. The techniques use a scene representation neural network that, in implementations, is configured to represent frames of a 2D or 3D video as a 3D model encoded in the parameters of the neural network.

IPC Classes ?

G06N 3/04 - Architecture, e.g. interconnection topology
H04N 19/31 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the temporal domain
H04N 19/33 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the spatial domain

40. NEGOTIATING CONTRACTS FOR AGENT COOPERATION IN MULTI-AGENT SYSTEMS

Application Number	EP2023062432
Publication Number	2023/217868
Status	In Force
Filing Date	2023-05-10
Publication Date	2023-11-16
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Bachrach, Yoram Tacchetti, Andrea Gemp, Ian Michael Kramár, János Malinowski, Mateusz Mckee, Kevin Robert

Abstract

Methods, systems and apparatus, including computer programs encoded on computer storage media, for enabling agents to cooperate with one another in a way that improves their collective efficiency. The agents can modify their behavior by taking into account the behavior of other agents, so that a better overall result can be achieved than if each agent acted independently. This is done by enabling the agents to negotiate contracts with one another that restrict their respective actions.

IPC Classes ?

G06Q 10/04 - Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
G06N 3/02 - Neural networks

41. SELECTION-INFERENCE NEURAL NETWORK SYSTEMS

Application Number	EP2023062781
Publication Number	2023/218040
Status	In Force
Filing Date	2023-05-12
Publication Date	2023-11-16
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Creswell, Antonia Phoebe Nina

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a response to a query input using a selection- inference neural network.

IPC Classes ?

G06N 3/045 - Combinations of networks
G06F 40/20 - Natural language analysis
G06N 5/02 - Knowledge representation; Symbolic representation
G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
G06F 40/35 - Discourse or dialogue representation
G06N 5/045 - Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
G06N 3/096 - Transfer learning
G06N 5/046 - Forward inferencing; Production systems

42. DATA COMPRESSION AND RECONSTRUCTION USING SPARSE META-LEARNED NEURAL NETWORKS

Application Number	EP2023061711
Publication Number	2023/213903
Status	In Force
Filing Date	2023-05-03
Publication Date	2023-11-09
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Schwarz, Jonathan

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for compressing and decompressing data signals using sparse, meta-learned neural networks.

IPC Classes ?

G06N 3/045 - Combinations of networks
G06N 3/0495 - Quantised networks; Sparse networks; Compressed networks
G06N 3/082 - Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
G06N 3/084 - Backpropagation, e.g. using gradient descent
G06N 3/0985 - Hyperparameter optimisation; Meta-learning; Learning-to-learn

43. PRIVACY-SENSITIVE NEURAL NETWORK TRAINING USING DATA AUGMENTATION

Application Number	EP2023061322
Publication Number	2023/209192
Status	In Force
Filing Date	2023-04-28
Publication Date	2023-11-02
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	De, Soham De Balle Pigem, Borja Hayes, Jamie Smith, Samuel Laurence Berrada Lancrey Javal, Leonard Alix Jean Eric

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for privacy-sensitive training of a neural network. In one aspect, a method includes training a set of neural network parameters of the neural network on a set of training data over multiple training iterations to optimize an objective function. Each training iteration includes: sampling a batch of network inputs from the set of training data; determining a clipped gradient for each network input in the batch of network inputs; and updating the neural network parameters using the clipped gradients for the network inputs in the batch of network inputs.

IPC Classes ?

G06N 3/084 - Backpropagation, e.g. using gradient descent

44. LANGUAGE MODEL FOR PROCESSING A MULTI-MODE QUERY INPUT

Application Number	EP2023061330
Publication Number	2023/209198
Status	In Force
Filing Date	2023-04-28
Publication Date	2023-11-02
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Alayrac, Jean-Baptiste Donahue, Jeffrey Lenc, Karel Simonyan, Karen Reynolds, Malcolm Kevin Campbell Luc, Pauline Mensch, Arthur Barr, Iain Miech, Antoine Hasson, Yana Elizabeth Millican, Katherine Elizabeth Ring, Roman

Abstract

A query processing system is described which receives a query input comprising an input token string and also at least one data item having a second, different modality, and generates a corresponding output token string.

IPC Classes ?

G06N 3/045 - Combinations of networks
G06N 3/0475 - Generative networks
G06N 3/08 - Learning methods

45. EPISTEMIC MACHINE LEARNING MODELS

Application Number	EP2023059619
Publication Number	2023/198807
Status	In Force
Filing Date	2023-04-13
Publication Date	2023-10-19
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Osband, Ian David Moffat Wen, Zheng Asgharipari, Seyedmohammad Dwaracherla, Vikranth Reddy Lu, Xiuyuan Van Roy, Benjamin

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing inputs using an epistemic machine learning model that improves the quality of outputs generated by a base machine learning model.

IPC Classes ?

G06N 3/0464 - Convolutional networks [CNN, ConvNet]
G06N 3/047 - Probabilistic or stochastic networks
G06N 3/09 - Supervised learning
G06N 3/045 - Combinations of networks
G06N 3/096 - Transfer learning

46. ALLOCATING COMPUTING RESOURCES BETWEEN MODEL SIZE AND TRAINING DATA DURING TRAINING OF A MACHINE LEARNING MODEL

Application Number	EP2023058150
Publication Number	2023/186987
Status	In Force
Filing Date	2023-03-29
Publication Date	2023-10-05
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Hoffmann, Jordan Borgeaud Dit Avocat, Sebastian Mensch, Arthur Sifre, Laurent

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a machine learning model to perform a machine learning task. In one aspect, a method performed by one or more computer is described. The method includes: obtaining data defining a compute budget that characterizes an amount of computing resources allocated for training a machine learning model to perform a machine learning task; processing the data defining the compute budget using an allocation mapping, in accordance with a set of allocation mapping parameters, to generate an allocation tuple defining: (i) a target model size for the machine learning model, and (ii) a target amount of training data for training the machine learning model; instantiating the machine learning model, where the machine learning model has the target model size; and obtaining the target amount of training data for training the machine learning model.

IPC Classes ?

G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]
G06N 20/00 - Machine learning

47. ENHANCING IMAGES FROM A MOBILE DEVICE TO GIVE A PROFESSIONAL CAMERA EFFECT

Application Number	EP2023054669
Publication Number	2023/186417
Status	In Force
Filing Date	2023-02-24
Publication Date	2023-10-05
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Assael, Ioannis Alexandros Shillingford, Brendan

Abstract

Systems and methods for processing an image from a mobile device so that it appears to have been captured by a camera with particular characteristics, for example a digital SLR camera with particular settings. The system uses a trained image enhancement neural network. The image enhancement neural network can be trained without needing to rely on pairs of images of the same scene; some training methods are described.

IPC Classes ?

H04N 23/80 - Camera processing pipelines; Components thereof
G06N 3/02 - Neural networks
G06N 20/00 - Machine learning
G06T 5/00 - Image enhancement or restoration

48. CONTROLLING ROBOTS USING LATENT ACTION VECTOR CONDITIONED CONTROLLER NEURAL NETWORKS

Application Number	EP2023057855
Publication Number	2023/180585
Status	In Force
Filing Date	2023-03-27
Publication Date	2023-09-28
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Bohez, Steven Tunyasuvunakool, Saran

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using a hierarchical controller that includes a task policy neural network and a low-level controller neural network.

IPC Classes ?

G06N 3/044 - Recurrent networks, e.g. Hopfield networks
G06N 3/045 - Combinations of networks
G06N 3/047 - Probabilistic or stochastic networks
G06N 3/0499 - Feedforward networks
G06N 3/084 - Backpropagation, e.g. using gradient descent
G06N 3/092 - Reinforcement learning

49. GENERATING OUTPUT SEQUENCES WITH INLINE EVIDENCE USING LANGUAGE MODEL NEURAL NETWORKS

Application Number	EP2023056778
Publication Number	2023/175089
Status	In Force
Filing Date	2023-03-16
Publication Date	2023-09-21
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Menick, Jacob Lee Mikulik, Vladmir Trebacz, Maja Maria Mcaleese-Park, Nathaniel John Irving, Geoffrey

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating output sequences using language model neural networks. In particular, the output sequences include a response to an input query and inline evidence that includes a quote from a context document that supports the response.

IPC Classes ?

G06F 40/00 - Handling natural language data
G06N 3/02 - Neural networks
G06F 16/33 - Querying

50. TRAINING OBJECT DISCOVERY NEURAL NETWORKS AND FEATURE REPRESENTATION NEURAL NETWORKS USING SELF-SUPERVISED LEARNING

Application Number	EP2022056866
Publication Number	2023/169696
Status	In Force
Filing Date	2022-03-16
Publication Date	2023-09-14
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Zisserman, Andrew Zoran, Daniel Jaegle, Andrew Coulter Shelhamer, Evan Gerard Carreira, Joao Hénaff, Olivier Jean Arandjelovic, Relja Koppula, Skanda Kumar

Abstract

A neural network system that is configured to learn a representation of data item, such as an image, audio, or text data item, through a self-supervised learning process. Implementations of the system couple two learning processes, an object discovery learning process and an object feature representation learning process. In implementations the object discovery learning process assists the object feature representation learning process in self-supervised learning of object feature representations, and the object feature representation learning process is used to improve the object discovery learning process.

IPC Classes ?

G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods

51. GENERATING COMPRESSED REPRESENTATIONS OF VIDEO FOR EFFICIENT LEARNING OF VIDEO TASKS

Application Number	EP2023055757
Publication Number	2023/170069
Status	In Force
Filing Date	2023-03-07
Publication Date	2023-09-14
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Wiles, Olivia Anne Malinowski, Mateusz Zisserman, Andrew Barr, Iain Carreira, Joao

Abstract

A method is proposed to train an adaptive system to perform a video processing task, based on a database of compressed representations of video data items. The compressed representations were generated by a trained adaptive compressor unit.

IPC Classes ?

H04N 19/146 - Data rate or code amount at the encoder output
G06N 3/045 - Combinations of networks
G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
G06N 3/0464 - Convolutional networks [CNN, ConvNet]
G06N 3/08 - Learning methods
G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/177 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]

52. TRACKING QUERY POINTS IN VIDEOS USING NEURAL NETWORKS

Application Number	EP2023055758
Publication Number	2023/170070
Status	In Force
Filing Date	2023-03-07
Publication Date	2023-09-14
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Doersch, Carl Gupta, Ankush Markeeva, Larisa Greff, Klaus Tagliasacchi, Andrea Recasens Continente, Adrià Aytar, Yusuf Carreira, Joao Zisserman, Andrew Yang, Yi

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for tracking query points in videos using a point tracking neural network.

IPC Classes ?

G06V 10/44 - Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
G06N 3/045 - Combinations of networks
G06N 3/0464 - Convolutional networks [CNN, ConvNet]
G06N 3/084 - Backpropagation, e.g. using gradient descent
G06T 7/246 - Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
G06V 10/50 - Extraction of image or video features by summing image-intensity values; Projection analysis
G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
G06V 20/40 - Scenes; Scene-specific elements in video content

53. PROCESSING NETWORK INPUTS USING PARTITIONED ATTENTION

Application Number	EP2023055753
Publication Number	2023/170067
Status	In Force
Filing Date	2023-03-07
Publication Date	2023-09-14
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Recasens Continente, Adrià Lin, Jason Jiachen Wang, Luyu Alayrac, Jean-Baptiste Jaegle, Andrew Coulter Carreira, Joao Luc, Pauline Miech, Antoine De Freitas Smaira, Lucas Hemsley, Ross Zisserman, Andrew

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing network inputs using a neural network that implements partitioned attention.

IPC Classes ?

G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
G06N 3/096 - Transfer learning
G06N 3/0895 - Weakly supervised learning, e.g. semi-supervised or self-supervised learning

54. AGENT CONTROL THROUGH CULTURAL TRANSMISSION

Application Number	EP2023055474
Publication Number	2023/166195
Status	In Force
Filing Date	2023-03-03
Publication Date	2023-09-07
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Bhoopchand, Avishkar Ajay Collister, Adrian Ashley Edwards, Ashley Deloris Everett, Richard Hughes, Edward Fauchon Mathewson, Kory Wallace Pîslar, Miruna Zacherl, Alexander Zhang, Lei

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using a policy neural network that has been trained to allow the agent to achieve cultural transmission after training.

IPC Classes ?

G06N 3/096 - Transfer learning
G06N 3/092 - Reinforcement learning

55. COMPUTER CODE GENERATION FROM TASK DESCRIPTIONS USING NEURAL NETWORKS

Application Number	EP2023052595
Publication Number	2023/148286
Status	In Force
Filing Date	2023-02-02
Publication Date	2023-08-10
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Li, Yujia Choi, David Hugo Chung, Junyoung Kushman, Nathaniel Arthur Schrittwieser, Julian Leblond, Rémi Eccles, Thomas Edward Keeling, James Thomas Gimeno Gil, Felix Axel Dal Lago, Agustín Matías Hubert, Thomas Keisuke Choy, Peter De Masson D'Autume, Cyprien Sutherland Robson, Esme Vinyals, Oriol

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating computer code using neural networks. One of the methods includes receiving description data describing a computer programming task; receiving a first set of inputs for the computer programming task; generating a plurality of candidate computer programs by sampling a plurality of output sequences from a set of one or more generative neural networks; for each candidate computer program in a subset of the candidate computer programs and for each input in the first set: executing the candidate computer program on the input to generate an output; and selecting, from the candidate computer programs, one or more computer programs as synthesized computer programs for performing the computer programming task based at least in part on the outputs generated by executing the candidate computer programs in the subset on the inputs in the first set of inputs.

IPC Classes ?

G06F 8/30 - Creation or generation of source code
G06N 3/02 - Neural networks
G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks

56. GENERATING SEQUENCES OF DATA ELEMENTS USING CROSS-ATTENTION OPERATIONS

Application Number	EP2023052026
Publication Number	2023/144316
Status	In Force
Filing Date	2023-01-27
Publication Date	2023-08-03
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Hawthorne, Curtis Glenn-Macway Jaegle, Andrew Coulter Cangea, Catalina-Codruta Borgeaud Dit Avocat, Sebastian Nash, Charlie Thomas Curtis Malinowski, Mateusz Dieleman, Sander Etienne Lea Vinyals, Oriol Botvinick, Matthew Simon, Ian Stuart Sheahan, Hannah Rachel Zeghidour, Neil Alayrac, Jean-Baptiste Carreira, Joao Engel, Jesse

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a sequence of data elements that includes a respective data element at each position in a sequence of positions. In one aspect, a method includes: for each position after a first position in the sequence of positions: obtaining a current sequence of data element embeddings that includes a respective data element embedding of each data element at a position that precedes the current position, obtaining a sequence of latent embeddings, and processing: (i) the current sequence of data element embeddings, and (ii) the sequence of latent embeddings, using a neural network to generate the data element at the current position. The neural network includes a sequence of neural network blocks including: (i) a cross-attention block, (ii) one or more self-attention blocks, and (iii) an output block.

IPC Classes ?

G06N 3/045 - Combinations of networks
G06N 3/09 - Supervised learning
G06N 3/084 - Backpropagation, e.g. using gradient descent

57. LOCAL CROSS-ATTENTION OPERATIONS IN NEURAL NETWORKS

Application Number	EP2023052183
Publication Number	2023/144385
Status	In Force
Filing Date	2023-01-30
Publication Date	2023-08-03
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Carreira, Joao Jaegle, Andrew Coulter Koppula, Skanda Kumar Zoran, Daniel Recasens Continente, Adrià Ionescu, Catalin-Dumitru Hénaff, Olivier Jean Shelhamer, Evan Gerard Arandjelovic, Relja Botvinick, Matthew Vinyals, Oriol Simonyan, Karen Zisserman, Andrew

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for using a neural network to generate a network output that characterizes an entity. In one aspect, a method includes: obtaining a representation of the entity as a set of data element embeddings, obtaining a set of latent embeddings, and processing: (i) the set of data element embeddings, and (ii) the set of latent embeddings, using the neural network to generate the network output. The neural network includes a sequence of neural network blocks including: (i) one or more local cross-attention blocks, and (ii) an output block. Each local cross-attention block partitions the set of latent embeddings and the set of data element embeddings into proper subsets, and updates each proper subset of the set of latent embeddings using attention over only the corresponding proper subset of the set of data element embeddings.

IPC Classes ?

G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
G06N 3/0495 - Quantised networks; Sparse networks; Compressed networks
G06N 3/084 - Backpropagation, e.g. using gradient descent

58. CONTROLLING REINFORCEMENT LEARNING AGENTS USING GEOMETRIC POLICY COMPOSITION

Application Number	EP2023052205
Publication Number	2023/144395
Status	In Force
Filing Date	2023-01-30
Publication Date	2023-08-03
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Rowland, Mark Daniel Thakoor, Shantanu Yogeshraj Da Motta Salles Barreto, Andre Borsa, Diana Luiza Dabney, William Clinton Munos, Remi

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling a reinforcement learning agent in an environment. One of the methods may include maintaining data specifying a base policy set comprising a plurality of base policies for controlling the agent; receiving a current observation characterizing a current state of the environment; generating, for each of the plurality of base policies, one or more predicted future observations characterizing respective future states of the environment that are subsequent to the current state of the environment; using the predicted future observations generated for the plurality of base policies to determine a respective estimated value for each composite policy in a composite policy set with respect to the current state of the environment; and selecting an action using the respective estimated values for the composite policies.

IPC Classes ?

G06N 3/045 - Combinations of networks
G06N 3/047 - Probabilistic or stochastic networks
G06N 3/088 - Non-supervised learning, e.g. competitive learning
G06N 3/092 - Reinforcement learning

59. LEARNED COMPUTER CONTROL USING POINTING DEVICE AND KEYBOARD ACTIONS

Application Number	EP2023052210
Publication Number	2023/144397
Status	In Force
Filing Date	2023-01-30
Publication Date	2023-08-03
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Humphreys, Peter Conway Lillicrap, Timothy Paul Pohlen, Tobias Markus Santoro, Adam Anthony

Abstract

A computer-implemented method for controlling a particular computer to execute a task is described. The method includes receiving a control input comprising a visual input, the visual input including one or more screen frames of a computer display that represent at least a current state of the particular computer; processing the control input using a neural network to generate one or more control outputs that are used to control the particular computer to execute the task, in which the one or more control outputs include an action type output that specifies at least one of a pointing device action or a keyboard action to be performed to control the particular computer; determining one or more actions from the one or more control outputs; and executing the one or more actions to control the particular computer.

IPC Classes ?

G06F 3/01 - Input arrangements or combined input and output arrangements for interaction between user and computer
G06F 3/038 - Control and interface arrangements therefor, e.g. drivers or device-embedded control circuitry
G06N 3/0442 - Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
G06N 3/0464 - Convolutional networks [CNN, ConvNet]
G06F 40/00 - Handling natural language data
G06F 3/048 - Interaction techniques based on graphical user interfaces [GUI]
G06F 3/16 - Sound input; Sound output
G06N 3/0475 - Generative networks
G06N 3/092 - Reinforcement learning

60. AUGMENTING MACHINE LEARNING LANGUAGE MODELS USING SEARCH ENGINE RESULTS

Application Number	EP2023052286
Publication Number	2023/144413
Status	In Force
Filing Date	2023-01-31
Publication Date	2023-08-03
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Lazaridou, Angeliki Gribovskaya, Elena Grigorev, Nikolai Stokowiec, Wojciech Jan

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for augmenting machine learning language models using search engine results. One of the methods includes obtaining question data representing a question; generating, from the question data, a search engine query for a search engine; obtaining a plurality of documents identified by the search engine in response to processing the search engine query; generating, from the plurality of documents, a plurality of conditioning inputs each representing at least a portion of one or more of the obtained documents; for each of a plurality of the generated conditioning inputs, processing a network input generated from (i) the question data and (ii) the conditioning input using a neural network to generate a network output representing a candidate answer to the question; and generating, from the network outputs representing respective candidate answers, answer data representing a final answer to the question.

IPC Classes ?

G06F 16/953 - Querying, e.g. by the use of web search engines
G06N 20/00 - Machine learning
G06F 16/332 - Query formulation

61. OPTIMIZING DESIGN PARAMETERS USING A SIMULATION NEURAL NETWORK

Application Number	EP2023052177
Publication Number	2023/144383
Status	In Force
Filing Date	2023-01-30
Publication Date	2023-08-03
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Allen, Kelsey Rebecca Lopez Guevara, Tatiana Stachenfeld, Kimberly Hamrick, Jessica Blake Chandler Sanchez, Alvaro Battaglia, Peter William Pfaff, Tobias

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for optimizing a set of design parameters. In one aspect, a method includes: obtaining a respective initial value for each design parameter, and iteratively optimizing current values of the design parameters over a sequence of optimization iterations. The method further includes, each optimization iteration: generating a representation of an initial state of an environment using the current values of the design parameters, processing an input including the representation of the initial state of the environment using a simulation neural network to generate an output that defines a simulation of the state of the environment over a sequence of one or more time steps, determining a reward, determining gradients of the reward with respect to the current values of the design parameters, and updating the current values of the design parameters using the gradients.

IPC Classes ?

G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
G06N 3/045 - Combinations of networks
G06N 3/084 - Backpropagation, e.g. using gradient descent
G06N 3/09 - Supervised learning

62. GENERATING DATA ITEMS USING OFF-THE-SHELF GUIDED GENERATIVE DIFFUSION PROCESSES

Application Number	EP2023052186
Publication Number	2023/144386
Status	In Force
Filing Date	2023-01-30
Publication Date	2023-08-03
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Durkan, Conor Michael Dieleman, Sander Etienne Lea Binkowski, Mikolaj Shang, Wenling

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a data item using a diffusion neural network. In particular, the data item is generated by guiding a reverse diffusion process using a time-independent guidance neural network.

IPC Classes ?

G06N 3/045 - Combinations of networks
G06N 3/0475 - Generative networks

63. LARGE SCALE RETRIEVAL FOR SEQUENCE GENERATION

Application Number	EP2022084592
Publication Number	2023/104793
Status	In Force
Filing Date	2022-12-06
Publication Date	2023-06-15
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Borgeaud Dit Avocat, Sebastian Sifre, Laurent Mensch, Arthur Hoffmann, Jordan

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a final output sequence. In one aspect, a method comprises: receiving a current output sequence comprising one or more current output segments; receiving a set of reference segments and a respective reference segment embedding of each reference segment that has been generated using an embedding neural network; for each current output segment: processing the current output segment using the embedding neural network to generate a current output segment embedding of the current output segment; and selecting k most similar reference segments to the current output segment using the reference segment embeddings and the current output segment embedding; and processing the current output sequence and the k most similar reference segments for each current output segment to generate an additional output segment that follows the current output sequence in the final output sequence.

IPC Classes ?

G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
G06N 3/084 - Backpropagation, e.g. using gradient descent
G06N 3/09 - Supervised learning
G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
G06F 40/56 - Natural language generation
G06F 16/332 - Query formulation
G06N 3/0464 - Convolutional networks [CNN, ConvNet]
G06N 3/0499 - Feedforward networks

64. TRAINING CONDITIONAL COMPUTATION NEURAL NETWORKS USING REINFORCEMENT LEARNING

Application Number	EP2022084779
Publication Number	2023/104879
Status	In Force
Filing Date	2022-12-07
Publication Date	2023-06-15
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Clark, Aidan Mensch, Arthur

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network having one or more conditional computation layers, where each conditional computation layer includes a gating sub-layer having multiple gating parameters and an expert sub-layer having multiple expert neural networks. In one aspect, a method comprises: sampling a batch of target output sequences that comprises a respective ground truth output token at each of multiple output positions; for each target output sequence, processing the target output sequence using the neural network to generate a network output that includes respective score distributions over the vocabulary of output tokens for the output positions in the target output sequence; and training each gating sub-layer using respective rewards for the gating sub-layer for the output positions through reinforcement learning to optimize a reinforcement learning objective function that measures an expected reward received by the gating sub-layer.

IPC Classes ?

G06N 3/084 - Backpropagation, e.g. using gradient descent
G06N 3/092 - Reinforcement learning
G06N 3/0464 - Convolutional networks [CNN, ConvNet]
G06N 3/09 - Supervised learning

65. CONTROLLING INTERACTIVE AGENTS USING MULTI-MODAL INPUTS

Application Number	EP2022084780
Publication Number	2023/104880
Status	In Force
Filing Date	2022-12-07
Publication Date	2023-06-15
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Abramson, Joshua Simon Ahuja, Arun Carnevale, Federico Javier Georgiev, Petko Ivanov Hung, Chia-Chun Lillicrap, Timothy Paul Muldal, Alistair Michael Santoro, Adam Anthony Von Glehn, Tamara Louise Landon, Jessica Paige Wayne, Gregory Duncan Yan, Chen Zhu, Rui

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an interactive agent can be controlled based on multi-modal inputs that include both an observation image and a natural language text sequence.

IPC Classes ?

G06N 3/045 - Combinations of networks
G06N 3/0442 - Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
G06N 3/092 - Reinforcement learning
G06N 3/084 - Backpropagation, e.g. using gradient descent
G06N 3/0895 - Weakly supervised learning, e.g. semi-supervised or self-supervised learning
G06N 3/008 - Artificial life, i.e. computing arrangements simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. based on robots replicating pets or humans in their appearance or behaviour
G05D 1/00 - Control of position, course, altitude, or attitude of land, water, air, or space vehicles, e.g. automatic pilot

66. DESIGNING PROTEINS BY JOINTLY MODELING SEQUENCE AND STRUCTURE

Application Number	EP2022082678
Publication Number	2023/094335
Status	In Force
Filing Date	2022-11-21
Publication Date	2023-06-01
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Kohl, Simon Jumper, John Senior, Andrew W. Zambaldi, Vinicius Galiazzi Schneider, Rosalia Bates, Russell James Stanton, Gabriella Hayley Fergus, Robert David Ionescu, Catalin-Dumitru La, David

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designing a protein by jointly generating an amino acid sequence and a structure of the protein. In one aspect, a method comprises: generating data defining the amino acid sequence and the structure of the protein using a protein design neural network, comprising, for a plurality of positions in the amino acid sequence: receiving the current representation of the protein as of the current position; processing the current representation of the protein using the protein design neural network to generate design data for the current position that comprises: (i) data identifying an amino acid at the current position, and (ii) a set of structure parameters for the current position; and updating the current representation of the protein using the design data for the current position.

IPC Classes ?

G16B 35/00 - ICT specially adapted for in silico combinatorial libraries of nucleic acids, proteins or peptides

67. DISTRIBUTED TOP K COMPUTATION

Application Number	EP2022082276
Publication Number	2023/089030
Status	In Force
Filing Date	2022-11-17
Publication Date	2023-05-25
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Elsen, Erich Konrad Abercrombie, Stuart Christopher Benedict

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for distributing a top k computation across multiple computing units of an integrated circuit. One of the methods includes computing, by each of the plurality of computing units and for each candidate vector in a respective subset of the candidate vectors assigned to the computing unit, a respective distance between the query vector and the candidate vector; initializing, by the integrated circuit, a cut-off distance value; determining, by the integrated circuit, a final cut-off distance value; and providing, by the integrated circuit and as an output of a top k computation for the query vector and the set of candidate vectors, the candidate vectors that have respective distances that satisfy the final cut-off distance value.

IPC Classes ?

G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

68. COORDINATION OF MULTIPLE ROBOTS USING GRAPH NEURAL NETWORKS

Application Number	EP2022075656
Publication Number	2023/057185
Status	In Force
Filing Date	2022-09-15
Publication Date	2023-04-13
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Lai, Matthew Scholz, Jonathan Karl Chen, Jose Enrique

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling a plurality of robots. One of the methods includes: obtaining state data representing a current state of the environment; generating, from the state data, graph data representing a graph of the current state of the environment; processing the graph data using a graph neural network to generate a graph output that comprises a respective updated feature representation for each of the robot nodes in the graph; and selecting, based on the graph output, a respective action to be performed by each of the robots.

IPC Classes ?

B25J 9/00 - Programme-controlled manipulators
B25J 9/16 - Programme controls

69. TRAINING A NEURAL NETWORK TO PREDICT MULTI-CHAIN PROTEIN STRUCTURES

Application Number	EP2022077595
Publication Number	2023/057455
Status	In Force
Filing Date	2022-10-04
Publication Date	2023-04-13
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Evans, Richard Andrew O'Neill, Michael James Pritzel, Alexander Antropova, Natasha Olegovna Green, Timothy Frederick Goldie Jumper, John

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for predicting a structure of a protein that comprises a plurality of amino acid chains using a protein structure prediction neural network, where each chain comprises a respective sequence of amino acids. In one aspect, a method comprises: receiving a network input for the protein structure prediction neural network, wherein the network input characterizes the protein; processing the network input characterizing the protein using the protein structure prediction neural network to generate a network output that characterizes a predicted structure of the protein; and determining the predicted structure of the protein based on the network output.

IPC Classes ?

G16B 15/20 - Protein or domain folding
G16B 40/20 - Supervised data analysis
G16B 30/10 - Sequence alignment; Homology search
G16B 5/20 - Probabilistic models

70. HIERARCHICAL LATENT MIXTURE POLICIES FOR AGENT CONTROL

Application Number	EP2022077694
Publication Number	2023/057511
Status	In Force
Filing Date	2022-10-05
Publication Date	2023-04-13
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Rao, Dushyant Sadeghi, Fereshteh Hasenclever, Leonard Wulfmeier, Markus Zambelli, Martina Vezzani, Giulia Tirumala Bukkapatnam, Dhruva Aytar, Yusuf Merel, Joshua Heess, Nicolas Manfred Otto Hadsell, Raia Thais

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using a hierarchical controller that includes a high-level controller neural network, a mid-level controller neural network, and a low-level controller neural network.

IPC Classes ?

G06N 3/092 - Reinforcement learning
G06N 3/0464 - Convolutional networks [CNN, ConvNet]
G06N 3/096 - Transfer learning
G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
G06N 3/045 - Combinations of networks

71. RETRIEVAL AUGMENTED REINFORCEMENT LEARNING

Application Number	EP2022077696
Publication Number	2023/057512
Status	In Force
Filing Date	2022-10-05
Publication Date	2023-04-13
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Goyal, Anirudh Banino, Andrea Friesen, Abram Luke Weber, Theophane Guillaume Badia, Adrià Puigdomènech Ke, Nan Osindero, Simon Lillicrap, Timothy Paul Blundell, Charles

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling a reinforcement learning agent in an environment to perform a task using a retrieval-augmented action selection process. One of the methods includes receiving a current observation characterizing a current state of the environment; processing an encoder network input comprising the current observation to determine a policy neural network hidden state that corresponds to the current observation; maintaining a plurality of trajectories generated as a result of the reinforcement learning agent interacting with the environment; selecting one or more trajectories from the plurality of trajectories; updating the policy neural network hidden state using update data determined from the one or more selected trajectories; and processing the updated hidden state using a policy neural network to generate a policy output that specifies an action to be performed by the agent in response to the current observation.

IPC Classes ?

G06N 3/092 - Reinforcement learning
G06N 3/006 - Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
G06N 3/0442 - Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
G06N 3/0464 - Convolutional networks [CNN, ConvNet]
G06N 3/084 - Backpropagation, e.g. using gradient descent

72. CONFORMAL TRAINING OF MACHINE-LEARNING MODELS

Application Number	EP2022077703
Publication Number	2023/057516
Status	In Force
Filing Date	2022-10-05
Publication Date	2023-04-13
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Cemgil, Ali Taylan Doucet, Arnaud Dvijotham, Krishnamurthy Stutz, David

Abstract

Methods, computer systems, and apparatus, including computer programs encoded on computer storage media, for training a classification machine-learning model. The system obtains calibration training examples and prediction training examples, determines a threshold value based on the calibration training examples, generates data characterizing predicted confidence sets based on the threshold value and the prediction training examples, and update model parameters based at least on the predicted confidence sets.

IPC Classes ?

G06N 3/084 - Backpropagation, e.g. using gradient descent
G06N 3/09 - Supervised learning
G06N 3/042 - Knowledge-based neural networks; Logical representations of neural networks
G16H 50/20 - ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems

73. DEMONSTRATION-DRIVEN REINFORCEMENT LEARNING

Application Number	EP2022077706
Publication Number	2023/057518
Status	In Force
Filing Date	2022-10-05
Publication Date	2023-04-13
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Sushkov, Oleg O. Davchev, Todor Bozhinov Scholz, Jonathan Karl

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a reinforcement learning system to select actions to be performed by an agent interacting with an environment to perform a particular task. In one aspect, one of the methods includes obtaining a training sequence comprising a respective training observations at each of a plurality of time steps; obtaining demonstration data comprising one or more demonstration sequences; generating a new training sequence from the training sequence and the demonstration data; and training the goal-conditioned policy neural network on the new training sequence through reinforcement learning.

IPC Classes ?

G06N 3/08 - Learning methods
G06N 3/00 - Computing arrangements based on biological models
G06N 3/04 - Architecture, e.g. interconnection topology

74. STEP-UNROLLED DENOISING NEURAL NETWORKS

Application Number	EP2022077806
Publication Number	2023/057565
Status	In Force
Filing Date	2022-10-06
Publication Date	2023-04-13
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Savinov, Nikolay Chung, Junyoung Binkowski, Mikolaj Van Den Oord, Aaron Gerard Antonius Elsen, Erich Konrad

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating output sequences using a non-auto-regressive neural network.

IPC Classes ?

G06N 3/088 - Non-supervised learning, e.g. competitive learning
G06N 3/0455 - Auto-encoder networks; Encoder-decoder networks
G06N 3/047 - Probabilistic or stochastic networks
G06N 3/084 - Backpropagation, e.g. using gradient descent

75. RATING TASKS AND POLICIES USING CONDITIONAL PROBABILITY DISTRIBUTIONS DERIVED FROM EQUILIBRIUM-BASED SOLUTIONS OF GAMES

Application Number	EP2022078056
Publication Number	2023/057651
Status	In Force
Filing Date	2022-10-10
Publication Date	2023-04-13
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Marris, Luke Christopher

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for rating tasks and policies using conditional probability distributions derived from equilibrium-based solutions of games. One of the methods includes: determining, for each action selection policy in a pool of action selection policies, a respective performance measure of the action selection policy on each task in a pool of tasks, processing the performance measures of the action selection policies on the tasks to generate data defining a joint probability distribution over a set of action selection policy - task pairs, and processing the joint probability distribution over the set of action selection policy - task pairs to generate a respective rating for each action selection policy in the pool of action selection policies, where the respective rating for each action selection policy characterizes a utility of the action selection policy in performing tasks from the pool of tasks.

IPC Classes ?

G06N 3/00 - Computing arrangements based on biological models
G06N 5/00 - Computing arrangements using knowledge-based models
G06N 3/08 - Learning methods
G06N 7/00 - Computing arrangements based on specific mathematical models

76. NEURAL NETWORKS WITH TRANSFORMED ACTIVATION FUNCTION LAYERS

Application Number	EP2022077600
Publication Number	2023/052653
Status	In Force
Filing Date	2022-10-04
Publication Date	2023-04-06
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Martens, James Zhang, Guodong Swirszcz, Grzegorz Michael Ballard, Andrew James Desjardins, Guillaume

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing inputs using a neural network system that includes one or more transformed activation function layers.

IPC Classes ?

G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods

77. AUTOREGRESSIVELY GENERATING SEQUENCES OF DATA ELEMENTS DEFINING ACTIONS TO BE PERFORMED BY AN AGENT

Application Number	EP2022072731
Publication Number	2023/025607
Status	In Force
Filing Date	2022-08-12
Publication Date	2023-03-02
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Reed, Scott Ellison Zolna, Konrad Parisotto, Emilio Erez, Tom Novikov, Alexander Rae, Jack William Denil, Misha Man Ray Gomes De Freitas, Joao Ferdinando Vinyals, Oriol Gomez, Sergio Edwards, Ashley Deloris Bruce, Jacob Barth-Maron, Gabriel

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for selecting actions to be performed by an agent to interact with an environment using an action selection neural network. In one aspect, a method comprises, at each time step in a sequence of time steps: generating a current representation of a state of a task being performed by the agent in the environment as of the current time step as a sequence of data elements; autoregressively generating a sequence of data elements representing a current action to be performed by the agent at the current time step; and after autoregressively generating the sequence of data elements representing the current action, causing the agent to perform the current action at the current time step.

IPC Classes ?

G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods

78. CONTROLLING AGENTS BY SWITCHING BETWEEN CONTROL POLICIES DURING TASK EPISODES

Application Number	EP2022071864
Publication Number	2023/012234
Status	In Force
Filing Date	2022-08-03
Publication Date	2023-02-09
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Schaul, Tom Pîslar, Miruna

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled to perform a task episode by switching the control policy that is used to control the agent at one or more time steps during the task episode.

IPC Classes ?

G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods
G06N 3/00 - Computing arrangements based on biological models

79. TRAINING AGENT NEURAL NETWORKS THROUGH OPEN-ENDED LEARNING

Application Number	EP2022071137
Publication Number	2023/006848
Status	In Force
Filing Date	2022-07-27
Publication Date	2023-02-02
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Jaderberg, Maxwell Elliot Czarnecki, Wojciech

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for raining an agent neural network for use in controlling an agent to perform a plurality of tasks. One of the methods includes maintaining population data specifying a population of one or more candidate agent neural networks; and training each candidate agent neural network on a respective set of one or more tasks to update the parameter values of the parameters of the candidate agent neural networks in the population data, the training comprising, for each candidate agent neural network: obtaining data identifying a candidate task; obtaining data specifying a control policy for the candidate task; determining whether to train the candidate agent neural network on the candidate task; and in response to determining to train the candidate agent neural network on the candidate task, training the candidate agent neural network on the candidate task.

IPC Classes ?

G06N 3/00 - Computing arrangements based on biological models
G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods

80. CONTROLLING A MAGNETIC FIELD OF A MAGNETIC CONFINEMENT DEVICE USING A NEURAL NETWORK

Application Number	EP2022069047
Publication Number	2023/281048
Status	In Force
Filing Date	2022-07-08
Publication Date	2023-01-12
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Degrave, Jonas Felici, Federico Alberto Alfredo Buchli, Jonas Neunert, Michael Peter Tracey, Brendan Daniel Carpanese, Francesco Ewalds, Timo Victor Hafner, Roland Riedmiller, Martin

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating control signals for controlling a magnetic field for confining plasma in a chamber of a magnetic confinement device. One of the methods includes, for each of a plurality of time steps, obtaining an observation characterizing a current state of the plasma in the chamber of the magnetic confinement device, processing an input including the observation using a plasma confinement neural network to generate a magnetic control output that characterizes control signals for controlling the magnetic field of the magnetic confinement device, and generating the control signals for controlling the magnetic field of the magnetic confinement device based on the magnetic control output.

IPC Classes ?

G21B 1/05 - Thermonuclear fusion reactors with magnetic or electric plasma confinement
G21D 3/00 - Control of nuclear power plant

81. DETERMINING A DISTRIBUTION OF ATOM COORDINATES OF A MACROMOLECULE FROM IMAGES USING AUTO-ENCODERS

Application Number	EP2022067541
Publication Number	2022/269100
Status	In Force
Filing Date	2022-06-27
Publication Date	2022-12-29
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Ronneberger, Olaf Abellanas, Marta Garnelo Rosenbaum, Dan Eslami, Seyed Mohammadali Adler, Jonas Anders

Abstract

Methods, systems and apparatus, including computer programs encoded on computer storage media. One of the methods includes obtaining a plurality of images of a macromolecule having a plurality of atoms, training a decoder neural network on the plurality of images, and after the training, generating a plurality of conformations for at least a portion of the macromolecule that each include respective three-dimensional coordinates of each of the plurality of atoms, wherein generating each conformation includes sampling a conformation latent representation from a prior distribution over conformation latent representations, processing a respective input including the sampled conformation latent representation using the decoder neural network to generate a conformation output that specifies three-dimensional coordinates of each of the plurality of atoms for the conformation, and generating the conformation from the conformation output.

IPC Classes ?

G16B 15/30 - Drug targeting using structural data; Docking or binding prediction
G16B 40/20 - Supervised data analysis

82. ADAPTIVE VISUAL SPEECH RECOGNITION

Application Number	EP2022066419
Publication Number	2022/263570
Status	In Force
Filing Date	2022-06-15
Publication Date	2022-12-22
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Assael, Ioannis Alexandros Shillingford, Brendan Gomes De Freitas, Joao Ferdinando

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing video data using an adaptive visual speech recognition model. One of the methods includes receiving a video that includes a plurality of video frames that depict a first speaker; obtaining a first embedding characterizing the first speaker; and processing a first input comprising (i) the video and (ii) the first embedding using a visual speech recognition neural network having a plurality of parameters, wherein the visual speech recognition neural network is configured to process the video and the first embedding in accordance with trained values of the parameters to generate a speech recognition output that defines a sequence of one or more words being spoken by the first speaker in the video.

IPC Classes ?

G10L 15/16 - Speech classification or search using artificial neural networks
G10L 15/26 - Speech to text systems
G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods
G06V 40/16 - Human faces, e.g. facial parts, sketches or expressions
G10L 15/25 - Speech recognition using non-acoustical features using position of the lips, movement of the lips or face analysis

83. MULTIMODAL FEW-SHOT LEARNING WITH FROZEN LANGUAGE MODELS

Application Number	EP2022065502
Publication Number	2022/258666
Status	In Force
Filing Date	2022-06-08
Publication Date	2022-12-15
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Tsimpoukelli, Maria Rafailia Menick, Jacob Lee Cabi, Serkan Hill, Felix George Eslami, Seyed Mohammadali Vinyals, Oriol

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing multi-modal inputs using language models. In particular, the inputs include an image, and the image is encoded by an image encoder neural network to generate a sequence of image embeddings representing the image. The sequence of image embeddings is provided as at least part of an input sequence to that is processed by a language model neural network.

IPC Classes ?

G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods
G06V 10/80 - Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks

84. PREDICTING SPECTRAL REPRESENTATIONS FOR TRAINING SPEECH SYNTHESIS NEURAL NETWORKS

Application Number	EP2022064925
Publication Number	2022/253901
Status	In Force
Filing Date	2022-06-01
Publication Date	2022-12-08
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Casagrande, Norman

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform speech synthesis. One of the methods includes obtaining a training data set for training a first neural network to process a spectral representation of an audio sample and to generate a prediction of the audio sample, wherein, after training, the first neural network obtains spectral representations of audio samples from a second neural network; for a plurality of audio samples in the training data set: generating a ground-truth spectral representation of the audio sample; and processing the ground-truth spectral representation using a third neural network to generate an updated spectral representation of the audio sample; and training the first neural network using the updated spectral representations, wherein the third neural network is configured to generate updated spectral representations that resemble spectral representations generated by the second neural network.

IPC Classes ?

G10L 13/047 - Architecture of speech synthesisers
G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
G06N 3/04 - Architecture, e.g. interconnection topology
G10L 13/10 - Prosody rules derived from text; Stress or intonation
G10L 13/033 - Voice editing, e.g. manipulating the voice of the synthesiser

85. CONTINUAL LEARNING NEURAL NETWORK SYSTEM TRAINING FOR CLASSIFICATION TYPE TASKS

Application Number	EP2022064431
Publication Number	2022/248676
Status	In Force
Filing Date	2022-05-27
Publication Date	2022-12-01
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Shanahan, Murray

Abstract

There is disclosed a computer-implemented method for training a neural network-based system. The method comprises receiving a training data item and target data associated with the training data item. The training data item is processed using an encoder to generate an encoding of the training data item. A subset of neural networks is selected from a plurality of neural networks stored in a memory based upon the encoding; wherein the plurality of neural networks are configured to process the encoding to generate output data indicative of a classification of an aspect of the training data item. The encoding is processed using the selected subset of neural networks to generate the output data. An update to the parameters of the selected subset of neural networks is determined based upon a loss function comprising a relationship between the generated output data and the target data associated with the training data item. The parameters of the selected subset of neural networks are updated based upon the determined update.

IPC Classes ?

G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods

86. NEURAL NETWORKS WITH HIERARCHICAL ATTENTION MEMORY

Application Number	EP2022064497
Publication Number	2022/248723
Status	In Force
Filing Date	2022-05-27
Publication Date	2022-12-01
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Lampinen, Andrew Kyle

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing a machine learning task on a network input to generate a network output. One of the systems includes an attention neural network comprising one or more hierarchical attention blocks, each hierarchical attention block configured to: receive an input sequence for the hierarchical attention block; maintain a 5 plurality of memory summary keys, each memory summary key corresponding to a respective one of a plurality of partitions of a sequence of memory block inputs; determine a proper subset of the plurality of memory summary keys; and generate an attended input sequence for the hierarchical attention block including applying an attention mechanism over the respective memory block inputs at the memory positions within the partitions of 10 the sequence of memory block inputs that correspond to the proper subset of the plurality of memory summary keys.

IPC Classes ?

G06N 3/00 - Computing arrangements based on biological models
G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods

87. REINFORCEMENT LEARNING USING AN ENSEMBLE OF DISCRIMINATOR MODELS

Application Number	EP2022064499
Publication Number	2022/248725
Status	In Force
Filing Date	2022-05-27
Publication Date	2022-12-01
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Hansen, Steven Stenberg Strouse, Daniel Joseph

Abstract

This specification describes a method performed by one or more data processing apparatus that includes: sampling a latent from a set of possible latents, selecting actions to be performed by an agent to interact with an environment over a sequence of time steps using an action selection neural network that is conditioned on the sampled latent, determining a respective reward received for each time step in the sequence of time steps using an ensemble of discriminator models, and training the action selection neural network based on the rewards using a reinforcement learning technique. Each discriminator model can process an observation to generate a respective prediction output that predicts which latent the action selection neural network was conditioned on to cause the environment to enter the state characterized by the observation.

IPC Classes ?

G06N 3/08 - Learning methods
G06N 20/20 - Ensemble learning
G06N 3/00 - Computing arrangements based on biological models
G06N 3/04 - Architecture, e.g. interconnection topology

88. TRAINING GRAPH NEURAL NETWORKS USING A DE-NOISING OBJECTIVE

Application Number	EP2022064565
Publication Number	2022/248735
Status	In Force
Filing Date	2022-05-30
Publication Date	2022-12-01
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Godwin, Jonathan William Battaglia, Peter William Schaarschmidt, Kevin Michael Sanchez, Alvaro

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network that includes one or more graph neural network layers. In one aspect, a method comprises: generating data defining a graph, comprising: generating a respective final feature representation for each node, wherein, for each of one or more of the nodes, the respective final feature representation is a modified feature representation that is generated from a respective feature representation for the node using respective noise; processing the data defining the graph using one or more of the graph neural network layers of the neural network to generate a respective updated node embedding of each node; and processing, for each of one or more of the nodes having modified feature representations, the updated node embedding of the node to generate a respective de-noising prediction for the node.

IPC Classes ?

G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods
G16C 20/70 - Machine learning, data mining or chemometrics
G16C 20/30 - Prediction of properties of chemical compounds, compositions or mixtures
G16C 20/50 - Molecular design, e.g. of drugs

89. UNSUPERVISED LEARNING OF OBJECT REPRESENTATIONS FROM VIDEO SEQUENCES USING ATTENTION OVER SPACE AND TIME

Application Number	EP2022064484
Publication Number	2022/248712
Status	In Force
Filing Date	2022-05-27
Publication Date	2022-12-01
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Kabra, Rishabh Zoran, Daniel Erdogan, Goker Creswell, Antonia Phoebe Nina Matthey-De-L'Endroit, Loic Botvinick, Matthew Lerchner, Alexander Burgess, Christopher Paul

Abstract

A computer-implemented video generation neural network system, configured to determine a value for each of a set of object latent variables by sampling from a respective prior object latent distribution for the object latent variable. The system comprises a trained image frame decoder neural network configured to, for each pixel of each generated image frame and for each generated image frame time step process determined values of the object latent variables to determine parameters of a pixel distribution for each of the object latent variables, combine the pixel distributions for each of the object latent variables to determine a combined pixel distribution, and sample from the combined pixel distribution to determine a value for the pixel and for the time step.

IPC Classes ?

G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods

90. LEARNING DIVERSE SKILLS FOR TASKS USING SEQUENTIAL LATENT VARIABLES FOR ENVIRONMENT DYNAMICS

Application Number	EP2022064491
Publication Number	2022/248718
Status	In Force
Filing Date	2022-05-27
Publication Date	2022-12-01
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Hansen, Steven Stenberg

Abstract

This specification relates to methods for controlling agents to perform actions according to a goal (or option) comprising a sequence of local goals (or local options) and corresponding methods for training. As discussed herein, environment dynamics may be modelled sequentially by sampling latent variables, each latent variable relating to a local goal and being dependent on a previous latent variable. These latent variables are used to condition an action-selection policy neural network to select actions according to the local goal. This allows the agents to reach more diverse states than would be possible through a fixed latent variable or goal, thereby encouraging exploratory behavior. In addition, specific methods described herein model the sequence of latent variables through a simple linear and recurrent relationship that allows the system to be trained more efficiently. This avoids the need to learn a state-dependent higher level policy for selecting the latent variables which can be difficult to train in practice.

IPC Classes ?

G06N 3/08 - Learning methods
G06N 3/04 - Architecture, e.g. interconnection topology
G06N 7/00 - Computing arrangements based on specific mathematical models

91. MULTI-OBJECTIVE REINFORCEMENT LEARNING USING WEIGHTED POLICY PROJECTION

Application Number	EP2022064493
Publication Number	2022/248720
Status	In Force
Filing Date	2022-05-27
Publication Date	2022-12-01
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Abdolmaleki, Abbas Huang, Sandy Han Riedmiller, Martin

Abstract

Computer implemented systems and methods for training an action selection policy neural network to select actions to be performed by an agent to control the agent to perform a task. The techniques are able to optimize multiple objectives one of which may be to stay close to a behavioral policy of a teacher. The behavioral policy of the teacher may be defined by a predetermined dataset of behaviors and the systems and methods may then learn offline. The described techniques provide a mechanism for explicitly defining a trade-off between the multiple objectives.

IPC Classes ?

G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods
G06N 7/00 - Computing arrangements based on specific mathematical models
G06N 5/00 - Computing arrangements using knowledge-based models

92. REINFORCEMENT LEARNING BY SOLUTION OF A CONVEX MARKOV DECISION PROCESS

Application Number	EP2022064495
Publication Number	2022/248722
Status	In Force
Filing Date	2022-05-27
Publication Date	2022-12-01
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Zahavy, Tom Ben Zion O'Donoghue, Brendan Timothy Desjardins, Guillaume Baveja, Satinder Singh

Abstract

The actions of an agent in an environment are selected using a policy model neural network which implements a policy model defining, for any observed state of the environment characterized by an observation received by the policy model neural network, a state-action distribution over the set of possible actions the agent can perform. The policy model neural network is jointly trained with a cost model neural network which, upon receiving an observation characterizing the environment, outputs a reward vector. The reward vector comprises a corresponding reward value for every possible action. The training involves a sequence of iterations, in each of which (a) a cost model is derived based on the state-action distribution of a candidate policy model defined in one or more previous iterations, and subsequently (b) a candidate policy model is obtained based on reward vector(s) defined by the cost model obtained in the iteration.

IPC Classes ?

G06N 3/00 - Computing arrangements based on biological models

93. GENERATING NEURAL NETWORK OUTPUTS BY CROSS ATTENTION OF QUERY EMBEDDINGS OVER A SET OF LATENT EMBEDDINGS

Application Number	EP2022064501
Publication Number	2022/248727
Status	In Force
Filing Date	2022-05-27
Publication Date	2022-12-01
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Jaegle, Andrew Coulter Alayrac, Jean-Baptiste Borgeaud Dit Avocat, Sebastian Ionescu, Catalin-Dumitru Doersch, Carl Ding, Fengning Vinyals, Oriol Hénaff, Olivier Jean Koppula, Skanda Kumar Zoran, Daniel Brock, Andrew Shelhamer, Evan Gerard Zisserman, Andrew Carreira, Joao

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a network output using a neural network. In one aspect, a method comprises: obtaining: (i) a network input to a neural network, and (ii) a set of query embeddings; processing the network input using the neural network to generate a network output that comprises a respective dimension corresponding to each query embedding in the set of query embeddings, comprising: processing the network input using an encoder block of the neural network to generate a representation of the network input as a set of latent embeddings; and processing: (i) the set of latent embeddings, and (ii) the set of query embeddings, using a cross-attention block that generates each dimension of the network output by cross-attention of a corresponding query embedding over the set of latent embeddings.

IPC Classes ?

G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods

94. ENHANCING POPULATION-BASED TRAINING OF NEURAL NETWORKS

Application Number	EP2022064563
Publication Number	2022/248734
Status	In Force
Filing Date	2022-05-30
Publication Date	2022-12-01
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Dalibard, Valentin Clement Jaderberg, Maxwell Elliot

Abstract

Methods, computer systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network for performing a task. The system maintains data specifying (i) a plurality of candidate neural networks and (ii) a partitioning of the plurality of candidate neural networks into a plurality of partitions. The system repeatedly performs operations, including: training each of the candidate neural networks; evaluating each candidate neural network using a respective fitness function for the partition; and for each partition, updating the respective values of the one or more hyperparameters for at least one of the candidate neural networks in the partition based on the respective fitness metrics of the candidate neural networks in the partition. After repeatedly performing the operations, the system selects, from the maintained data, the respective values of the network parameters of one of the candidate neural networks.

IPC Classes ?

G06N 3/00 - Computing arrangements based on biological models
G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/08 - Learning methods

95. TRAINING RATE CONTROL NEURAL NETWORKS THROUGH REINFORCEMENT LEARNING

Application Number	EP2022064566
Publication Number	2022/248736
Status	In Force
Filing Date	2022-05-30
Publication Date	2022-12-01
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Zhernov, Anton Gu, Chenjie Mankowitz, Daniel J. Schrittwieser, Julian Mandhane, Amol Balkishan Rauh, Mary Elizabeth Wang, Miaosen Hubert, Thomas Keisuke

Abstract

Systems and methods for training rate control neural networks through reinforcement learning. During training, reward values for training examples are generated from the current performance of the rate control neural network in encoding the video in the training example and the historical performance of the rate control neural network in encoding the video in the training example.

IPC Classes ?

H04N 19/124 - Quantisation
H04N 19/136 - Incoming video signal characteristics or properties

96. INTERACTIVE DECODING OF WORDS FROM PHONEME SCORE DISTRIBUTIONS

Application Number	EP2022059331
Publication Number	2022/214623
Status	In Force
Filing Date	2022-04-07
Publication Date	2022-10-13
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Assael, Ioannis Alexandros Shillingford, Brendan Denil, Misha Man Ray

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for interactive decoding of a word sequence.

IPC Classes ?

G10L 15/16 - Speech classification or search using artificial neural networks
G06V 40/20 - Movements or behaviour, e.g. gesture recognition
G10L 15/183 - Speech classification or search using natural language modelling using context dependencies, e.g. language models
G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
G10L 15/25 - Speech recognition using non-acoustical features using position of the lips, movement of the lips or face analysis

97. PREDICTING COMPLETE PROTEIN REPRESENTATIONS FROM MASKED PROTEIN REPRESENTATIONS

Application Number	EP2022051943
Publication Number	2022/194434
Status	In Force
Filing Date	2022-01-27
Publication Date	2022-09-22
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Pritzel, Alexander Ionescu, Catalin-Dumitru Kohl, Simon

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for unmasking a masked representation of a protein using a protein reconstruction neural network. In one aspect, a method comprises: receiving the masked representation of the protein; and processing the masked representation of the protein using the protein reconstruction neural network to generate a respective predicted embedding corresponding to one or more masked embeddings that are included in the masked representation of the protein, wherein a predicted embedding corresponding to a masked embedding in a representation of the amino acid sequence of the protein defines a prediction for an identity of an amino acid at a corresponding position in the amino acid sequence, wherein a predicted embedding corresponding to a masked embedding in a representation of the structure of the protein defines a prediction for a corresponding structural feature of the protein.

IPC Classes ?

G16B 15/20 - Protein or domain folding
G06N 20/00 - Machine learning
G16B 15/30 - Drug targeting using structural data; Docking or binding prediction
G16B 40/20 - Supervised data analysis

98. GENERATING OUTPUT SIGNALS USING VARIABLE-RATE DISCRETE REPRESENTATIONS

Application Number	EP2022056014
Publication Number	2022/189493
Status	In Force
Filing Date	2022-03-09
Publication Date	2022-09-15
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Dieleman, Sander Etienne Lea

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating output signals using variable-rate discrete representations. One of the methods includes generating, using a generative neural network, an event sequence representing a run-length encoding of a discrete representation of the audio signal, the event sequence comprising a respective event at each of a plurality of event sequence time steps; generating the discrete representation of the audio signal from the event sequence using run-length decoding; and processing the discrete representation using a decoder neural network, wherein the decoder neural network is configured to process the discrete representation of the audio signal to generate the prediction of the audio signal.

IPC Classes ?

G10L 19/16 - Vocoder architecture
G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks

99. NOWCASTING USING GENERATIVE NEURAL NETWORKS

Application Number	EP2022053834
Publication Number	2022/175337
Status	In Force
Filing Date	2022-02-16
Publication Date	2022-08-25
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Ravuri, Suman Lenc, Karel Mirowski, Piotr Wojciech Lam, Remi Roger Alain Paul Willson, Matthew James Brock, Andrew

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for precipitation nowcasting using generative neural networks. One of the methods includes obtaining a context temporal sequence of a plurality of context radar fields characterizing a real-world location, each context radar field characterizing the weather in the real-world location at a corresponding preceding time point; sampling a set of one or more latent inputs by sampling values from a specified distribution; and for each sampled latent input, processing the context temporal sequence of radar fields and the sampled latent input using a generative neural network that has been configured through training to process the temporal sequence of radar fields to generate as output a predicted temporal sequence comprising a plurality of predicted radar fields, each predicted radar field in the predicted temporal sequence characterizing the predicted weather in the real-world location at a corresponding future time point.

IPC Classes ?

G06N 3/04 - Architecture, e.g. interconnection topology
G01W 1/10 - Devices for predicting weather conditions
G05D 1/00 - Control of position, course, altitude, or attitude of land, water, air, or space vehicles, e.g. automatic pilot
G06N 3/08 - Learning methods
G08G 3/00 - Traffic control systems for marine craft
G08G 5/00 - Traffic control systems for aircraft

100. SEQUENCE-TO SEQUENCE NEURAL NETWORK SYSTEMS USING LOOK AHEAD TREE SEARCH

Application Number	EP2022053035
Publication Number	2022/171635
Status	In Force
Filing Date	2022-02-08
Publication Date	2022-08-18
Owner	DEEPMIND TECHNOLOGIES LIMITED (United Kingdom)
Inventor	Leblond, Rémi Bertrand Francis Alayrac, Jean-Baptiste Sifre, Laurent Pîslar, Miruna Lespiau, Jean-Baptiste Antonoglou, Ioannis Simonyan, Karen Silver, David Vinyals, Oriol

Abstract

A computer-implemented method for generating an output token sequence from an input token sequence. The method combines a look ahead tree search, such as a Monte Carlo tree search, with a sequence-to-sequence neural network system. The sequence-to-sequence neural network system has a policy output defining a next token probability distribution, and may include a value neural network providing a value output to evaluate a sequence. An initial partial output sequence is extended using the look ahead tree search guided by the policy output and, in implementations, the value output, of the sequence-to-sequence neural network system until a complete output sequence is obtained.

IPC Classes ?

G06N 3/08 - Learning methods
G06N 5/00 - Computing arrangements using knowledge-based models
G06N 3/04 - Architecture, e.g. interconnection topology

1 2 3 Next Page