An audio translation system includes a feature extractor and a style transfer machine learning model. The feature extractor generates, for each of a plurality of source voice files, one or more source voice parameters encoded as a collection of source feature vectors, and generates, for each of a plurality of target voice files, one or more target voice parameters encoded as a collection of target feature vectors. The style transfer machine learning model is trained on the collection of source feature vectors for the plurality of source voice files and the collection of target feature vectors for the plurality of target voice files to generate a style transformed feature vector.
G10L 21/003 - Changing voice quality, e.g. pitch or formants
G10L 25/45 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of analysis window
G10L 25/75 - Speech or voice analysis techniques not restricted to a single one of groups for modelling vocal tract parameters
G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
2.
SELECTION OF A WIRELESS DEVICE TO BE REMOTELY CONTROLLED BY A USER INTERFACE DEVICE FOR MEDIA PRESENTATION IN A VEHICLE
A method includes receiving a Bluetooth Low Energy (BLE) advertising message from a user interface (UI) device. The method includes, responsive to a receipt of the BLE advertising message from the UI device: waking up an application module of the first wireless device and authorizing the UI device to remotely control media presentation as presented by the application module. The method includes determining a first determination of whether the first wireless device is paired or is in a current cabled connection with an electronic device that is distinct from the UI device; and in accordance with the first determination being a determination that the first wireless device is not paired with the electronic device and is not in a current cabled connection with the electronic device, automatically terminating the authorization of the UI device to remotely control media presentation as presented by the application module.
H04N 21/414 - Specialised client platforms, e.g. receiver in a car or embedded in a mobile appliance
H04W 4/48 - Services specially adapted for particular environments, situations or purposes for vehicles, e.g. vehicle-to-pedestrian communication for in-vehicle communication
H04W 4/80 - Services using short range communication, e.g. near-field communication, radio-frequency identification or low energy communication
H04L 67/125 - Protocols specially adapted for proprietary or special-purpose networking environments, e.g. medical networks, sensor networks, networks in vehicles or remote metering networks involving control of end-device applications over a network
H04N 21/41 - Structure of client; Structure of client peripherals
Media content episodes are received. Using machine learning, one or more media segments of interest are identified in each of the media content episodes based at least in part on an analysis of content included in a corresponding audio content episode. Each of the identified media segments is associated with one or more automatically determined tags. Using machine learning, a recommended media segment is selected for a specific user from the identified media segments based at least in part on attributes of the specific user and the automatically determined tags of the identified media segments. The recommended media segment is automatically provided in a media segment feed.
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination
G06F 3/14 - Digital output to display device
G10L 17/00 - Speaker identification or verification
G06N 5/04 - Inference or reasoning models
G06F 16/638 - Presentation of query results
G06F 16/64 - Browsing; Visualisation therefor
Methods, systems, and related products provide detection of media content items that are under-locatable by machine voice-driven retrieval based on uttered requests for the media items. For a given media item, a resolvability value and/or an utterance resolve frequency is calculated as a ratio of the number of playbacks of the media item via a speech retrieval modality to the total number of playbacks of the media item regardless of retrieval modality. In some examples, the methods, systems and related products also improve the locatability of an under-locatable media item by collecting and/or generating one or more pronunciation aliases for the under-locatable item.
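The ratio described in the abstract above can be sketched in a few lines of Python. This is only an illustration: the function names, the zero-playback convention, and the 0.05 cut-off for calling an item under-locatable are assumptions, not values taken from the source.

```python
def utterance_resolve_frequency(voice_playbacks: int, total_playbacks: int) -> float:
    """Share of an item's playbacks that were reached through voice retrieval."""
    if total_playbacks <= 0:
        # Assumed convention: an item that was never played has frequency 0.
        return 0.0
    return voice_playbacks / total_playbacks


def is_under_locatable(voice_playbacks: int, total_playbacks: int,
                       threshold: float = 0.05) -> bool:
    """Flag an item whose voice-driven share falls below a hypothetical threshold."""
    return utterance_resolve_frequency(voice_playbacks, total_playbacks) < threshold
```

An item flagged this way would be a candidate for the pronunciation aliases the abstract mentions.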
A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
G10L 13/00 - Speech synthesis; Text-to-speech systems
G06F 16/683 - Data retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually, using metadata automatically derived from the content
G10L 13/04 - Methods for producing synthetic speech; Speech synthesisers - Details of speech synthesis systems, e.g. synthesiser structure or memory management
A method of determining relations between music items, wherein a music item is a submix of a musical composition comprising one or more music tracks, the method comprising determining a first input representation for at least part of a first music item, mapping the first input representation onto one or more subspaces derived from a vector space using a first model, wherein each subspace models a characteristic of the music items, determining a second input representation for at least part of a second music item, mapping the second input representation onto the one or more subspaces using a second model, and determining a distance between the mappings of the first and second input representations in each subspace, wherein the distance represents the degree of relation between the first and second input representations with respect to the characteristic modelled by the subspace.
G10H 1/00 - ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE Details of electrophonic musical instruments
A method of determining relations between music items, the method comprising determining a first input representation for a symbolic representation of a first music item, mapping the first input representation onto one or more subspaces derived from a vector space using a first model, wherein each subspace models a characteristic of the music items, determining a second input representation for music data representing a second music item, mapping the second input representation onto the one or more subspaces using a second model, and determining a distance between the mappings of the first and second input representations in each subspace, wherein the distance represents the degree of relation between the first and second input representations with respect to the characteristic modelled by the subspace.
A method for training a speech synthesis model adapted to output speech in response to input text is provided. The method includes receiving training data for training said speech synthesis model, the training data comprising speech that corresponds to known text. The method includes training said speech synthesis model. The method includes testing said speech synthesis model using a plurality of text sequences. The method includes calculating at least one metric indicating the performance of the model when synthesising each text sequence. The method includes determining from said metric whether the speech synthesis model requires further training. The method includes determining targeted training text from said calculated metrics, wherein said targeted training text is text related to text sequences where the metric indicated that the model required further training. The method also includes outputting said determined targeted training text with a request for further speech corresponding to the targeted training text.
G10L 13/047 - Architecture of speech synthesisers
G10L 13/08 - Text analysis or generation of parameters for speech synthesis from text, e.g. grapheme-to-phoneme conversion, prosody generation or determination of intonation or stress
A system and method for media content sequencing. Prior tracks for a listening session are segmented into groups based on attribute scores for an audial attribute. A preferred group is then selected, which can be based on user feedback regarding the prior tracks in the listening session. Candidate tracks, such as from a candidate track pool for future playback in the listening session, are also segmented into the groups of the prior tracks. The candidate tracks can then be ranked based on their associated group and the preferred group.
G10H 1/00 - ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE Details of electrophonic musical instruments
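The grouping and ranking steps in the media content sequencing abstract above might look roughly like the following Python sketch. The bucket boundaries, the tuple layout of a candidate, and the idea of ranking by how far a candidate's group lies from the preferred group are all assumptions made for illustration; the source does not state how groups are formed or compared.

```python
from typing import List, Sequence, Tuple


def group_of(score: float, boundaries: Sequence[float]) -> int:
    """Return the index of the group an attribute score falls into.

    `boundaries` are hypothetical ascending cut points, so two boundaries
    define three groups (0, 1 and 2).
    """
    return sum(score >= b for b in boundaries)


def rank_candidates(candidates: List[Tuple[str, float]],
                    preferred_group: int,
                    boundaries: Sequence[float]) -> List[str]:
    """Order candidate track ids so tracks in or near the preferred group come first."""
    ranked = sorted(
        candidates,
        key=lambda c: abs(group_of(c[1], boundaries) - preferred_group),
    )
    return [track_id for track_id, _ in ranked]
```

Here the preferred group is passed in directly; in the abstract it is selected from user feedback on the prior tracks, which this sketch leaves out.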
A second wake word detector, at a media-playback device, that plays audio (or other) content to a device, such as a voice-enabled device, detects false wake words in the audio content. The second wake word detector analyzes the audio stream to determine if the audio stream contains any audio that sounds like the wake word. If so, the second wake word detector can generate one of a plurality of instructions that describes the time period, within the audio content, in which the false wake word was encountered. The instruction can cause a first wake word detector to assume one of a plurality of configurations. The media-playback device can then instruct or inform the voice-enabled device of the presence of the false wake word. In this way, the wake word detector, at the voice-enabled device, is not activated to receive the false wake word or ignores the wake word.
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialogue
G10L 15/20 - Speech recognition techniques specially adapted for robustness against adverse environments, e.g. in noise or recognition of speech uttered under stress
A wake word detector, at a server of a content delivery network (CDN) that provides audio (or other) content to a device, such as a voice-enabled device, detects false wake words in the audio content. The CDN wake word detector analyzes the audio stream to determine if the audio stream contains any audio that sounds like the wake word. If so, the CDN wake word detector can generate metadata that describes the time period, within the audio content, in which the false wake word was encountered. The metadata can include time offsets, from the start of the audio content, which can instruct a voice-enabled device to deactivate during the time period. This metadata is stored and then sent to the media-playback device when it requests the media content. The media-playback device can then instruct or inform the voice-enabled device of the presence of the false wake word. In this way, the wake word detector, at the voice-enabled device, is not activated to receive the false wake word.
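One way a playback-side component could consume the time-offset metadata described in the abstract above is sketched below in Python. The metadata shape (a list of start and end offsets in seconds from the start of the content) and the function name are assumptions; the abstract does not define a format.

```python
from typing import List, Tuple


def should_ignore_wake_word(playback_position_s: float,
                            false_wake_windows: List[Tuple[float, float]]) -> bool:
    """Return True while playback is inside a window flagged as a false wake word.

    Each window is a hypothetical (start_offset_s, end_offset_s) pair measured
    from the start of the audio content.
    """
    return any(start <= playback_position_s <= end
               for start, end in false_wake_windows)
```

A voice-enabled device could poll this check with the current playback position and suppress its detector whenever it returns True.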
Systems, devices, apparatuses, components, methods, and techniques for predicting user and media-playback device states are provided. Systems, devices, apparatuses, components, methods, and techniques for representing cached, user-selected, and streaming content are also provided.
G06F 15/167 - Interprocessor communication using a common memory, e.g. mailbox
G06N 5/02 - Knowledge representation; Symbolic representation
G06F 12/0888 - Addressing of a memory level in which access to the desired data or data blocks requires associative addressing means, e.g. caches using selective caching, e.g. cache purging
H04N 21/231 - Content storage operation, e.g. caching movies for short-term storage, replicating data over several servers, or prioritising data for deletion
H04L 67/5681 - Pre-fetching or pre-delivering data based on network characteristics
G06F 12/14 - Protection against unauthorised use of memory
13.
TEXT-TO-SPEECH SYNTHESIS METHOD AND SYSTEM, AND A METHOD OF TRAINING A TEXT-TO-SPEECH SYNTHESIS SYSTEM
A text-to-speech synthesis method includes receiving text, inputting the received text in a synthesizer that includes a prediction network configured to convert the received text into speech data having a speech attribute that includes emotion, intention, projection, pace, and/or accent, and outputting said speech data. The prediction network is obtained by obtaining a first sub-dataset and a second sub-dataset, where the first sub-dataset and the second sub-dataset each include audio samples and corresponding text, and the speech attribute of the audio samples of the second sub-dataset is more pronounced than the speech attribute of the audio samples of the first sub-dataset, training a first model using the first sub-dataset until a performance metric reaches a first predetermined value, training a second model by further training the first model using the second sub-dataset until the performance metric reaches a second predetermined value, and selecting one trained model as the prediction network.
G10L 13/033 - Methods for producing synthetic speech; Speech synthesisers Voice editing, e.g. transforming the voice of the synthesiser
G10L 13/047 - Architecture of speech synthesisers
G10L 13/027 - Concept-to-speech synthesisers; Generation of natural phrases from machine-based concepts
G10L 13/08 - Text analysis or generation of parameters for speech synthesis from text, e.g. grapheme-to-phoneme conversion, prosody generation or determination of intonation or stress
Methods, systems and computer program products are provided for determining acoustic feature vectors of query and target items in a first vector space, and mapping the acoustic feature vectors to a second vector space having a lower dimension. The distribution of vectors in the second vector space can then be used to identify items from the same songs, and/or items that are complementary. A mapping function is trained using a machine learning algorithm, such that complementary audio items are closer in the second vector space than the first, according to a given distance metric.
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination
G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
A method, which may be performed at an electronic device, such as a media server associated with a media-providing service, causes a set of media items to be provided to a user based on identifying performance listings relevant to the user. The method includes determining a list of one or more performance listings of artists relevant to a user based on a media consumption history of the user, the media consumption history describing media content items previously delivered to the user by a media content server, and a listening profile of a second user, distinct from the first user, the listening profile identifying media content and artists played by the second user via the media content server. The method includes providing one or more media items to the user, the one or more media items selected based on the list of one or more performance listings.
A descriptive media content search solution is provided to allow a user to search for media content that better matches a user's descriptive search request. The descriptive media content search solution utilizes an extensive catalog of playlists each having a playlist description, such as a playlist title or other descriptive text, and identifies additional descriptive information for media content items to be searched. The descriptive media content search solution can set up a descriptive search database and utilize the descriptive search database to conduct a descriptive search responsive to the user's descriptive search request.
G06F 16/48 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
G06F 16/41 - Indexing; Data structures therefor; Storage structures
G06F 16/438 - Presentation of query results
G06F 16/2457 - Query processing with adaptation to user needs
17.
Systems and Methods for Using Hierarchical Ordered Weighted Averaging for Providing Personalized Media Content
An electronic device, for each media content item of a plurality of media content items, receives a respective score for each of a first set of objectives and one or more other objectives and generates a respective score between a user and the media content item. The generating includes applying a first ordered weighted average to the respective scores for the first set of objectives, to produce a first combined score for the first set of objectives, and applying a second ordered weighted average to the respective scores for a second set of objectives, wherein the second set of objectives includes (i) a resulting objective corresponding to the first set of objectives and having the first combined score and (ii) the one or more other objectives. The electronic device streams media content to the user selected based on the respective scores between the user and the media content items.
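The two-level scoring in the abstract above can be illustrated with a short Python sketch. An ordered weighted average sorts the scores in descending order and then takes a weighted sum, so the weights attach to rank positions rather than to particular objectives. The function names and the example weights are assumptions for illustration only.

```python
from typing import Sequence


def owa(scores: Sequence[float], weights: Sequence[float]) -> float:
    """Ordered weighted average: weights apply to scores sorted high to low."""
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    ordered = sorted(scores, reverse=True)
    return sum(w * s for w, s in zip(weights, ordered))


def hierarchical_owa(first_set_scores: Sequence[float],
                     other_scores: Sequence[float],
                     first_weights: Sequence[float],
                     second_weights: Sequence[float]) -> float:
    """Combine the first set of objectives, then combine that result with the others."""
    first_combined = owa(first_set_scores, first_weights)
    # The combined score acts as one "resulting objective" in the second set.
    return owa([first_combined, *other_scores], second_weights)
```

For instance, with `first_weights=[0.5, 0.5]` the first level is a plain mean, and `second_weights=[0.0, 1.0]` makes the second level return the minimum of the combined score and the single other objective.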
A first client device is associated with a first user hosting a shared playback session. While a first media content item from the shared playback session is being presented on a set of presentation devices, the first client device communicates with a set of observer devices for the shared playback session. The first client device receives a request to modify playback of the shared playback session from a second client device, the second client device being one observer device of the set of observer devices. In response to the request to modify playback of the shared playback session, the first client device determines an action to take with respect to the shared playback session. In response to determining the action to take with respect to the shared playback session, the first client device sends a command for the action to each of the set of presentation devices.
H04N 21/472 - End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification or for transforming displayed content
H04L 65/401 - Support for services or applications wherein the services involve a main real-time session and one or more additional parallel real-time or time-sensitive sessions, e.g. shared access to a whiteboard or setting up a sub-conference
H04N 21/4788 - Supplemental services, e.g. displaying phone caller identification or shopping application communicating with other users, e.g. chatting
19.
ADAPTIVE MULTI-MODEL ITEM SELECTION SYSTEMS AND METHODS
An adaptive multi-model item selection method, comprising: receiving, from one of a plurality of client devices, a request including a client-side feature vector representing a state of the client device; determining, by an advocate model, a probability distribution of a plurality of specialist cluster models from the client-side feature vector; choosing, by a use case selector, a cluster corresponding to a use case from the probability distribution; and obtaining, by the use case selector based on the cluster (i.e., the cluster that was sampled by the use case selector), a specialist cluster model from the plurality of specialist cluster models.
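The flow in the abstract above (feature vector, probability distribution over clusters, sampled cluster, specialist model) can be sketched as follows in Python. The advocate model is represented by an arbitrary callable, and the cluster names, the dictionary of specialists, and the sampling via `random.Random.choices` are illustrative assumptions rather than the method described in the source.

```python
import random
from typing import Any, Callable, Dict, Sequence, Tuple


def choose_cluster(distribution: Dict[str, float], rng: random.Random) -> str:
    """Sample one cluster name according to its probability."""
    clusters = list(distribution)
    weights = [distribution[c] for c in clusters]
    return rng.choices(clusters, weights=weights, k=1)[0]


def select_specialist(features: Sequence[float],
                      advocate: Callable[[Sequence[float]], Dict[str, float]],
                      specialists: Dict[str, Any],
                      rng: random.Random) -> Tuple[str, Any]:
    """Run the advocate, sample a cluster, and look up that cluster's specialist model."""
    distribution = advocate(features)
    cluster = choose_cluster(distribution, rng)
    return cluster, specialists[cluster]
```

Sampling, rather than always taking the most probable cluster, keeps some exploration across use cases; whether the source intends that trade-off is not stated in the abstract.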
Apparatus, methods and computer-readable media are provided for processing wind noise. Audio input is received. A wind noise level representative of wind noise at a microphone array is measured using the audio input, and a determination is made, based on the wind noise level, whether to perform either (i) a wind noise suppression process on the audio input on-device, or (ii) the wind noise suppression process on the audio input on-device and an audio reconstruction process in-cloud.
G10L 21/0232 - Processing in the frequency domain
H04R 1/40 - Arrangements for obtaining the desired frequency or directional characteristics for obtaining the desired directional characteristic only by combining a number of identical transducers
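The routing decision in the wind noise abstract above can be illustrated with a small Python sketch. The normalised noise level in the range 0 to 1, the 0.6 threshold, and the step names are assumptions; the abstract says only that the choice between the two options depends on the measured wind noise level.

```python
from typing import List


def choose_wind_noise_pipeline(wind_noise_level: float,
                               cloud_threshold: float = 0.6) -> List[str]:
    """Pick processing steps for the audio input based on the measured wind noise.

    On-device suppression always runs; cloud reconstruction is added only when
    the (hypothetically normalised) level reaches the threshold.
    """
    steps = ["on_device_suppression"]
    if wind_noise_level >= cloud_threshold:
        steps.append("cloud_reconstruction")
    return steps
```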
A method is provided for modifying a first media content item by superimposing a first set of data over a first audio event having an amplitude that satisfies a first threshold. The first audio event has a first audio profile, the first set of data has a second audio profile, playback of the second audio profile is configured to be masked by the first audio profile during playback of the first media content item, and the first set of data includes playlist information. The method includes transmitting, to a second electronic device, the modified first media content item.
G06F 16/683 - Data retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually, using metadata automatically derived from the content
22.
METHODS AND SYSTEMS FOR PROVIDING PERSONALIZED CONTENT BASED ON SHARED LISTENING SESSIONS
An electronic device receives a request, from a first device of a host user, to initiate a first shared playback session for the first device and one or more additional devices. The electronic device streams media content from a first playback queue to the first device and to the one or more additional devices, the first playback queue including one or more media content items corresponding to the first shared playback session. The electronic device determines that the first device of the host user has left the first shared playback session and, in response, maintains the first playback queue to be accessed by the one or more additional devices. After the host user has left the first shared playback session, the electronic device provides one or more media content items from the first playback queue to at least one of the one or more additional devices.
H04N 21/442 - Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available in the
H04N 21/485 - End-user interface for client configuration
H04N 21/647 - Control signalling between network elements and server or clients; Network processes for video distribution between server and clients, e.g. controlling the quality of the video stream by dropping packets, protecting content against unauthorised modification within the network, or monitoring the load on the network
H04N 21/44 - Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to MPEG-4 scene graphs
H04N 21/439 - Processing of audio elementary streams
23.
METHODS AND SYSTEMS FOR SYNTHESISING SPEECH FROM TEXT
A method for synthesising speech from text includes receiving text and encoding, by way of an encoder module, the received text. The method further includes determining, by way of an attention module, a context vector from the encoding of the received text, wherein determining the context vector comprises at least one of: applying a threshold function to an attention vector and accumulating the thresholded attention vector, or applying an activation function to the attention vector and accumulating the activated attention vector. The method further includes determining speech data from the context vector.
G10L 13/08 - Text analysis or generation of parameters for speech synthesis from text, e.g. grapheme-to-phoneme conversion, prosody generation or determination of intonation or stress
G10L 13/047 - Architecture of speech synthesisers
A method for personalizing media content for a user is provided. The method includes, at an electronic device, streaming a first media item from a first set of media items, the first set of media items compiled using a first recommendation hypothesis. The method further includes, while streaming the first media item, in response to a first user request, selecting, without user intervention, a second set of media items, distinct from the first set of media items, including determining a presentation order of a plurality of sets of media items using a heuristic applied to the plurality of sets of media items. The second set of media items is compiled using a second recommendation hypothesis, wherein the second recommendation hypothesis is distinct from the first recommendation hypothesis. The method includes streaming a second media item from the second set of media items.
H04L 65/613 - Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio, for the control of the source by the destination
H04L 65/1089 - In-session procedures by removing media
This disclosure is directed to adjusting a playlist of media-content items. One aspect is a method comprising receiving a request to adjust a playlist comprising initial media-content items, in response to receiving the input requesting the playlist be adjusted, compiling a set of features for the playlist and selecting a strong seed media-content item from the initial media-content items as a strong seed, predicting scores for a plurality of candidate media-content items based at least in part on the set of features for the playlist and the strong seed, the scores indicating a likelihood that a corresponding candidate media-content item will be added to the playlist, and inserting a candidate media-content item of the plurality of candidate media-content items after the strong seed media-content item based at least in part on the scores predicted for the plurality of candidate media-content items.
An audio cancellation system includes a voice enabled computing system that is connected to an audio output device using a wired or wireless communication network. The voice enabled computing device can provide media content to a user and receive a voice command from the user. The connection between the voice enabled computing system and the audio output device introduces a time delay between the media content being generated at the voice enabled computing device and the media content being reproduced at the audio output device. The system operates to determine a calibration value adapted for the voice enabled computing system and the audio output device. The system uses the calibration value to filter the user's voice command from a recording of ambient sound including the media content, without requiring significant use of memory and computing resources.
G10L 21/0232 - Processing in the frequency domain
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination
A method for communicating a playback order for a plurality of media content items to a user device operating in an online mode, the method performed at a server system and comprising receiving an indication that the user device will enter an offline mode, generating a playback order for the plurality of media content items, and transmitting the generated playback order to the user device before the user device enters the offline mode.
Methods, systems, and computer programs for generating a playlist of media content items without explicit content. A vector space is created that represents explicit and non-explicit tracks in the same playlists created by other users and then tracks are filtered based on cosine distance between the “seed tracks” and all the tracks in the aforementioned playlist. The explicit tracks are filtered out, and tracks are sorted based on the affinity of the user to the artist.
G06F 16/635 - Filtering based on additional data, e.g. user or group profiles
H04N 21/4545 - Input signals to filtering algorithms, e.g. filtering a region of the image
H04N 21/45 - Management operations performed by the client for facilitating the reception of or the interaction with the content, or for administrating data related to the end-user or to the client device itself, e.g. learning of preferences
H04N 21/454 - Content filtering, e.g. blocking advertisements
G06F 16/638 - Presentation of query results
G06F 16/683 - Data retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually, using metadata automatically derived from the content
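The explicit-content playlist abstract above (cosine distance to seed tracks, removal of explicit tracks, sorting by artist affinity) can be sketched in plain Python. The track dictionary layout, the 0.3 distance cut-off, and the affinity lookup are illustrative assumptions; the source does not give concrete data structures or thresholds.

```python
import math
from typing import Dict, List, Sequence


def cosine_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """1 minus cosine similarity; returns 1.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def clean_playlist(seed_vectors: List[Sequence[float]],
                   candidates: List[dict],
                   artist_affinity: Dict[str, float],
                   max_distance: float = 0.3) -> List[str]:
    """Keep non-explicit tracks close to any seed, ordered by the user's artist affinity."""
    kept = []
    for track in candidates:
        if track["explicit"]:
            continue  # explicit tracks are filtered out
        nearest = min(cosine_distance(seed, track["vector"]) for seed in seed_vectors)
        if nearest <= max_distance:
            kept.append(track)
    kept.sort(key=lambda t: artist_affinity.get(t["artist"], 0.0), reverse=True)
    return [t["id"] for t in kept]
```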
A system for supporting a user’s repetitive motion activity operates to manage cadence-based playlists identifying one or more media content items having a tempo corresponding to a user’s cadence. The cadence-based playlists can be categorized by different tempi or tempo ranges that cover all likely cadences during the user’s activities. A media-playback device is provided to acquire a user’s cadence and retrieve a cadence-based playlist associated with a tempo or a tempo range corresponding to the cadence.
G06F 16/683 - Data retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually, using metadata automatically derived from the content
G06F 16/638 - Presentation of query results
G06F 16/68 - Data retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
G06F 16/9535 - Search customisation based on user profiles and personalisation
G05B 15/02 - Systems controlled by a computer electric
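The lookup described in the cadence-based playlist abstract above, from an acquired cadence to a playlist covering the matching tempo range, can be sketched in Python as below. The range boundaries, the half-open interval convention, and the playlist names are assumptions for illustration.

```python
from typing import List, Optional, Tuple


def playlist_for_cadence(cadence_spm: float,
                         playlists: List[Tuple[float, float, str]]) -> Optional[str]:
    """Return the playlist whose tempo range [low, high) contains the cadence.

    `playlists` holds hypothetical (low_bpm, high_bpm, playlist_name) entries;
    the cadence is in steps per minute and is matched directly against tempo.
    """
    for low, high, name in playlists:
        if low <= cadence_spm < high:
            return name
    return None
```

Returning `None` for an uncovered cadence makes gaps visible; the abstract suggests the ranges would be chosen to cover all likely cadences, so that case should be rare.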
An electronic device provides, to a user, a user-curated playlist, the user-curated playlist including an ordered set of media items that were added by the user. While providing a first media item in the ordered set of media items, the electronic device receives a first user input selecting an option to include recommended media items in the user-curated playlist. In response to the first user input, the electronic device updates the user-curated playlist to include a first recommended media item, the first recommended media item selected without user intervention based at least in part on attributes of the user-curated playlist. The first recommended media item is positioned in the user-curated playlist in between media items that were added to the ordered set of media items by the user.
G06F 16/638 - Présentation des résultats des requêtes
G06F 16/635 - Filtrage basé sur des données supplémentaires, p.ex. sur des profils d'utilisateurs ou de groupes
G06F 16/735 - Filtrage basé sur des données supplémentaires, p.ex. sur des profils d'utilisateurs ou de groupes
G06F 16/783 - Recherche de données caractérisée par l’utilisation de métadonnées, p.ex. de métadonnées ne provenant pas du contenu ou de métadonnées générées manuellement utilisant des métadonnées provenant automatiquement du contenu
32.
SYSTEMS AND METHODS FOR IMPORTING AUDIO FILES IN A DIGITAL AUDIO WORKSTATION
A method includes displaying a user interface of a digital audio workstation, which includes a composition region for generating a composition. The composition region includes a representation of a first MIDI file that has already been added to the composition by a user. The method further includes receiving a user input to import, into the composition region, an audio file. In response to the user input to import the audio file, the method includes importing the audio file, which includes, without user intervention, aligning the audio file with a rhythm of the first MIDI file, modifying a rhythm of the audio file based on the rhythm of the first MIDI file, and displaying a representation of the audio file in the composition region.
G10H 1/00 - INSTRUMENTS DE MUSIQUE ÉLECTROPHONIQUES; INSTRUMENTS DANS LESQUELS LES SONS SONT PRODUITS PAR DES MOYENS ÉLECTROMÉCANIQUES OU DES GÉNÉRATEURS ÉLECTRONIQUES, OU DANS LESQUELS LES SONS SONT SYNTHÉTISÉS À PARTIR D'UNE MÉMOIRE DE DONNÉES Éléments d'instruments de musique électrophoniques
This disclosure concerns the provision of media, and more particularly streaming of media. In particular, one aspect herein relates to a method performed by a server system of streaming an audio content item to an electronic device. In response to receiving a request message from the electronic device, a selected audio content item is retrieved from a first storage. Descriptive metadata including an origin-ID associated with the retrieved audio content item is determined. A second storage is browsed utilizing said metadata including the origin-ID to locate non-static media content item(s) associated with the origin-ID. In response to finding a non-static media content item associated with the origin ID, the selected audio content item is sent along with the located non-static media content item to the electronic device for simultaneous presentation of the audio content item and the located non-static media content item.
G06F 16/683 - Recherche de données caractérisée par l’utilisation de métadonnées, p.ex. de métadonnées ne provenant pas du contenu ou de métadonnées générées manuellement utilisant des métadonnées provenant automatiquement du contenu
H04N 21/4722 - Interface pour utilisateurs finaux pour la requête de contenu, de données additionnelles ou de services; Interface pour utilisateurs finaux pour l'interaction avec le contenu, p.ex. pour la réservation de contenu ou la mise en place de rappels, pour la requête de notification d'événement ou pour la transformation de contenus affichés pour la requête de données additionnelles associées au contenu
H04N 21/431 - Génération d'interfaces visuelles; Rendu de contenu ou données additionnelles
G06F 16/48 - Recherche caractérisée par l’utilisation de métadonnées, p.ex. de métadonnées ne provenant pas du contenu ou de métadonnées générées manuellement
G06F 16/583 - Recherche caractérisée par l’utilisation de métadonnées, p.ex. de métadonnées ne provenant pas du contenu ou de métadonnées générées manuellement utilisant des métadonnées provenant automatiquement du contenu
G06F 16/783 - Recherche de données caractérisée par l’utilisation de métadonnées, p.ex. de métadonnées ne provenant pas du contenu ou de métadonnées générées manuellement utilisant des métadonnées provenant automatiquement du contenu
A system, method and computer product for training a neural network system. The method comprises inputting an audio signal to the system to generate plural outputs f(X, Θ). The audio signal includes one or more of vocal content and/or musical instrument content, and each output f(X, Θ) corresponds to a respective one of the different content types. The method also comprises comparing individual outputs f(X, Θ) of the neural network system to corresponding target signals. For each compared output f(X, Θ), at least one parameter of the system is adjusted to reduce a result of the comparing performed for the output f(X, Θ), to train the system to estimate the different content types. In one example embodiment, the system comprises a U-Net architecture. After training, the system can estimate various different types of vocal and/or instrument components of an audio signal, depending on which type of component(s) the system is trained to estimate.
G10H 1/00 - INSTRUMENTS DE MUSIQUE ÉLECTROPHONIQUES; INSTRUMENTS DANS LESQUELS LES SONS SONT PRODUITS PAR DES MOYENS ÉLECTROMÉCANIQUES OU DES GÉNÉRATEURS ÉLECTRONIQUES, OU DANS LESQUELS LES SONS SONT SYNTHÉTISÉS À PARTIR D'UNE MÉMOIRE DE DONNÉES Éléments d'instruments de musique électrophoniques
An electronic device generates a respective user queue for each user of a plurality of users participating in a shared listening session. While providing a first media content item for playback, the device receives a second request, from a first user, to add a second media content item to the shared playback queue and updates the respective user queue for the first user. After receiving the second request, the electronic device receives a third request, from a second user, to add a third media content item to the shared playback queue and updates the respective user queue for the second user. The electronic device updates the shared playback queue using the respective user queues of the first user and the second user, including positioning the third media content item in an order of the shared playback queue to be played back before the second media content item.
H04N 21/472 - Interface pour utilisateurs finaux pour la requête de contenu, de données additionnelles ou de services; Interface pour utilisateurs finaux pour l'interaction avec le contenu, p.ex. pour la réservation de contenu ou la mise en place de rappels, pour la requête de notification d'événement ou pour la transformation de contenus affichés
H04N 21/458 - Ordonnancement de contenu pour créer un flux personnalisé, p.ex. en combinant une publicité stockée localement avec un flux d'entrée; Opérations de mise à jour, p.ex. pour modules de système d'exploitation
H04N 21/442 - Surveillance de procédés ou de ressources, p.ex. détection de la défaillance d'un dispositif d'enregistrement, surveillance de la bande passante sur la voie descendante, du nombre de visualisations d'un film, de l'espace de stockage disponible dans l
H04N 21/25 - Opérations de gestion réalisées par le serveur pour faciliter la distribution de contenu ou administrer des données liées aux utilisateurs finaux ou aux dispositifs clients, p.ex. authentification des utilisateurs finaux ou des dispositifs clients ou
H04N 21/258 - Gestion de données liées aux clients ou aux utilisateurs finaux, p.ex. gestion des capacités des clients, préférences ou données démographiques des utilisateurs, traitement des multiples préférences des utilisateurs finaux pour générer des données co
H04N 21/45 - Opérations de gestion réalisées par le client pour faciliter la réception de contenu ou l'interaction avec le contenu, ou pour l'administration des données liées à l'utilisateur final ou au dispositif client lui-même, p.ex. apprentissage des préféren
H04N 21/466 - Procédé d'apprentissage pour la gestion intelligente, p.ex. apprentissage des préférences d'utilisateurs pour recommander des films
H04N 21/262 - Ordonnancement de la distribution de contenus ou de données additionnelles, p.ex. envoi de données additionnelles en dehors des périodes de pointe, mise à jour de modules de logiciel, calcul de la fréquence de transmission de carrousel, retardement d
Systems, devices, apparatuses, components, methods, and techniques for a simple user interface that can facilitate discovery of contextually relevant media content with minimal navigation are provided. For example, the disclosed user interface may present contextually relevant categories, sub-categories, and media content items while concurrently playing a media content item that the user is predicted to be likely to select.
H04N 21/442 - Surveillance de procédés ou de ressources, p.ex. détection de la défaillance d'un dispositif d'enregistrement, surveillance de la bande passante sur la voie descendante, du nombre de visualisations d'un film, de l'espace de stockage disponible dans l
H04N 21/2668 - Création d'un canal pour un groupe dédié d'utilisateurs finaux, p.ex. en insérant des publicités ciblées dans un flux vidéo en fonction des profils des utilisateurs finaux
H04N 21/45 - Opérations de gestion réalisées par le client pour faciliter la réception de contenu ou l'interaction avec le contenu, ou pour l'administration des données liées à l'utilisateur final ou au dispositif client lui-même, p.ex. apprentissage des préféren
H04N 21/472 - Interface pour utilisateurs finaux pour la requête de contenu, de données additionnelles ou de services; Interface pour utilisateurs finaux pour l'interaction avec le contenu, p.ex. pour la réservation de contenu ou la mise en place de rappels, pour la requête de notification d'événement ou pour la transformation de contenus affichés
37.
SYSTEMS AND METHODS FOR SEQUENCING A PLAYLIST OF MEDIA ITEMS
A server system receives a request to generate a playlist. The playlist includes a sequence of media items. The server system receives a plurality of constraints that define disqualification criteria for excluding media items from a respective slot in the sequence of media items. The plurality of constraints for the respective slot in the sequence of media items includes at least one constraint that is based on already-populated slots in the sequence of media items. The server system generates the playlist by sequentially populating each respective slot in the sequence of media items, including selecting, for the respective slot, a respective media item that meets the plurality of constraints for the respective slot in the sequence of media items. The server system provides the playlist to a user of the media providing service.
H04N 21/262 - Ordonnancement de la distribution de contenus ou de données additionnelles, p.ex. envoi de données additionnelles en dehors des périodes de pointe, mise à jour de modules de logiciel, calcul de la fréquence de transmission de carrousel, retardement d
H04N 21/454 - Filtrage de contenu, p.ex. blocage des publicités
H04N 21/239 - Interfaçage de la voie montante du réseau de transmission, p.ex. établissement de priorité des requêtes de clients
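The slot-by-slot population described in the playlist sequencing abstract above can be sketched as follows. This is an illustrative sketch under stated assumptions: constraints are modelled as functions `(item, slot, playlist_so_far) -> bool` that return `True` to disqualify an item, the first qualifying candidate is taken, and items are never repeated. The two example constraints and the candidate data are hypothetical.

```python
def sequence_playlist(candidates, num_slots, constraints):
    """Fill slots in order, skipping items that any constraint disqualifies."""
    playlist = []
    for slot in range(num_slots):
        chosen = None
        for item in candidates:
            if item in playlist:  # assumption: an item is used at most once
                continue
            # A constraint may inspect the already-populated slots.
            if any(disqualifies(item, slot, playlist) for disqualifies in constraints):
                continue
            chosen = item
            break
        if chosen is None:
            break  # no remaining candidate qualifies for this slot
        playlist.append(chosen)
    return playlist


def same_artist_as_previous(item, slot, playlist):
    """Depends on already-populated slots: no artist twice in a row."""
    return bool(playlist) and playlist[-1]["artist"] == item["artist"]


def explicit_opener(item, slot, playlist):
    """Slot-specific rule: the first slot must not be explicit."""
    return slot == 0 and item["explicit"]


candidates = [
    {"title": "A", "artist": "X", "explicit": True},
    {"title": "B", "artist": "X", "explicit": False},
    {"title": "C", "artist": "Y", "explicit": False},
    {"title": "D", "artist": "X", "explicit": False},
]
result = sequence_playlist(candidates, 3, [same_artist_as_previous, explicit_opener])
print([item["title"] for item in result])
```

A greedy pass like this can run out of qualifying items before all slots are filled; a production system would need a policy for that case, which the abstract does not specify.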
38.
SYSTEMS AND METHODS FOR DETERMINING DESCRIPTORS FOR MEDIA CONTENT ITEMS
An electronic device obtains a plurality of collections of media content items, each collection of media content items being associated with text generated by one or more users of the media-providing service. Based on how frequently a first media content item co-occurs with a first descriptor in text for respective collections of media items that include the first media content item, the electronic device generates, without user input, a new collection of media content items for a first user. The new collection of media content items corresponds to the first descriptor and includes the first media content item. The electronic device presents the new collection of media content items to the first user as a recommendation.
G06F 16/908 - Recherche caractérisée par l’utilisation de métadonnées, p.ex. de métadonnées ne provenant pas du contenu ou de métadonnées générées manuellement utilisant des métadonnées provenant automatiquement du contenu
G06F 16/9535 - Adaptation de la recherche basée sur les profils des utilisateurs et la personnalisation
G06F 16/68 - Recherche de données caractérisée par l’utilisation de métadonnées, p.ex. de métadonnées ne provenant pas du contenu ou de métadonnées générées manuellement
Technology for generating, reading, and using machine-readable codes is disclosed. A method, performed by an image capture device, for reading and using the codes includes obtaining an image and identifying an area in the image that has a machine-readable code. The method also includes, within the image area, finding a predefined start marker defining a start point and a predefined stop marker defining a stop point, with an axis defined between them. A plurality of axis points can be defined along the axis. For each axis point, a first distance within the image area to a mark is determined. The distance can be measured from the axis point in a first direction that is orthogonal to the axis. The first distances can be converted to a binary code using Gray code such that each first distance encodes at least one bit of data in the code.
G06K 19/06 - Supports d'enregistrement pour utilisation avec des machines et avec au moins une partie prévue pour supporter des marques numériques caractérisés par le genre de marque numérique, p.ex. forme, nature, code
G06K 7/14 - Méthodes ou dispositions pour la lecture de supports d'enregistrement par radiation corpusculaire utilisant la lumière sans sélection des longueurs d'onde, p.ex. lecture de la lumière blanche réfléchie
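The distance-to-bits conversion in the machine-readable code abstract above can be sketched as follows. The sketch assumes each measured distance is quantized to one of eight evenly spaced levels (three bits per mark) and that level `i` carries the reflected Gray codeword `i ^ (i >> 1)`; the level count, the normalization by `max_distance`, and the function names are assumptions, not details from the abstract.

```python
def distance_to_bits(distance, max_distance, bits_per_mark=3):
    """Quantize one measured distance and return its Gray-coded bit string."""
    levels = 1 << bits_per_mark
    # Snap the measurement to the nearest quantization level.
    level = round(distance / max_distance * (levels - 1))
    level = max(0, min(levels - 1, level))
    # Reflected Gray code: neighbouring levels differ in exactly one bit,
    # so a small measurement error corrupts at most one bit.
    gray = level ^ (level >> 1)
    return format(gray, f"0{bits_per_mark}b")


def decode_marks(distances, max_distance, bits_per_mark=3):
    """Concatenate the bits encoded by each axis point's distance."""
    return "".join(distance_to_bits(d, max_distance, bits_per_mark) for d in distances)


print(decode_marks([0.0, 1.0, 2.0, 3.0, 7.0], max_distance=7.0))
```

Using Gray code rather than plain binary is what limits the damage of an off-by-one-level reading to a single bit.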
40.
Display screen with animated graphical user interface
Audio content episodes are received. Using machine learning, one or more audio segments of interest are identified in each of the audio content episodes based at least in part on an analysis of content included in a corresponding audio content episode. Each of the identified audio segments is associated with one or more automatically determined tags. Using machine learning, a recommended audio segment is selected for a specific user from the identified audio segments based at least in part on attributes of the specific user and the automatically determined tags of the identified audio segments. The recommended audio segment is automatically provided in an audio segment feed.
G10L 25/51 - Techniques d'analyses de la parole ou de la voix qui ne se limitent pas à un seul des groupes spécialement adaptées pour un usage particulier pour comparaison ou différentiation
G06F 3/14 - Sortie numérique vers un dispositif de visualisation
G10L 17/00 - Identification ou vérification du locuteur
G06N 5/04 - Modèles d’inférence ou de raisonnement
G06F 16/638 - Présentation des résultats des requêtes
G06F 16/64 - Navigation; Visualisation à cet effet
A method comprises the following steps: providing a Gaussian process variational autoencoder (GP-VAE) including a Gaussian process (GP) encoder and a neural network decoder; selecting a plurality of inducing points in a data space; generating a mapping of the plurality of inducing points in a latent space; and training the GP-VAE using a training dataset.
A system, method and computer product for combining audio tracks. In one example embodiment herein, the method comprises determining at least one music track that is musically compatible with a base music track, aligning those tracks in time, and combining the tracks. In one example embodiment herein, the tracks may be music tracks of different songs, the base music track can be an instrumental accompaniment track, and the at least one music track can be a vocal track. Also in one example embodiment herein, the determining is based on musical characteristics associated with at least one of the tracks, such as an acoustic feature vector distance between tracks, a likelihood of at least one track including a vocal component, a tempo, or musical key. Also, determining of musical compatibility can include determining at least one of a vertical musical compatibility or a horizontal musical compatibility among tracks.
G10H 1/00 - INSTRUMENTS DE MUSIQUE ÉLECTROPHONIQUES; INSTRUMENTS DANS LESQUELS LES SONS SONT PRODUITS PAR DES MOYENS ÉLECTROMÉCANIQUES OU DES GÉNÉRATEURS ÉLECTRONIQUES, OU DANS LESQUELS LES SONS SONT SYNTHÉTISÉS À PARTIR D'UNE MÉMOIRE DE DONNÉES Éléments d'instruments de musique électrophoniques
Apparatus, methods, and computer-readable media are provided for processing wind noise. An audio input is received. A wind noise level representative of wind noise at a microphone array is measured using the audio input, and a determination is made, based on the wind noise level, whether to perform either (i) a wind noise suppression process on the audio input on-device, or (ii) the wind noise suppression process on the audio input on-device and an audio reconstruction process in-cloud.
G10L 21/0232 - Traitement dans le domaine fréquentiel
H04R 1/40 - Dispositions pour obtenir la fréquence désirée ou les caractéristiques directionnelles pour obtenir la caractéristique directionnelle désirée uniquement en combinant plusieurs transducteurs identiques
A server receives a request to play a selected playlist from a first electronic device associated with a host listener. The playlist includes audio items having a common attribute. The server also receives an identity of a guest listener having a second electronic device and retrieves an indication of taste of the guest listener based on the received identity. The server selects an additional audio item based at least in part on the indication of taste of the guest listener and the common attribute of the audio items of the selected playlist, and incorporates the additional audio item into the selected playlist.
Systems, devices, apparatuses, components, methods, and techniques for generating and playing a selectable content depth media program are provided. Media content items are edited to produce selectable depth media segments which are assembled into selectable depth media programs. A media-playback device is configured to navigate and play the selectable depth media program through interaction by a listening user. The user selects the desired content depth for each media segment.
H04N 21/482 - Interface pour utilisateurs finaux pour la sélection de programmes
H04N 21/2387 - Traitement de flux en réponse à une requête de reproduction par un utilisateur final, p.ex. pour la lecture à vitesse variable ("trick play")
49.
METHODS AND SYSTEMS FOR PROVISIONING SETTINGS OF A MEDIA PLAYBACK DEVICE
A system is provided for streaming media content in a vehicle. The system includes a personal media streaming appliance system configured to connect to a media delivery system and receive media content from the media delivery system at least via a cellular network. The media delivery system is configured to link a user media streaming account with a particular personal media streaming appliance to provide personalized media content to the appliance. Media contexts are assigned to multiple preset settings automatically so that the personal media streaming appliance system is configured to output personalized media content upon first use.
H04L 41/0806 - Réglages de configuration pour la configuration initiale ou l’approvisionnement, p.ex. prêt à l’emploi [plug-and-play]
H04L 41/08 - Gestion de la configuration des réseaux ou des éléments de réseau
G06Q 30/06 - Transactions d’achat, de vente ou de crédit-bail
H04L 65/612 - Diffusion en flux de paquets multimédias pour la prise en charge des services de diffusion par flux unidirectionnel, p.ex. radio sur Internet pour monodiffusion [unicast]
G06Q 10/08 - Logistique, p.ex. entreposage, chargement ou distribution; Gestion d’inventaires ou de stocks
H04L 65/1063 - Serveurs d'applications fournissant des services réseau
A server determines, at a first predetermined time, a default decision as to whether to provide a first media content clip after the end of a first media content item. At a second predetermined time, after the first predetermined time, the server initiates a determination of a first decision as to whether to provide the first media content clip after the end of the first media content item. In accordance with the first decision being reached within a predetermined latency period, the server provides the first media content clip to the first electronic device in accordance with the first decision. In accordance with a determination that the predetermined latency period has elapsed without the first decision being reached, the server provides the first media content clip to the first electronic device in accordance with the default decision.
H04L 65/61 - Diffusion en flux de paquets multimédias pour la prise en charge des services de diffusion par flux unidirectionnel, p.ex. radio sur Internet
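The fallback behaviour in the default-decision abstract above can be sketched with a timeout on a background computation. This is a minimal sketch, assuming the decision is a boolean computed by some callable and the latency period is expressed in seconds; `decide_with_fallback` and its parameters are hypothetical names, and the use of a thread pool is an illustrative choice, not something the abstract states.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from concurrent.futures import TimeoutError as FutureTimeout


def decide_with_fallback(compute_decision, default_decision, latency_budget_s):
    """Return the fresh decision if it arrives in time, else the default."""
    pool = ThreadPoolExecutor(max_workers=1)
    future = pool.submit(compute_decision)
    try:
        # Wait at most the latency budget for the fresh decision.
        return future.result(timeout=latency_budget_s)
    except FutureTimeout:
        # Budget elapsed without a decision: fall back to the default
        # that was determined earlier.
        return default_decision
    finally:
        # Do not block on a slow computation; let it finish in the background.
        pool.shutdown(wait=False)


def quick_decision():
    return True


print(decide_with_fallback(quick_decision, default_decision=False, latency_budget_s=1.0))
```

Computing the default ahead of time means the timeout path never has to do any work of its own, so the response time stays bounded by the latency budget.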
52.
SYSTEMS AND METHODS FOR GENERATING TRAILERS FOR AUDIO CONTENT
An electronic device receives an audio file and divides the audio file into a plurality of segments. The electronic device, automatically, without user input, determines, for each segment, a descriptor from a plurality of descriptors and a value of the descriptor for the segment. The electronic device selects one or more segments of the plurality of segments, based on a comparison of the respective values of respective descriptors for respective segments and genre-specific criteria selected based on a genre of the audio file. The electronic device generates a trailer for the audio file using the selected one or more segments.
G10L 25/63 - Techniques d'analyses de la parole ou de la voix qui ne se limitent pas à un seul des groupes spécialement adaptées pour un usage particulier pour comparaison ou différentiation pour estimer un état émotionnel
G10L 25/30 - Techniques d'analyses de la parole ou de la voix qui ne se limitent pas à un seul des groupes caractérisées par la technique d’analyse utilisant des réseaux neuronaux
G10L 15/04 - Segmentation; Détection des limites de mots
G10L 15/22 - Procédures utilisées pendant le processus de reconnaissance de la parole, p.ex. dialogue homme-machine
53.
System and method for generating models representing users of a media providing service
A method of recommending media items to a user is provided. The method includes receiving historical data for a user of a media providing service. The historical data indicates past interactions of the user with media items. The method includes generating a model of the user. The model includes a first set of parameters, each of the first set of parameters quantifying a predicted latent preference of the user for a respective media item provided by the media providing service. The method includes evaluating the predicted latent preferences of the user for the respective media items against the historical data indicating the past interactions of the user with the media items provided by the media providing service. The method includes selecting a recommender system from a plurality of recommender systems using the model of the user, including the first set of parameters. The method includes providing a media item to a second user using the selected recommender system.
An electronic device associated with a media-providing service receives a first media content item and a request, from a second electronic device, for playback of the first media content item. The electronic device determines an insertion time within the first media content item for inserting a second media content item, and generates a queue indicating an order in which a first, second, and third file are to be provided. The first file corresponds to a portion of the first media content item from a start of the first media content item until the insertion time, the second file corresponds to the second media content item, and the third file corresponds to a portion of the first media content item starting at the insertion time. The electronic device generates the files, and queues the second electronic device to play back the first, second, and third files in accordance with the queue.
H04N 21/433 - Opération de stockage de contenu, p.ex. opération de stockage en réponse à une requête de pause ou opérations de cache
H04N 21/262 - Ordonnancement de la distribution de contenus ou de données additionnelles, p.ex. envoi de données additionnelles en dehors des périodes de pointe, mise à jour de modules de logiciel, calcul de la fréquence de transmission de carrousel, retardement d
H04N 21/658 - Transmission du client vers le serveur
H04N 21/414 - Plate-formes spécialisées de client, p.ex. récepteur au sein d'une voiture ou intégré dans un appareil mobile
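The three-file queue described in the media insertion abstract above can be sketched as plain metadata, without touching real audio. The record fields, the file naming, and the rule that the insertion time must fall strictly inside the first item are assumptions made for this illustration.

```python
def build_insertion_queue(first_id, duration_s, inserted_id, insertion_time_s):
    """Return the ordered queue of three files for playback."""
    if not 0 < insertion_time_s < duration_s:
        raise ValueError("insertion time must fall inside the first item")
    return [
        # First file: start of the first item up to the insertion time.
        {"file": f"{first_id}.part1", "source": first_id,
         "start_s": 0.0, "end_s": insertion_time_s},
        # Second file: the whole inserted item (its length is not needed here).
        {"file": f"{inserted_id}.full", "source": inserted_id,
         "start_s": 0.0, "end_s": None},
        # Third file: remainder of the first item from the insertion time on.
        {"file": f"{first_id}.part2", "source": first_id,
         "start_s": insertion_time_s, "end_s": duration_s},
    ]


queue = build_insertion_queue("episode-1", 1800.0, "clip-9", 600.0)
for entry in queue:
    print(entry["file"], entry["start_s"], entry["end_s"])
```

Because the first and third entries share the insertion time as a boundary, playing the queue in order reproduces the first item with nothing lost or repeated around the inserted item.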
This disclosure is directed to systems and methods for managing a group session for consuming media content across a plurality of devices. In some configurations and by non-limiting example, the group session operates to synchronize playback and control of media content at the plurality of devices. In one aspect a method of simultaneously playing media content on a plurality of media playback devices for a group session is disclosed.
A server obtains user data for a respective user, including data corresponding to the respective user's consumption of media in a first content domain. Before obtaining, for the respective user, data corresponding to a second content domain, the server uses a neural network to generate a user embedding for the respective user based on the user data. The server generates, for a plurality of content items of the second content domain consumed by users other than the respective user, a respective content item embedding. The respective content item embedding is based on user embeddings of the at least one user other than the respective user. The server system determines, using the user embedding for the respective user and respective content item embeddings, a first content item in the second content domain that meets matching criteria for the respective user and provides the first content item to the respective user.
G06F 16/435 - Filtrage basé sur des données supplémentaires, p.ex. sur des profils d'utilisateurs ou de groupes
G06F 16/438 - Présentation des résultats des requêtes
G06F 16/483 - Recherche caractérisée par l’utilisation de métadonnées, p.ex. de métadonnées ne provenant pas du contenu ou de métadonnées générées manuellement utilisant des métadonnées provenant automatiquement du contenu
57.
SYSTEMS AND METHODS FOR COMMUNICATING WITH A DEVICE IN A LOW POWER MODE
A first server system is configured to communicate with a first client device through a first application executing on the first client device. The first server system determines that communication with the first client device through the first application has been lost due to the first client device entering an idle mode. The first server system receives a request from a second client device that triggers reestablishing communication with the first client device through the first application. In response, the first server system transmits a request to a second server system to wake the first client device from the idle mode. The first server system receives, from the first application on the first client device, an indication that communication has been reestablished between the first server system and the first application. The first server system transmits a control command to control the first client device.
A method of processing playback content control commands generated at a client device and communicated by a backend server to the client device and a controlled device to control media content playback at the controlled device is provided. The method includes the following steps: sending, by the client device, a playback content control command to the backend server, wherein the backend server is configured to communicate the playback content control command to the client device and to the controlled device; initiating a buffer time period; and refraining from processing, at the client device, one or more subsequent playback content control commands from the backend server during the buffer time period.
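The buffer time period in the abstract above can be sketched as a small client-side filter. This is an illustrative sketch only: the class name `BufferedCommandClient`, the one-second default, and the injectable clock are assumptions, and the abstract does not say how the buffer period is chosen.

```python
import time


class BufferedCommandClient:
    """Client that ignores echoed backend commands for a short period after sending."""

    def __init__(self, send_to_backend, buffer_s=1.0, clock=time.monotonic):
        self._send = send_to_backend
        self._buffer_s = buffer_s
        self._clock = clock  # injectable so the behaviour can be tested
        self._buffer_ends_at = float("-inf")
        self.applied = []

    def send_command(self, command):
        """Send a command to the backend and start the buffer period."""
        self._send(command)
        self._buffer_ends_at = self._clock() + self._buffer_s

    def on_backend_command(self, command):
        """Process a command from the backend unless the buffer period is active."""
        if self._clock() < self._buffer_ends_at:
            return False  # refrain from processing during the buffer period
        self.applied.append(command)
        return True


now = [0.0]   # fake clock for the demonstration
sent = []
client = BufferedCommandClient(sent.append, buffer_s=1.0, clock=lambda: now[0])
client.send_command("pause")
now[0] = 0.5
print(client.on_backend_command("pause"))  # arrives inside the buffer period
now[0] = 1.5
print(client.on_backend_command("play"))   # arrives after the buffer period
```

Ignoring commands during the buffer period keeps the client from reacting to the backend's echo of a command it has just issued itself.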
A system for device-to-device media capturing is described herein. An example system includes a media delivery system of a media service and at least a first and second device associated with respective first and second user accounts of the media service. The first device is also associated with an active media content item provided by the media delivery system that is automatically captured by the second device. For example, as the second device is moved proximate to the first device, one or more wireless communications are transmitted between the devices that trigger device-to-device media capturing. Resultantly, an identifier for the active media content item is stored to a library of the second user account of the media service. The identifier is stored in response to detecting the proximity of the devices and determining that the second device is moving towards the first device.
H04W 4/80 - Services utilisant la communication de courte portée, p.ex. la communication en champ proche, l'identification par radiofréquence ou la communication à faible consommation d’énergie
Systems, devices, apparatuses, components, methods, and techniques for saving media content to a context for later playback are provided. An example media-playback device for identifying and playing media content for a user traveling in a vehicle includes a context detecting device, a context-driven playback engine, and a media playback engine. Contexts are established by parameters that can be detected by a media-playback device. Contexts are situations that are defined by one or more locations, times, events, activities, people, and devices. Media content is saved to the contexts for later playback. The contexts are detected by the context detecting device, the associated media content is identified by the context-driven playback engine, and the media content is automatically played through the media playback engine, without additional input required by the user.
H04N 21/435 - Traitement de données additionnelles, p.ex. décryptage de données additionnelles ou reconstruction de logiciel à partir de modules extraits du flux de transport
H04N 21/462 - Gestion de contenu ou de données additionnelles, p.ex. création d'un guide de programmes électronique maître à partir de données reçues par Internet et d'une tête de réseau ou contrôle de la complexité d'un flux vidéo en dimensionnant la résolution o
H04N 21/466 - Procédé d'apprentissage pour la gestion intelligente, p.ex. apprentissage des préférences d'utilisateurs pour recommander des films
H04N 21/475 - Interface pour utilisateurs finaux pour acquérir des données d'utilisateurs finaux, p.ex. numéro d'identification personnel [PIN] ou données de préférences
G11B 27/022 - Montage électronique de signaux d'information analogiques, p.ex. de signaux audio, vidéo
H04N 21/458 - Ordonnancement de contenu pour créer un flux personnalisé, p.ex. en combinant une publicité stockée localement avec un flux d'entrée; Opérations de mise à jour, p.ex. pour modules de système d'exploitation
H04N 21/442 - Surveillance de procédés ou de ressources, p.ex. détection de la défaillance d'un dispositif d'enregistrement, surveillance de la bande passante sur la voie descendante, du nombre de visualisations d'un film, de l'espace de stockage disponible dans l
61.
SYSTEM AND METHOD FOR ENABLING INTERACTION WITH AN ELECTRONIC DEVICE
In accordance with an embodiment, described herein is a system and method for enabling advertisement interaction with an electronic device, for use in a digital media content environment. A media device includes a touch-sensitive display screen and a user interface that enables access to a stored media application. During the streaming of media content from a media server, an audio advertisement or other type of advertisement can be inserted into a stream for playback at the media device. Using playback controls provided either by the user interface or by headphones, a headset, or another type of device, a user can interact with the advertisement (even if the user interface is locked) by performing a defined gesture, which, if received at the media application during a defined time period associated with the advertisement, is interpreted as a signal to select or otherwise interact with the advertisement.
A source device being associated with an account uses playback of a media content item to cause a target device to become associated with the account. The target device enters an association mode and records a portion of the playing content. The target device provides the recording to a server that identifies the song (e.g., using a music fingerprint service) and uses the identification of the song to find the account that caused playback of the identified song. With the account identified, the server provides credentials of the account to target system. The target device accesses content or services using the account. As confirmation of receiving the credentials, the server causes playback of the content to transition to from the source device to the target device.
Methods, systems, and computer programs for generating a playlist of media content items that are popular with the friends of the first user. A first user taste profile is determined and a user taste profile is determined for each of a plurality of social connections. A similarity score is calculated between the first user taste profile and the user taste profile of each social connection. Media content items consumed by social connections with the highest similarity score are selected and placed in a playlist for the first user.
H04L 65/611 - Diffusion en flux de paquets multimédias pour la prise en charge des services de diffusion par flux unidirectionnel, p.ex. radio sur Internet pour la multidiffusion ou la diffusion
H04L 65/612 - Diffusion en flux de paquets multimédias pour la prise en charge des services de diffusion par flux unidirectionnel, p.ex. radio sur Internet pour monodiffusion [unicast]
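The friend-based playlist abstract above can be illustrated with a minimal sketch. It assumes taste profiles are plain numeric vectors and uses cosine similarity as the similarity score; the abstract does not name a metric, so that choice, along with the names `cosine_similarity`, `friends_playlist`, `top_k`, and `max_items`, is hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def friends_playlist(user_profile, friends, top_k=2, max_items=10):
    """Build a playlist from the items consumed by the most similar friends.

    friends maps a friend name to (taste_profile, consumed_items).
    Both top_k and max_items are illustrative parameters.
    """
    ranked = sorted(
        friends,
        key=lambda name: cosine_similarity(user_profile, friends[name][0]),
        reverse=True,
    )
    playlist = []
    for name in ranked[:top_k]:
        for item in friends[name][1]:
            if item not in playlist:  # keep first occurrence only
                playlist.append(item)
    return playlist[:max_items]


friends = {
    "a": ([1.0, 0.1], ["x", "y"]),
    "b": ([0.0, 1.0], ["z"]),
    "c": ([0.9, 0.2], ["y", "w"]),
}
print(friends_playlist([1.0, 0.0], friends))
```

Friends `a` and `c` are closest to the user's profile here, so only their items reach the playlist.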
A system for device discovery for social playback is disclosed. The system operates to connect a host media playback device to a media output device and broadcast a social playback session to guest media playback devices. Upon joining a social playback session, a guest media playback device may control the media playback at the host media playback device. Where the media output for the social playback session is provided by the media output device.
H04L 65/60 - Diffusion en flux de paquets multimédias
H04W 4/80 - Services utilisant la communication de courte portée, p.ex. la communication en champ proche, l'identification par radiofréquence ou la communication à faible consommation d’énergie
65.
TEXT COMMAND BASED GROUP LISTENING SESSION PLAYBACK CONTROL
A method of providing a group listening session to users of a messaging platform includes: receiving, at a media streaming platform, a message feed from the messaging platform; parsing, at the media streaming platform, the message feed to identify one or more commands included in the message feed, wherein the one or more commands are associated with the group listening session provided by the media streaming platform; and controlling the group listening session according to the one or more commands, each of the users being associated with a media device for participating in the group listening session.
G06F 16/638 - Presentation of query results
G06F 16/683 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually, using metadata automatically derived from the content
A first electronic device, while a first media content item of a first type is playing back on a second electronic device, detects a user input selecting a second media content item. In response to the user input, the first electronic device determines a type of media content item of the selected second media content item. In accordance with a determination that the second media content item is of the first type of media content item, the first electronic device generates a media control request for controlling playback of the second media content item at the second electronic device, and in accordance with a determination that the second media content item is of a second type of media content item, the first electronic device generates a media control request for controlling playback of the second media content item at a third electronic device that is distinct from the second electronic device.
H04L 65/60 - Network streaming of media packets
H04N 21/433 - Content storage operation, e.g. storage operation in response to a pause request or caching operations
H04N 21/443 - Operating system processes, e.g. booting a set-top box [STB], implementing a Java virtual machine in an STB or power management in an STB
H04N 21/472 - End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification or for manipulating displayed content
H04L 69/329 - Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions in the application layer [OSI layer 7]
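As a rough illustration of the text-command abstract above, the sketch below parses a message feed and applies the commands it finds to a session. The `!play`, `!pause`, `!skip`, and `!queue` syntax, the `Session` class, and the `(user, text)` message shape are all assumptions made for the example; the abstract does not specify a command format.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical command syntax: a leading "!" followed by a keyword and an optional argument.
COMMAND_RE = re.compile(r"^!(play|pause|skip|queue)\b\s*(.*)$", re.IGNORECASE)


@dataclass
class Session:
    queue: List[str] = field(default_factory=list)
    now_playing: Optional[str] = None
    playing: bool = False


def parse_feed(messages):
    """Yield (user, command, argument) for each message that contains a command."""
    for user, text in messages:
        match = COMMAND_RE.match(text.strip())
        if match:
            yield user, match.group(1).lower(), match.group(2).strip()


def apply_command(session, command, argument):
    """Update the session state according to one parsed command."""
    if command == "play" and argument:
        session.now_playing, session.playing = argument, True
    elif command == "pause":
        session.playing = False
    elif command == "queue" and argument:
        session.queue.append(argument)
    elif command == "skip":
        session.now_playing = session.queue.pop(0) if session.queue else None
        session.playing = session.now_playing is not None


feed = [
    ("ann", "hello all"),
    ("bob", "!queue Song A"),
    ("ann", "!play Song B"),
    ("bob", "!skip"),
    ("ann", "!pause"),
]
session = Session()
for _user, command, argument in parse_feed(feed):
    apply_command(session, command, argument)
print(session)
```

Ordinary chat messages are ignored because they do not match the command pattern, so only the four commands change the session.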
67.
Systems and Methods for Jointly Estimating Sound Sources and Frequencies from Audio
An electronic device receives a first audio content item that includes a plurality of sound sources. The electronic device generates a representation of the first audio content item. The electronic device determines, from the representation of the first audio content item: a representation of an isolated sound source, and frequency data associated with the isolated sound source. Determining the representation of the isolated sound source and the frequency data associated with the isolated sound source includes using a neural network to jointly determine the representation of the isolated sound source and the frequency data associated with the isolated sound source. The electronic device determines that a portion of a second audio content item matches the first audio content item using the representation of the isolated sound source and/or the frequency data associated with the isolated sound source.
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of the groups, specially adapted for particular use, for comparison or discrimination
A system and method for controlling access to an on-device machine learning model without the use of encryption is described herein. For example, a request is received from an application executing on a device of a user. The request is to download a machine learning model to the device that enables a feature of the application, and the request includes information associated with the user and/or the device. The information is used to create an obfuscation key, and a derivative model can be generated using a reference copy of the machine learning model and the obfuscation key. The derivative model and the obfuscation key are then sent to the application. When the obfuscation key is provided to the derivative model at runtime, values derived from the obfuscation key are provided as additional inputs that enable the derivative model to function properly.
A method of integrating a playback device for use with a backend server of a media streaming platform includes the following steps: providing an application programming interface (API) command processor at a server to send and receive network communication with a cloud playback adapted system; receiving at the API command processor, from a cloud playback client associated with the cloud playback adapted system, a status of the cloud playback adapted system; receiving at the API command processor, from the cloud playback client, a playback command to control playback of a media content item; and sending a message from the API command processor to the cloud playback client in response to the playback command, the message including an identification of the media content item to permit the cloud playback adapted system to retrieve the media content item for playback of the media content item by the cloud playback adapted system.
A computing device comprising a display screen, the computing device being configured to decompose a media recording into a plurality of media recording salient events, apply each of the media recording salient events to a reinforcement model, display on the display screen (i) a mapping of the plurality of media recording salient events and (ii) for at least one of the plurality of media recording salient events, at least one selectable next best action, the computing device further configured to replace at least one of the plurality of media recording salient events with at least one selectable next best action to create a derivative media recording including at least one replacement media recording action.
G06F 16/683 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually, using metadata automatically derived from the content
A computer server system associates one or more media items with a first segment of a first media item, the one or more media items selected based on current location information of a media device. The computer server system receives, from a media device, a request for a media item associated with the first media item, wherein the request includes a media segment identifier for the first segment of the first media item. In response to the request, the computer server system identifies the one or more media items associated with the first segment and provides the one or more media items to the media device.
H04N 21/845 - Structuring of content, e.g. decomposing content into time segments
H04L 65/60 - Network streaming of media packets
H04N 21/472 - End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification or for manipulating displayed content
H04N 21/439 - Processing of audio elementary streams
H04N 21/41 - Structure of client; Structure of client peripherals
H04N 21/43 - Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising the decoder's clock; Client middleware
H04N 21/2368 - Multiplexing of audio and video streams
H04N 21/4722 - End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification or for manipulating displayed content, for requesting additional data associated with the content
H04L 65/61 - Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio
H04L 65/612 - Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio, for unicast
A request to play a media content item is received. It is determined whether the play request is ambiguous. Responsive to determining that the play request is ambiguous, it is determined whether to play a suspended media content item or an alternate media content item. The determination can be made based on a length of time that the suspended media content item has been suspended, a media content item type, or a state, among other factors. Responsive to the determination, playback of the suspended or alternate media content item is initiated.
H04L 65/613 - Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio, for the control of the source by the destination
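The decision described in the ambiguous-play-request abstract above could look roughly like the following sketch. It treats a request with no named item as ambiguous and resumes the suspended item only if it was suspended recently enough for its type. The `RESUME_WINDOW_SECONDS` values, the `SuspendedItem` class, and the function name are assumptions for illustration only.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical per-type limits on how long a suspended item stays resumable.
RESUME_WINDOW_SECONDS = {"podcast": 7 * 24 * 3600, "track": 30 * 60}


@dataclass
class SuspendedItem:
    item_id: str
    item_type: str       # e.g. "podcast" or "track"
    suspended_at: float  # timestamp in seconds


def choose_item(requested_id: Optional[str],
                suspended: Optional[SuspendedItem],
                alternate_id: str,
                now: float) -> str:
    """Pick which item to play for a possibly ambiguous play request."""
    if requested_id is not None:
        return requested_id              # the request names an item: not ambiguous
    if suspended is None:
        return alternate_id              # nothing to resume
    window = RESUME_WINDOW_SECONDS.get(suspended.item_type, 0)
    if now - suspended.suspended_at <= window:
        return suspended.item_id         # suspended recently enough to resume
    return alternate_id


paused_track = SuspendedItem("track-1", "track", suspended_at=0.0)
print(choose_item(None, paused_track, "daily-mix", now=600.0))
print(choose_item(None, paused_track, "daily-mix", now=7200.0))
```

The first call resumes the track after ten minutes; the second falls back to the alternate item because two hours exceeds the assumed window for tracks.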
A method includes retrieving a text from a database. The text corresponds to audio from a media content item that is provided by a media providing service, and the text includes a plurality of segments. The method also includes assigning a score for each segment in the text by applying the text to a trained computational model. The score corresponds to a predicted relevance of the respective segment to a narrative of the media content item. The method further includes identifying a non-narrative segment within the text using the assigned scores.
G06F 16/683 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually, using metadata automatically derived from the content
A computer extracts a vocal portion from a first audio content item and determines a first representative vector that corresponds to a vocal style of the first audio content item by applying a variational autoencoder (VAE) to the extracted vocal portion of the representation of the audio content item. The computer streams, to an electronic device, a second audio content item, selected from a plurality of audio content items, that has a second representative vector that corresponds to a vocal style of the second audio content item, wherein the second representative vector that corresponds to the vocal style of the second audio content item meets similarity criteria with respect to the first representative vector that corresponds to the vocal style of the first audio content item.
G10L 19/22 - Mode decision, i.e. based on audio signal content versus external parameters
G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of the groups, characterised by the analysis technique using neural networks
G06F 16/48 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of the groups, specially adapted for particular use, for comparison or discrimination
A system for refinement of a voice query interpretation interprets a voice query received at a voice-enabled device to identify commands responsive to the voice query for execution at the voice-enabled device, and enables refinement of the interpretation of the voice query through a graphical user interface generated and displayed at a GUI-capable device. The graphical user interface includes a set of selectable options relating to the voice query and identifying a refinement of the interpretation of the voice query to enable control and/or adjustment of commands to be executed by the voice-enabled device. For example, if one of the selectable options is selected, then a command associated with the selected option is identified and executed by the voice-enabled device.
An audio translation system includes a feature extractor and a style transfer machine learning model. The feature extractor generates for each of a plurality of source voice files one or more source voice parameters encoded as a collection of source feature vectors, and generates for each of a plurality of target voice files one or more target voice parameters encoded as a collection of target feature vectors. The style transfer machine learning model is trained on the collection of source feature vectors for the plurality of source voice files and the collection of target feature vectors for the plurality of target voice files to generate a style transformed feature vector.
G10L 25/45 - Speech or voice analysis techniques not restricted to a single one of the groups, characterised by the type of analysis window
G10L 25/75 - Speech or voice analysis techniques not restricted to a single one of the groups, for modelling vocal tract parameters
G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
77.
SYSTEMS AND METHODS FOR PROVIDING RESPONSES FROM MEDIA CONTENT
A method includes retrieving a plurality of transcripts from a database. Each transcript in the plurality of transcripts corresponds to audio from a media content item of a plurality of media content items that are provided by a media providing service. The method also includes applying each transcript of the plurality of transcripts to a trained computational model, and receiving a user request for information regarding a topic. The method further includes, in response to the user request, identifying a transcript from the database that is relevant to the topic, and a position within the transcript that is relevant to the topic. The method also includes providing, by the media providing service, at least a portion of a media content item corresponding to the identified transcript, beginning at a starting position that is based on the position within the identified transcript that is relevant to the topic.
G10L 15/18 - Speech classification or search using natural language modelling
G10L 15/10 - Speech classification or search using distance or distortion measures between unknown speech and reference templates
Systems and methods are provided for detecting wake words. An electronic device detects, from a microphone array, an audio signal in an environment proximate to the audio front end system. The electronic device processes the audio signal using a plurality of wake word detection engines, including dynamically adjusting how many wake word detection engines are available for processing the audio signal. The electronic device independently adjusts respective wake word detection thresholds for the plurality of wake word detection engines used to process the audio signal.
G10L 15/30 - Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
G10L 21/0232 - Processing in the frequency domain
H04R 1/40 - Arrangements for obtaining desired frequency or directional characteristics, for obtaining the desired directional characteristic only by combining a number of identical transducers
A method for controlling media presentation is disclosed. In some implementations, the method is performed at an electronic device having one or more processors and memory storing one or more programs for execution by the one or more processors. The electronic device displays a user interface, the user interface including a control element for media that is provided by a media server associated with a first service provider, the user interface originating from a server associated with a second service provider distinct from the first service provider of the media server. The electronic device receives a user input selecting the control element. In response to the user input, the electronic device sends a media control request to the media server, the media control request causing the media server to control playback of a playlist, including initiating playback of first media content in the playlist associated with the user.
G06F 3/0484 - Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range of values
H04L 65/612 - Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio, for unicast
80.
VOICE RECOGNITION SYSTEM FOR USE WITH A PERSONAL MEDIA STREAMING APPLIANCE
A system and method for voice control of a media playback device is disclosed. The method includes receiving an instruction of a voice command, converting the voice command to text, transmitting the text command to the playback device, and having the playback device execute the command. An instruction may include a command to play a set of audio tracks, and the media playback device plays the set of audio tracks upon receiving the instruction.
A server system obtains a collection of images, each image in the collection of images being associated with a first set of text descriptors. The server system obtains a media item being associated with a second set of text descriptors. The server system selects a subset of the collection of images, including: selecting an initial subset of the collection of images, wherein the initial subset of the collection of images consists of images that share a text descriptor with the media item; obtaining a set of preferences for a user of the media-providing service; and selecting the subset of the collection of images from the initial subset of the collection of images based on the set of preferences for the user of the media-providing service. The server system concurrently presents: a respective image of the subset of the collection of images; and the media item.
G06F 16/438 - Presentation of query results
G06F 16/535 - Filtering based on additional data, e.g. user or group profiles
G06F 16/583 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually, using metadata automatically derived from the content
82.
System for Managing Transitions Between Media Content Items
A system for playing media content items operates to provide smooth transitions between the media content items to continuously support a user's repetitive motion activity. The system can generate crossfade data containing information for transitions between media content items. The mix-in and mix-out points for the transitions are calculated to eliminate one or more portions of media content items that have lower musical energy than a majority portion of the items, and to maintain substantially consistent and/or stable musical energy (e.g., audio power or sound power) throughout the media content items including transitions therebetween.
G06F 16/40 - Information retrieval; Database structures therefor; File system structures therefor, of multimedia data, e.g. slideshows comprising image and additional audio data
G06F 16/635 - Filtering based on additional data, e.g. user or group profiles
G06F 16/683 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually, using metadata automatically derived from the content
H04W 4/80 - Services using short range communication, e.g. near-field communication [NFC], radio-frequency identification [RFID] or low energy communication
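For the transition-management abstract above, one way to picture the mix-in and mix-out calculation is to trim low-energy windows from both ends of a track. The sketch below assumes per-window energy values are already available and uses a fraction of the median energy as the cut-off; the threshold rule and the names `mix_points` and `ratio` are assumptions, not the method described in the abstract.

```python
from statistics import median


def mix_points(energy, ratio=0.5):
    """Return (mix_in, mix_out) window indices for one track.

    energy is a list of per-window energy values. Leading and trailing windows
    whose energy is below ratio * median(energy) are trimmed. mix_out is exclusive.
    """
    if not energy:
        return 0, 0
    threshold = ratio * median(energy)
    start = 0
    while start < len(energy) and energy[start] < threshold:
        start += 1                      # skip the quiet intro
    end = len(energy)
    while end > start and energy[end - 1] < threshold:
        end -= 1                        # skip the quiet outro
    return start, end


track_energy = [1, 2, 10, 11, 9, 10, 2, 1]
print(mix_points(track_energy))
```

With these values the quiet two-window intro and outro are dropped, so a crossfade would enter at window 2 and leave before window 6.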
An apparatus for supporting a media playback device on a structure of a vehicle comprising a grille, the apparatus comprising an elongate base body having a front side configured to connect the apparatus to the media playback device and a rear side, and first and second connection members, each comprising a respective connection portion protruding away from the rear side of the base body and comprising at least one slot configured to receive a bar of a grille, wherein the first and second connection members are translatably arranged such that the distance between the first and second connection members may be varied.
B60R 11/02 - Other arrangements for holding or mounting articles, for radio sets, television sets, telephones, or the like; Arrangement of controls thereof
B60R 11/00 - Other arrangements for holding or mounting articles
84.
Human-machine interfaces for utterance-based playlist selection
Systems, methods, and devices for human-machine interfaces for utterance-based playlist selection are disclosed. In one method, a list of playlists is traversed and a portion of each is audibly output until a playlist command is received. Based on the playlist command, the traversing is stopped and a playlist is selected for playback. In examples, the list of playlists is modified based on a modification input.
An apparatus for supporting a media playback device on a structure of a vehicle comprising an optical disc drive, the apparatus comprising an elongate member configured to be inserted into the slot of the optical disc drive and, when inserted, provide a spring force to retain the apparatus in the slot, and an attachment member connected to the elongate member, the attachment member configured to connect the apparatus to the media playback device.
B60R 11/02 - Other arrangements for holding or mounting articles, for radio sets, television sets, telephones, or the like; Arrangement of controls thereof
86.
SYSTEMS AND METHODS FOR JOINING A SHARED LISTENING SESSION
A server system stores a list of trusted users that are authorized to access a first electronic device's shared listening sessions. While the first electronic device is hosting a shared listening session playing back media on a second electronic device, the server system receives, from a third electronic device, a first request that includes an identifier of the second electronic device. In response to the first request and in accordance with a determination that the third electronic device is associated with a user that is on the list of trusted users, the server system automatically transmits, to the third electronic device, a session identifier for the shared listening session. The server system receives, from the third electronic device, a second request to join the shared listening session and, in response to the second request, provides, to the third electronic device, access to control media playback of the shared listening session.
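The two-request flow in the abstract above can be sketched as a small in-memory server. The class `SharedSessionServer`, its method names, and the identifier strings are hypothetical; a real system would also authenticate the requests, which is omitted here.

```python
from typing import Optional


class SharedSessionServer:
    """Toy model of the trusted-user lookup and join steps."""

    def __init__(self):
        self.trusted_users = {}  # host device id -> set of trusted user ids
        self.sessions = {}       # session id -> session record

    def trust(self, host_device, user_id):
        self.trusted_users.setdefault(host_device, set()).add(user_id)

    def start_session(self, session_id, host_device, output_device):
        self.sessions[session_id] = {
            "host": host_device,
            "output_device": output_device,
            "controllers": {host_device},
        }

    def find_session(self, user_id, output_device) -> Optional[str]:
        """First request: return a session id only if the user is trusted by the host."""
        for session_id, record in self.sessions.items():
            trusted = self.trusted_users.get(record["host"], set())
            if record["output_device"] == output_device and user_id in trusted:
                return session_id
        return None

    def join(self, user_id, device_id, session_id) -> bool:
        """Second request: grant playback control to the requesting device."""
        record = self.sessions.get(session_id)
        if record is None or user_id not in self.trusted_users.get(record["host"], set()):
            return False
        record["controllers"].add(device_id)
        return True


server = SharedSessionServer()
server.trust("host-phone", "alice")
server.start_session("s1", "host-phone", "living-room-speaker")
session_id = server.find_session("alice", "living-room-speaker")
print(session_id, server.join("alice", "alice-phone", session_id))
print(server.find_session("mallory", "living-room-speaker"))
```

A user who is not on the host's trusted list never receives the session identifier, so the join step is not reached for that user.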
A first server system is configured to communicate with a first client device through a first application executing on the first client device. The first server system determines that communication with the first client device through the first application has been lost due to the first client device entering an idle mode. The first server system receives a request from a second client device that triggers reestablishing communication with the first client device through the first application. In response, the first server system transmits a request to a second server system to wake the first client device from the idle mode. The first server system receives, from the first application on the first client device, an indication that communication has been reestablished between the first server system and the first application. The first server system transmits a control command to control the first client device.
H04L 67/025 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP], for remote control or remote monitoring of applications
H04L 12/28 - Data switching networks characterised by path configuration, e.g. local area networks [LAN] or wide area networks [WAN]
88.
Personalizing explainable recommendations with bandits
Methods, systems and computer program products are provided for personalizing recommendations of items with associated explanations. The example embodiments described herein use contextual bandits to personalize explainable recommendations (“recsplanations”) as treatments (“Bart”). Bart learns and predicts satisfaction (e.g., click-through rate, consumption probability) for any combination of item, explanation, and context and, through logging and contextual bandit retraining, can learn from its mistakes in an online setting.
G06F 16/00 - Information retrieval; Database structures therefor; File system structures therefor
G06F 16/635 - Filtering based on additional data, e.g. user or group profiles
G06F 16/638 - Presentation of query results
G06F 18/21 - Design or setup of systems or techniques; Extraction of features in feature space; Blind source separation
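To make the bandit idea in the abstract above concrete, here is a deliberately simplified sketch: an epsilon-greedy learner that keeps a running mean reward for each (context, item, explanation) combination. This is a generic textbook technique swapped in for illustration, not the Bart model itself, and the class name, context strings, and reward values are assumptions.

```python
import random
from collections import defaultdict


class EpsilonGreedyRecommender:
    """Epsilon-greedy bandit over (item, explanation) arms, keyed by context."""

    def __init__(self, arms, epsilon=0.1, seed=None):
        self.arms = list(arms)            # each arm is an (item, explanation) pair
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.counts = defaultdict(int)    # (context, arm) -> number of observations
        self.means = defaultdict(float)   # (context, arm) -> mean observed reward

    def choose(self, context):
        """Explore with probability epsilon, otherwise pick the best-known arm."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.arms)
        return max(self.arms, key=lambda arm: self.means[(context, arm)])

    def update(self, context, arm, reward):
        """Fold one logged reward (e.g. 1.0 for a click) into the running mean."""
        key = (context, arm)
        self.counts[key] += 1
        self.means[key] += (reward - self.means[key]) / self.counts[key]


arm_popular = ("album-1", "Because it is popular")
arm_similar = ("album-1", "Because you like jazz")
bandit = EpsilonGreedyRecommender([arm_popular, arm_similar], epsilon=0.0)
bandit.update("morning", arm_popular, 0.0)
bandit.update("morning", arm_similar, 1.0)
print(bandit.choose("morning"))
```

With `epsilon=0.0` the learner never explores, so after one click on the second explanation it keeps choosing that arm for the same context.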
Methods, systems, and computer programs for generating a playlist of media content items for a group of users. Media content items listened to by the selected users are compared to an average user taste profile to select media content items for playback to the group of users.
G06F 16/2457 - Query processing with adaptation to user needs
G06F 16/41 - Indexing; Data structures therefor; Storage structures
G06F 16/44 - Browsing; Visualisation therefor
H04N 21/2668 - Creating a channel for a dedicated end-user group, e.g. by inserting targeted commercials into a video stream based on end-user profiles
H04N 21/262 - Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying
A system, method and computer program product for accessing content. The method comprises processing at least one image with a classifier, and, in response to the at least one image being processed by the classifier, outputting from the classifier a value indicative of the likelihood that the at least one image belongs to at least one classification. The method also comprises determining whether the at least one image belongs to the at least one classification, based on the value, and accessing predetermined content when it is determined that the at least one image belongs to the at least one classification. Images may be classified by, e.g., genre, musical album, concept, or the like, and, in cases where an image belongs to any such classes, predetermined content (e.g., metadata and/or an audio track) relating thereto is identified and presented to the user.
An electronic device obtains a listening history for a media item, the listening history including retention information indicating, for each respective portion of a plurality of portions of the media item, a number of listeners who listened to the respective portion of the media item. The electronic device, using the retention information, determines a pattern indicating a reduction in the number of listeners who listened to corresponding portions of the media item and determines a start time and an end time corresponding to a first portion of the corresponding portions of the media item. In accordance with the determination that the first portion of the corresponding portions of the media item meets predefined sub-content criteria, the electronic device stores an indication that the first portion of the media item comprises first sub-content, different from primary content, embedded in the media item.
H04N 21/24 - Monitoring of processes or resources, e.g. monitoring of server load, available bandwidth or upstream requests
H04N 21/25 - Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication or
H04N 21/234 - Processing of video elementary streams, e.g. splicing of video streams or manipulating MPEG-4 scene graphs
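The retention-based detection in the abstract above can be sketched as a search for a run of portions where the listener count falls well below the count just before the run. The `drop_ratio` threshold, the fixed portion length, and the function name `find_dip` are assumptions; the abstract's "predefined sub-content criteria" are not specified, so the sketch stops at locating the start and end times.

```python
def find_dip(listeners, portion_seconds, drop_ratio=0.7):
    """Locate the first dip in retention.

    listeners[i] is the number of listeners for portion i. A dip starts at the
    first portion whose count is below drop_ratio times the previous portion's
    count, and ends when the count rises back above that level.
    Returns (start_time, end_time) in seconds, or None if there is no dip.
    """
    for i in range(1, len(listeners)):
        baseline = listeners[i - 1]
        if listeners[i] < drop_ratio * baseline:
            j = i
            while j < len(listeners) and listeners[j] < drop_ratio * baseline:
                j += 1
            return i * portion_seconds, j * portion_seconds
    return None


retention = [100, 98, 97, 50, 48, 52, 95, 94]
print(find_dip(retention, portion_seconds=30))
```

Here portions 3 to 5 lose roughly half the audience and then recover, which yields a candidate span from 90 to 180 seconds that could then be checked against whatever sub-content criteria apply.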
92.
TEXT-TO-SPEECH AND SPEECH RECOGNITION FOR NOISY ENVIRONMENTS
The present disclosure relates generally to speech processing. Humans change their speech patterns in noisy environments. The systems and devices described herein can compensate for noisy environments to be more human-like. Thus, the configurations and implementations herein can determine a sound profile for the sound environment where the user is listening. Based on the sound profile, the devices can determine a transform to apply to output speech from the device. This transform is applied to the wake word, speech recognition, and to the output speech to compensate for the noise level of the environment by mimicking the Lombard effect.
G10L 13/033 - Procédés d'élaboration de parole synthétique; Synthétiseurs de parole Édition de voix, p.ex. transformation de la voix du synthétiseur
G10L 13/08 - Analyse de texte ou génération de paramètres pour la synthèse de la parole à partir de texte, p.ex. conversion graphème-phonème, génération de prosodie ou détermination de l'intonation ou de l'accent tonique
Systems, devices, apparatuses, components, methods, and techniques for predicting user and media-playback device states are provided. Systems, devices, apparatuses, components, methods, and techniques for media content item caching on a media-playback device are also provided. Systems, devices, apparatuses, components, methods, and techniques for predicting a destination are also provided.
H04L 65/61 - Diffusion en flux de paquets multimédias pour la prise en charge des services de diffusion par flux unidirectionnel, p.ex. radio sur Internet
A skip behavior analyzer is part of a media delivery system that allows for unbiased A/B testing of a plurality of versions of a song. The media delivery system stores a plurality of versions of a song and randomly selects, for each requesting device, a version of the song to associate with that device. Each time the device subsequently requests the song, the media delivery system provides the same version for consistency. The media delivery system then gathers song play and skip information, calculates the differences in the distribution of skip behavior across versions, and provides the skip information so that a music composer can better determine which version of a song is more popular and why.
G06F 16/68 - Recherche de données caractérisée par l’utilisation de métadonnées, p.ex. de métadonnées ne provenant pas du contenu ou de métadonnées générées manuellement
G06F 16/9538 - Présentation des résultats des requêtes
G05B 15/02 - Systèmes commandés par un calculateur électriques
H04N 21/442 - Surveillance de procédés ou de ressources, p.ex. détection de la défaillance d'un dispositif d'enregistrement, surveillance de la bande passante sur la voie descendante, du nombre de visualisations d'un film, de l'espace de stockage disponible dans l
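The consistent per-device assignment in the song-version testing abstract above can be sketched as follows. This is an illustration under stated assumptions, not the patented system: the class `SongVersionTest`, its method names, and the in-memory dictionaries are hypothetical, and a per-version skip rate stands in for the fuller comparison of skip-behavior distributions that the abstract mentions.

```python
import random
from collections import defaultdict
from typing import Dict, List, Optional


class SongVersionTest:
    """Randomly assign each device one song version and keep it fixed."""

    def __init__(self, versions: List[str], seed: Optional[int] = None) -> None:
        self._versions = list(versions)
        self._rng = random.Random(seed)
        self._assigned: Dict[str, str] = {}          # device id -> version
        self._plays: Dict[str, int] = defaultdict(int)
        self._skips: Dict[str, int] = defaultdict(int)

    def version_for(self, device_id: str) -> str:
        # The random choice happens only on the first request; later
        # requests from the same device return the stored version.
        if device_id not in self._assigned:
            self._assigned[device_id] = self._rng.choice(self._versions)
        return self._assigned[device_id]

    def record(self, device_id: str, skipped: bool) -> None:
        version = self.version_for(device_id)
        self._plays[version] += 1
        if skipped:
            self._skips[version] += 1

    def skip_rates(self) -> Dict[str, float]:
        # Only versions that have been played at least once are reported.
        return {
            v: self._skips[v] / self._plays[v]
            for v in self._versions
            if self._plays[v] > 0
        }


test = SongVersionTest(["version_a", "version_b"], seed=1)
first = test.version_for("device-1")
test.record("device-1", skipped=True)
test.record("device-1", skipped=False)
rates = test.skip_rates()
```

Storing the assignment rather than re-drawing on each request is what keeps the comparison unbiased: a device never hears both versions, so its skips can be attributed to a single version.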
95.
AUTOMATED PAUSE OF MEDIA CONTENT PLAYBACK BASED ON SOUND LEVEL
An example system for playing media content with a media playback device in a vehicle can be programmed to obtain a sound measurement indicative of a sound level associated with playback of the media content by the media playback device in the vehicle. The example system can also determine a deviation from an expected sound level based upon the sound measurement. Finally, the system can modify playback of the media content by the media playback device based upon the deviation.
H03G 3/32 - Commande automatique dans des amplificateurs comportant des dispositifs semi-conducteurs le réglage dépendant du niveau de bruit ambiant ou du niveau sonore ambiant
H04R 1/40 - Dispositions pour obtenir la fréquence désirée ou les caractéristiques directionnelles pour obtenir la caractéristique directionnelle désirée uniquement en combinant plusieurs transducteurs identiques
H03G 3/34 - Commande automatique dans des amplificateurs comportant des dispositifs semi-conducteurs rendant l'amplificateur muet en l'absence de signal
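The measure, compare, and modify flow in the automated-pause abstract above can be sketched in a few lines. The function name `playback_action`, the decibel units, and the 6 dB tolerance are assumptions made for illustration; the abstract does not say how the deviation is computed or which playback modification is applied.

```python
def playback_action(
    measured_db: float,
    expected_db: float,
    tolerance_db: float = 6.0,  # hypothetical threshold, not from the source
) -> str:
    """Return "pause" when the measured level deviates too far from expected.

    The deviation is taken as an absolute difference, so both an
    unexpectedly loud cabin and an unexpectedly quiet one trigger a pause.
    """
    deviation = abs(measured_db - expected_db)
    return "pause" if deviation > tolerance_db else "continue"


# Expected level for the current volume setting versus the microphone reading.
action = playback_action(measured_db=78.0, expected_db=65.0)
```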
96.
SYSTEM AND METHOD FOR PROVIDING DIGITAL MEDIA CONTENT WITH A CONVERSATIONAL MESSAGING ENVIRONMENT
A media server provides access to digital media content within a third-party conversational messaging environment. The media server includes one or more processors and memory storing one or more programs, including instructions for: receiving, via an application programming interface (API), information from a user's interaction within a conversation of the third-party conversational messaging environment; determining, via a recommendation engine, a recommended media item based on the user's interaction within the conversation; and providing, to a device of the user, the recommended media item based on the user's interaction with the conversation.
H04L 51/02 - Messagerie d'utilisateur à utilisateur dans des réseaux à commutation de paquets, transmise selon des protocoles de stockage et de retransmission ou en temps réel, p.ex. courriel en utilisant des réactions automatiques ou la délégation par l’utilisateur, p.ex. des réponses automatiques ou des messages générés par un agent conversationnel
H04L 65/60 - Diffusion en flux de paquets multimédias
H04L 51/046 - Interopérabilité avec d'autres applications ou services réseau
97.
System and method for use of cyclic play queues in a media content environment
An electronic device, while providing a first playlist for playback by a media device, maintains a play queue for a first user that is initially populated with a list of media content items to be played by the media device, the initially populated list of media content items including media content items distinct from the first playlist and selected by the first user and media content items from the first playlist. The electronic device receives a request to change playback from the first playlist to a second playlist different from the first playlist. In response to the request to change the playback from the first playlist to the second playlist, the electronic device maintains, in the play queue, the media content items selected by the first user while changing playback from the first playlist to the second playlist.
H04L 65/75 - Gestion des paquets du réseau multimédia
H04N 21/43 - Traitement de contenu ou données additionnelles, p.ex. démultiplexage de données additionnelles d'un flux vidéo numérique; Opérations élémentaires de client, p.ex. surveillance du réseau domestique ou synchronisation de l'horloge du décodeur; Intergiciel de client
H04N 21/84 - Génération ou traitement de données de description, p.ex. descripteurs de contenu
H04N 21/482 - Interface pour utilisateurs finaux pour la sélection de programmes
G06F 3/0482 - Interaction avec des listes d’éléments sélectionnables, p.ex. des menus
G06F 3/04842 - Sélection des objets affichés ou des éléments de texte affichés
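The play-queue behavior in the abstract above, where user-selected items survive a change of playlist, can be sketched with a small data structure. The `PlayQueue` class, its field names, and the choice to play user-selected items ahead of playlist items are assumptions made for illustration and are not stated in the source.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PlayQueue:
    """A play queue that tracks user-selected and playlist items separately."""

    user_items: List[str] = field(default_factory=list)
    playlist_items: List[str] = field(default_factory=list)

    def items(self) -> List[str]:
        # Assumed ordering: items the user queued by hand play first.
        return self.user_items + self.playlist_items

    def switch_playlist(self, new_playlist: List[str]) -> None:
        # Only the playlist-derived part of the queue is replaced;
        # user-selected items are left untouched.
        self.playlist_items = list(new_playlist)


queue = PlayQueue(user_items=["song-u1", "song-u2"], playlist_items=["p1-a", "p1-b"])
queue.switch_playlist(["p2-x", "p2-y"])
upcoming = queue.items()
```

Keeping the two sources in separate lists makes the playlist switch a single assignment, with no need to tag or filter individual queue entries.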
98.
Repetitive-Motion Activity Enhancement Based Upon Media Content Selection
Systems, devices, apparatuses, components, methods, and techniques for repetitive-motion activity enhancement based upon media content selection are provided. An example media-playback device for enhancement of a repetitive-motion activity includes a media-output device that plays media content items, a plurality of media content selection engines, and a repetitive-activity enhancement mode selection engine. The plurality of media content selection engines includes a cadence-based media content selection engine and an enhancement program engine. The cadence-based media content selection engine is configured to select media content items based on a cadence associated with the repetitive-motion activity. The enhancement program engine is configured to select media content items according to an enhancement program for the repetitive-motion activity. The repetitive-activity enhancement mode selection engine is configured to select a media content selection engine from the plurality of engines and to cause the media-output device to play back media content items selected by the selected engine.
G11B 27/15 - Indexation; Adressage; Minutage ou synchronisation; Mesure de l'avancement d'une bande en utilisant une information non détectable sur le support d'enregistrement l'information provenant du mouvement du support d'enregistrement, p.ex. utilisant un tachymètre utilisant des moyens de détection mécaniques
G11B 27/28 - Indexation; Adressage; Minutage ou synchronisation; Mesure de l'avancement d'une bande en utilisant une information détectable sur le support d'enregistrement en utilisant des signaux d'information enregistrés par le même procédé que pour l'enregistrement principal
G11B 27/10 - Indexation; Adressage; Minutage ou synchronisation; Mesure de l'avancement d'une bande
G11B 27/031 - Montage électronique de signaux d'information analogiques numérisés, p.ex. de signaux audio, vidéo
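The cadence-based selection engine in the abstract above can be illustrated with a minimal sketch. It assumes that each media content item carries a tempo in beats per minute and that a match means the tempo lies within a fixed tolerance of the measured cadence; the function name `select_by_cadence` and the tolerance value are hypothetical and not taken from the source.

```python
from typing import List, Tuple


def select_by_cadence(
    tracks: List[Tuple[str, float]],  # (title, tempo in beats per minute)
    cadence_spm: float,               # measured cadence in steps per minute
    tolerance: float = 5.0,           # hypothetical matching window
) -> List[str]:
    """Return titles whose tempo is within tolerance of the cadence.

    Results are ordered from the closest tempo match to the furthest.
    """
    matches = [
        (abs(tempo - cadence_spm), title)
        for title, tempo in tracks
        if abs(tempo - cadence_spm) <= tolerance
    ]
    matches.sort(key=lambda pair: pair[0])
    return [title for _, title in matches]


library = [("A", 150.0), ("B", 172.0), ("C", 169.0), ("D", 181.0)]
selected = select_by_cadence(library, cadence_spm=170.0)
```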
A target device is associated with a source device. A system includes a target device that enters an association mode, obtains an identification code, and broadcasts the identification code. A source device receives the broadcast, obtains authorization to associate with the target device, and provides a message to an association server. The association server receives the message and associates the target device and the source device in response thereto.
A system implements a dynamic word correlated topic model (DWCTM) to model an evolution of topic popularity, word embedding, and topic correlation within a set of documents, or other dataset, that spans a period of time. For example, the DWCTM receives the set of documents and a quantity of topics for modeling. The DWCTM processes the set, computing, for each topic, various distributions to capture a popularity, word embedding, and correlation with other topics across the period of time. In other examples, the DWCTM receives a dataset of user listening sessions comprising media content items for modeling. Media content metadata (e.g., artist or genre) of the media content items, similar to words of a document, can be modeled by the DWCTM.