Research, standards and thoughts for the digital world

Earlier posts by categories:

MPAI MPEG ISO

Imperceptibility, Robustness, and Computational Cost in Neural Network Watermarking

Introduction Research efforts, specific skills, training and processing can cumulatively bring the development costs of a neural network anywhere from a few thousand to a few hundreds of thousand dollars. Therefore, the AI industry needs a technology to ensure traceability and integrity not only of a neural network but also of the content generated by it (so-called inference). Faced with a similar problem, the digital content production and distribution industry has considered watermarking as a tool to insert a payload…

Continue ReadingImperceptibility, Robustness, and Computational Cost in Neural Network Watermarking

Avatars and the MPAI-MMC V2 Call for Technologies

  • Post author:
  • Post category:MPAI

The goal of the MPAI Multimodal Conversation (MPAI-MMC) standard is to enable forms of human-machine conversation that emulate the human-human one in completeness and intensity. While this is clearly a long-term goal, MPAI is focusing on standards providing frameworks which break down – where possible – complex AI functions to facilitate the formation of a component market where solution aggregators can find AI Modules (called AIM) to build AI Workflows (called AIW) corresponding to standard use cases. The AI Framework…

Continue ReadingAvatars and the MPAI-MMC V2 Call for Technologies

The MPAI 2022 Calls for Technologies – Part 3 (Neural Network Watermarking)

Research, personnel, training and processing can bring the development costs of a neural network anywhere from a few thousand to a few hundreds of thousand dollars. Therefore, the AI industry needs a technology to ensure traceability and integrity not only of a neural network, but also of the content generated by it (so-called inference). The content industry facing a  similar problem, has used watermarking to imperceptibly and persistently insert a payload carrying, e.g., owner ID, timestamp, etc. to signal the ownership of a content item. Watermarking…

Continue ReadingThe MPAI 2022 Calls for Technologies – Part 3 (Neural Network Watermarking)

Answering a few basic questions about MPAI

  • Post author:
  • Post category:MPAI

Q: What is the main objective of MPAI? A: There are many languages in the world, but if we want to reach all interested people we have to use a universally recognised language. The same thing happens with data: the more people understand the format the more value they have. The definition of a universal language for music – the MP3 standard – led to a revolution that many – because they did not know the world before – cannot…

Continue ReadingAnswering a few basic questions about MPAI

The MPAI 2022 Calls for Technologies – Part 2 (Multimodal Conversation)

Processing and generation of natural language is an area where artificial Intelligence is expected to make a difference compared to traditional technologies. Version 1 of the MPAI Multimodal Conversation standard (MPAI-MMC V1), specifically the Conversation with Emotion use case, has addressed this and related challenges: processing and generation not only of speech but also of the corresponding human face when both convey emotion. The audio and video produced by a human conversing with the machine represented by the blue box…

Continue ReadingThe MPAI 2022 Calls for Technologies – Part 2 (Multimodal Conversation)

MPAI adds documents and clarification to its currently open three Calls for Technologies

  • Post author:
  • Post category:MPAI

Geneva, Switzerland – 23 August 2022. Today the international, non-profit, unaffiliated Moving Picture, Audio and Data Coding by Artificial Intelligence (MPAI) standards developing organisation has concluded its 23rd General Assembly (MPAI-23). Among the outcomes are three documents produced to facilitate the task of drafting a response to the currently open Calls for Technologies and one document that will facilitate the identification and positioning of the technologies defined in the Multimodal Conversation Use Cases and Functional Requirements V2. MPAI-23 has also…

Continue ReadingMPAI adds documents and clarification to its currently open three Calls for Technologies