Research, standards and thoughts for the digital world

Earlier posts by categories:

MPAI MPEG ISO

The MPAI 2022 Calls for Technologies – Part 3 (Neural Network Watermarking)

Research, personnel, training and processing can bring the development costs of a neural network anywhere from a few thousand to a few hundreds of thousand dollars. Therefore, the AI industry needs a technology to ensure traceability and integrity not only of a neural network, but also of the content generated by it (so-called inference). The content industry facing a  similar problem, has used watermarking to imperceptibly and persistently insert a payload carrying, e.g., owner ID, timestamp, etc. to signal the ownership of a content item. Watermarking…

Continue ReadingThe MPAI 2022 Calls for Technologies – Part 3 (Neural Network Watermarking)

Answering a few basic questions about MPAI

  • Post author:
  • Post category:MPAI

Q: What is the main objective of MPAI? A: There are many languages in the world, but if we want to reach all interested people we have to use a universally recognised language. The same thing happens with data: the more people understand the format the more value they have. The definition of a universal language for music – the MP3 standard – led to a revolution that many – because they did not know the world before – cannot…

Continue ReadingAnswering a few basic questions about MPAI

The MPAI 2022 Calls for Technologies – Part 2 (Multimodal Conversation)

Processing and generation of natural language is an area where artificial Intelligence is expected to make a difference compared to traditional technologies. Version 1 of the MPAI Multimodal Conversation standard (MPAI-MMC V1), specifically the Conversation with Emotion use case, has addressed this and related challenges: processing and generation not only of speech but also of the corresponding human face when both convey emotion. The audio and video produced by a human conversing with the machine represented by the blue box…

Continue ReadingThe MPAI 2022 Calls for Technologies – Part 2 (Multimodal Conversation)

MPAI adds documents and clarification to its currently open three Calls for Technologies

  • Post author:
  • Post category:MPAI

Geneva, Switzerland – 23 August 2022. Today the international, non-profit, unaffiliated Moving Picture, Audio and Data Coding by Artificial Intelligence (MPAI) standards developing organisation has concluded its 23rd General Assembly (MPAI-23). Among the outcomes are three documents produced to facilitate the task of drafting a response to the currently open Calls for Technologies and one document that will facilitate the identification and positioning of the technologies defined in the Multimodal Conversation Use Cases and Functional Requirements V2. MPAI-23 has also…

Continue ReadingMPAI adds documents and clarification to its currently open three Calls for Technologies

The MPAI 2022 Calls for Technologies – Part 1 (AI Framework)

  • Post author:
  • Post category:MPAI

A foundational element of the MPAI architecture is the fact that monolithic AI applications have some characteristics that make them undesirable. For instance, they are single-use, i.e., it is hard to reuse technologies used by the application in another application and they are obscure, i.e., it is hard to understand why a machine has produced a certain output when subjected to a certain input. The first characteristic means that it is hard to make complex applications because an implementer must…

Continue ReadingThe MPAI 2022 Calls for Technologies – Part 1 (AI Framework)

August 1992 – the gates open to global digital television

In this article, I continue the tradition of reporting on major MPEG events when one of them comes to an anniversary. This time the anniversary concerns the word “profile”. In July 1990, when the MPEG-1 standard was far from done (it would only be approved in November 1992), a diverse group of individuals attended the first MPEG-2 session, brainstorming on requirements for the "second phase of MPEG work" as MPEG-2 was warily called at that time. The engine took time…

Continue ReadingAugust 1992 – the gates open to global digital television