Research, standards and thoughts for the digital world

Earlier posts by categories:

MPAI MPEG ISO

Imperceptibility, Robustness, and Computational Cost in Neural Network Watermarking

Introduction Research efforts, specific skills, training and processing can cumulatively bring the development costs of a neural network anywhere from a few thousand to a few hundreds of thousand dollars. Therefore, the AI industry needs a technology to ensure traceability and integrity not only of a neural network but also of the content generated by it (so-called inference). Faced with a similar problem, the digital content production and distribution industry has considered watermarking as a tool to insert a payload…

Continue ReadingImperceptibility, Robustness, and Computational Cost in Neural Network Watermarking

The MPAI 2022 Calls for Technologies – Part 3 (Neural Network Watermarking)

Research, personnel, training and processing can bring the development costs of a neural network anywhere from a few thousand to a few hundreds of thousand dollars. Therefore, the AI industry needs a technology to ensure traceability and integrity not only of a neural network, but also of the content generated by it (so-called inference). The content industry facing a  similar problem, has used watermarking to imperceptibly and persistently insert a payload carrying, e.g., owner ID, timestamp, etc. to signal the ownership of a content item. Watermarking…

Continue ReadingThe MPAI 2022 Calls for Technologies – Part 3 (Neural Network Watermarking)

The MPAI 2022 Calls for Technologies – Part 2 (Multimodal Conversation)

Processing and generation of natural language is an area where artificial Intelligence is expected to make a difference compared to traditional technologies. Version 1 of the MPAI Multimodal Conversation standard (MPAI-MMC V1), specifically the Conversation with Emotion use case, has addressed this and related challenges: processing and generation not only of speech but also of the corresponding human face when both convey emotion. The audio and video produced by a human conversing with the machine represented by the blue box…

Continue ReadingThe MPAI 2022 Calls for Technologies – Part 2 (Multimodal Conversation)

August 1992 – the gates open to global digital television

In this article, I continue the tradition of reporting on major MPEG events when one of them comes to an anniversary. This time the anniversary concerns the word “profile”. In July 1990, when the MPEG-1 standard was far from done (it would only be approved in November 1992), a diverse group of individuals attended the first MPEG-2 session, brainstorming on requirements for the "second phase of MPEG work" as MPEG-2 was warily called at that time. The engine took time…

Continue ReadingAugust 1992 – the gates open to global digital television

MPAI issues a Call for Patent Pool Administrator on behalf of the MPAI-CAE and MPAI-MMC patent holders

Geneva, Switzerland – 23 March 2022. Today the international, non-profit, unaffiliated Moving Picture, Audio and Data Coding by Artificial Intelligence (MPAI) standards developing organisation has concluded its 18th General Assembly. Among the outcomes is the publication of Call for Patent Pool Administrators for two of its approved Technical Specifications. The MPAI process of standard development prescribes that Active Principal Members, i.e., those intending to participate in the development of a Technical Specification, adopt a Framework Licence before initiating the development.…

Continue ReadingMPAI issues a Call for Patent Pool Administrator on behalf of the MPAI-CAE and MPAI-MMC patent holders

With 5 standards approved, MPAI enters a new phase

Geneva, Switzerland – 26 January 2022. Today the Moving Picture, Audio and Data Coding by Artificial Intelligence (MPAI) standards developing organisation has concluded its 16th General Assembly, the first of 2022, approving its 2022 work program. The work program includes the development of reference software, conformance testing and performance assessment for 2 application standards (Context-based Audio Enhancement and Multimodal Conversation), reference software, conformance assessment for 1 infrastructure standard (AI Framework), and the establishment of the MPAI Store, a non-profit foundation…

Continue ReadingWith 5 standards approved, MPAI enters a new phase