Smart city initiatives are generating vast amounts of data from sensors, cameras, mobile devices, and digital service ...
The next phase of AI, already underway, will integrate text with vision, sound, motion and even touch. This will produce systems that no longer 'read about' the world but perceive it.
MediaTek and OPPO partner to bring the multimodal Omni model and new AI features to the Dimensity 9500-powered Find X9 series ...
Teradata (NYSE: TDC) today announced new agentic and multi-modal data capabilities for Teradata Enterprise Vector Store, a unified solution that increasingly enables organizations to harness the full ...
Google faces a wrongful-death lawsuit over its Gemini chatbot, accused of pushing a user towards suicide, raising questions about AI design and legal liability.
Alibaba Qwen 3.5 Small models run offline on phones and laptops; available in 0.8B and 2B sizes, with mixed reliability on hard tasks.
This efficiency makes it viable for enterprises to move beyond generic off-the-shelf solutions and develop specialized models ...
Choosing the right method for multimodal AI—systems that combine text, images, and more—has long been trial and error. Emory ...
The study has found that with the internet’s supply of high-quality text ‘approaching exhaustion’, the next significant leap ...
In high-stakes settings like medical diagnostics, users often want to know what led a computer vision model to make a certain prediction, so they can determine whether to trust its output. Concept ...
The company trained Phi-4-reasoning-vision-15B mainly on open-source data. The data included images and text-based descriptions of the objects depicted in those images. Before it started training the ...
Google's head of Search described how multimodal LLMs help Google understand audio and video, and discussed a direction for ...