#MLLM

Latest posts tagged with #MLLM on Bluesky

Original post on mastodon.social

RE: https://dobbs.town/@hobbs/116032781720531564

dear #lazyweb
hit me with your favorite RSS feeds for #homelab #selfhosting #linux #opensource #computing #programming #computerscience #cpu #microarchitecture #electronics #robotics #ai #llm #vlm #mllm #cognitivescience #consciousness […]

Apple's latest AI research introduces DeepMMSearch-R1 and Manzano, revolutionizing image understanding and generation. Explore how these advancements enhance user interactions. #AppleAI #MLLM #TechInnovation Link: thedailytechfeed.com/apple-advanc...

UniGen 1.5: Apple's AI model that can see, create, and edit images
Apple's AI research team presents UniGen 1.5: a unified MLLM for advanced image understanding, generation, and editing.

#MLLM #ML #AI

mecambioamac.com/unigen-1-5-e...

New AI Model Is Shockingly Good at “Reading” Human Minds
A new AI model is demonstrating an unprecedented ability to anticipate human actions by interpreting visual and contextual cues in real time. Rather than simply reacting to movement, the system reason...

#AI #MindReading #MLLM #LLM #ReadingHumanBehaviour
scitechdaily.com/new-ai-model...

The future of local food is data-smart and community-strong.
Read the full JAFSCD article for free, as always: doi.org/10.5304/jafs...

#FarmersMarkets #LocalFood #AIResearch #FoodSystems #MLLM #DataScience #MichiganState

Cambrian-S: The MLLM that masters spatial cognition and predictive vision in video
YouTube video by En la mente de la máquina, Inteligencia Artificial

The future of AI (#IA) in video is here! 🚀 Cambrian-S redefines video understanding with #SpatialSupersensing. It uses 'surprise' to master memory and counting in continuous video (VSI-SUPER), outperforming long-context models. youtu.be/t34IOLsDioY #MLLM

Why AI Sucks At Telling Time...
YouTube video by Nick Espinosa

Why AI Sucks At Telling Time... and why this should concern us for autonomous vehicles and more.

#News #TechNews #AI #MLLM #AIlimitations #SelfDriving #MedTech

Daily podcast: Why AI Sucks At Telling Time... and why this should concern us for autonomous vehicles and more.

#News #TechNews #AI #MLLM #AIlimitations #SelfDriving #MedTech #podcast

soundcloud.com/nickaesp/acr

DeepMMSearch-R1: The MLLM that outperforms GPT in multimodal web search and VQA
YouTube video by En la mente de la máquina, Inteligencia Artificial

An AI revolution! DeepMMSearch-R1 is the first #MLLM to outperform #GPT in multimodal web search and #VQA. It uses self-correction and image-cropping search for real-time knowledge. The future of web search (#BúsquedaWeb) has arrived! #DeepMMSearchR1 youtu.be/7MapdFGHl1o

New Study Finds Vision Representation Predicts MLLM Performance

A new study reports that vision representation can predict performance of multimodal large language models. Read more: getnews.me/new-study-finds-vision-r... #vision #mllm

HoloV Introduces Holistic Visual Token Pruning for Efficient MLLMs

HoloV cuts visual tokens by about 89% yet keeps roughly 95.8% of LLaVA-1.5’s original accuracy, offering faster, lower-memory multimodal inference. Read more: getnews.me/holov-introduces-holisti... #holov #mllm #llava1.5

VER‑Bench Introduces Fine‑Grained Visual Evidence Evaluation for MLLMs

VER‑Bench adds a visual evidence reasoning benchmark with 374 questions, each using clues that cover just 0.25% of an image. MLLMs lose performance on these fine‑grained tasks. Read more: getnews.me/ver-bench-introduces-fin... #verbench #mllm #ai

Original post on hcommons.social

At the @bifold.berlin conference "AI-based methods in the humanities", I have just attended a great talk by Seid Muhie Yimam of Hamburg University who confirmed my impression that there is a kind of momentum in this area at the moment. He mentioned many datasets, publications and shared tasks on […]

Efficient MLLM Evaluation with a Multi‑to‑One Interview Approach

A two‑stage interview framework for Multi‑Modal LLMs boosts evaluation efficiency, delivering up to 17.6% higher Pearson and 16.7% higher Spearman correlation while using fewer questions. Read more: getnews.me/efficient-mllm-evaluatio... #mllm #ai

Multimodal LLMs Boost AI Assistance for Diabetic Retinopathy Screening

GPT‑4o reached AUROC 0.96 on diabetic retinopathy screening using MedGemma’s text outputs; MedGemma had higher baseline sensitivity on IDRiD and Messidor‑2 datasets. Read more: getnews.me/multimodal-llms-boost-ai... #diabeticretinopathy #mllm

Original post on hcommons.social

The section on #GlobalHistory from a global perspective is currently discussing the limitations of LLMs for "low-resourced" languages. A lot is happening there, though: not at OpenAI, Google, Meta & Co., but elsewhere. I'll look for more links later; for now this one will have to do […]

The image illustrates an architecture for a large language model, highlighting the Task-Adaptive Gated Router component. It features connections between text and vision tokens, a ViT encoder, and 3D position encoding. Examples demonstrate how the gated router activates based

OmniEVA: Bridging the 2D–3D Gap in Embodied AI

New paper introduces OmniEVA, a versatile embodied planner that pushes the boundaries of multimodal large language models (MLLMs) for robotics and spatial reasoning.

Results: OmniEVA achieves state-of-the-art […]

[Original post on det.social]

Image from article in Radiology: Artificial Intelligence

Report presents #cybersecurity challenges posed by #LLMs in health care and strategies for mitigation https://doi.org/10.1148/ryai.240739 @alitejanimd.bsky.social #MLLM #VLM #AI

Image from article in Radiology: Artificial Intelligence

Cybersecurity risks associated with LLMs must be assessed carefully before deploying LLMs in health care https://doi.org/10.1148/ryai.240739 @alitejanimd.bsky.social #MLLM #MLM #AI

Image from article in Radiology: Artificial Intelligence

Special report on #cybersecurity threats and mitigation strategies for #LLMs in health care https://doi.org/10.1148/ryai.240739 @alitejanimd.bsky.social #MLLM #MLM #VLM

Image from article in Radiology: Artificial Intelligence

Cybersecurity Threats and Mitigation Strategies for Large Language Models in Health Care https://doi.org/10.1148/ryai.240739 @alitejanimd.bsky.social #MLLM #VLM #cyberattack

Image from article in Radiology: Artificial Intelligence

Cybersecurity Threats and Mitigation Strategies for Large Language Models in Health Care https://doi.org/10.1148/ryai.240739 @alitejanimd.bsky.social #cybersecurity #MLLM #ML

See you in Vienna! #ACL2025!

(6/6)

#MLLM #AISafety #Jailbreak #Multimodal #ConInstruction #ACL2025 #LLMRedTeaming #VisionLanguage #AudioLanguage #NLProc

Image from article in Radiology: Artificial Intelligence

Report presents #cybersecurity challenges posed by #LLMs in health care and strategies for mitigation https://doi.org/10.1148/ryai.240739 @alitejanimd.bsky.social #MLLM #MLM #VLM

MLLM-Based UI2Code Automation Guided by UI Layout Information
Cuiyun Gao, Fan Wu et al.
#MLLM #UI2CodeAutomation #UserInterfaceDesign

We wish Jonathan insightful discussions and a great experience at this remarkable venue!
(3/3)

#UKPLab #KULeuven #MLLM #ExplainableAI #ScientificVisualizations #KVAB #Brussels #NLProc

🧵1/10 Excited to share our #SIGGRAPH paper "MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills" 🌟
We explore how to make MLLMs operation-aware by solving visual puzzles and propose a procedural framework for image retouching
#MLLM

Open-source #a.i.: what happens when multimodal large language models (#mllm) turn the whole internet into your private #metaverse.

Research Scientist available to relocate globally. Experienced in UI Agents, 5 years of industry experience training large models. #LLMs #MLLM #VLM

DM or email me for an intro if you're in the #Agents space and hiring - #UIAgents, Web Agents, Software test agents or similar!
