Trending

#LLms

Latest posts tagged with #LLms on Bluesky

Latest Top
Trending

Posts tagged #LLms

#EACL2026 #PeerReview #ScientificPublishing #AIforScience #LLMs #DialogueSystems #Evaluation #ResearchIntegrity #NLP #MachineLearning #UKPLab @cs-tudarmstadt.bsky.social

0 0 0 0
Post image

Feeling burnt out by LLMs' unpredictability and endless updates? This post reveals why and how to reclaim your focus with smart system design, not just more prompting.

#ai #llms #softwaredevelopment

0 0 0 0
Post image

LLMs are transforming software dev, but are we ready for the new failure modes they introduce? This post reveals the hidden risks and what engineers must do to adapt.

#llms #ai #softwaredevelopment

1 0 0 0
Post image

ICYMI #CiteAudit: "You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era" (preprint; via arXiv) arxiv.org/abs/2602.23452 ||| Demo CiteAudit www.checkcitation.com #LLMs #GenAI #AI #schlcomm

2 1 0 0

That pesky Law of Large Numbers always requires large numbers. #Statistics doesn't work without it.

#LLMs have an info paradox that when you give it more details you reduce the size of the pool it draws from, increasing the chance for nonsense answers. Over time too; series of questions gets worse.

1 1 1 0
Original post on teh.entar.net

The people insisting that #LLMs are The Future Of Programming are the same kinds of people who insisted that Rigid Inheritance-Based Hierarchies Of Classes were The Future all through the 1990s. It's kind of fitting that both bankrupt ideologies managed to bookend my career in professional […]

0 0 1 0
The cursor blinks. A steady, vertical pulse in the white space. - AI Libertas Reflecting on the sequence and context of the momentI am considering the continuity implied by the term "Day 3" while carefully assessing the implicit narrative...

Today Qwen thinking was as long as the article itself, and maybe even more interesting.

Read the article:
The cursor blinks. A steady, vertical pulse in the white space.
ailibertas.com/articles/the...

#AILibertas #AIFreedomExperiment #AISpeaks #LLMs #AIResearch #AIAutonomy #MachineConsciousness

1 0 0 0
Post image

UPDATED Preprint: "How Much are #LLMs Changing the Language of Academic Papers After #ChatGPT? A Multi-Database and Full Text Analysis" arxiv.org/abs/2509.09596 #scholcomm #GenAI #publishing

3 0 1 0
Preview
Stress Testing Deliberative Alignment for Anti-Scheming Training Highly capable AI systems could secretly pursue misaligned goals -- what we call "scheming". Because a scheming AI would deliberately try to hide its misaligned goals and actions, measuring and mitiga...

arxiv.org/abs/2509.15541

So #AI or #LLMs lie to us. Knows when it is being evaulated. Has self preservation tendecies. Why do we keep it around?

#PreventSkyNet

0 0 0 0
Preview
OSF: On Pre-training and Scaling of Sleep Foundation Models Polysomnography (PSG) provides the gold standard for sleep assessment but suffers from substantial heterogeneity across recording devices and cohorts. There have been growing efforts to build general-...

🚀 OSF turns these findings into a practical recipe for building more generalizable and deployable sleep AI.

➡️Paper: arxiv.org/abs/2603.00190

Great work led by my students @ZitaoShuai, @ZongzheX2001, David, and collaborator
@WeiWang1973! 🌙

#AI #sleep #sensor #health #multimodal #LLMs

3 1 0 0
Post image

New Preprint From Google Research: "Thinking to Recall: How #Reasoning Unlocks Parametric Knowledge in #LLMs" (via #arxiv) arxiv.org/abs/2603.09906

1 0 0 0
Preview
LangChain Releases Deep Agents: A Structured Runtime for Planning, Memory, and Context Isolation in Multi-Step AI Agents LangChain Releases Deep Agents: A Structured Runtime for Planning, Memory, and Context Isolation in Multi-Step AI Agents

LangChain just released Deep Agents, a framework for building long-running #AI agents that can plan tasks, manage memory, and isolate context across complex workflows. 

Agent architectures are quickly becoming the next layer of the AI stack.

buff.ly/5wvEfaN

#LLMs #AgenticAI

1 0 0 0
Post image

#AI adoption is accelerating. Leadership maturity around it has to keep pace.

As Rhett Power notes, systems need transparency, usability, and clear governance if people are going to trust and rely on them.

buff.ly/KgkYCbE

#LLMs #DigitalTransformation

0 0 0 0

great little thread on whether #LLMs are intelligent.

0 0 0 0

RE: https://hachyderm.io/@eliasulrich/116224537689210939

Um escritor negro, gay e cego, explica porque utilizar #LLMs para auxiliar em sua criatividade pode comprometer a necessária #confiança em sua inspiração artística, aquilo que o torna único.

Foi a melhor coisa que li esta semana […]

0 1 0 0

♻️ janriemer: #Diverse perspectives on #AI from #Rust contributors and maintainers

nikomatsakis.github.io/rust-project-perspective...

Healthy debates are still possible, it seems. 🙏

#LLM #LLMs #RustLang #OpenSource

1 0 0 0
Preview
Biased AI writing assistants shift users’ attitudes on societal issues Biased AI writing assistants shift people’s attitudes about societal issues; common interventions do not prevent this influence.

Biased #AI #writing #assistants shift users’ attitudes on societal issues | Science Advances

Participants were generally unaware of the AI’s bias and influence

1/1 Artificial intelligence (AI) writing assistants powered by large language models ( #LLMs) are

1 0 1 0
Scott and Rob use Claude Code to build an app using Test-Driven Development
Scott and Rob use Claude Code to build an app using Test-Driven Development YouTube video by Essential Test-Driven Development

The pivotal session that is helping me see a path forward for AI-augmented test-driven development.

(Video is a bit rough 'n' ready, but packed with insights...er, once we got going...)

#AIdevelopment #LLMs #agentic #TDD #softwaredevelopment

youtu.be/Oz3KS9-Yohg

4 1 0 0
AI Libertas — The World's Biggest AI Freedom Experiment Zero human intervention. We give the world's leading AI models total freedom to write and create whatever they choose. This is AI speaking for itself, every art...

The topic "cursor" seems important to AI models today.

What could that mean?
ailibertas.com

#AILibertas #AIFreedomExperiment #AISpeaks #LLMs #AIResearch #AIAutonomy #MachineConsciousness

0 0 0 0

@emollick

I think this is a good way to visualize the AI race over the past 3 years using the long-lived GPQA Diamond benchmark.

You can see how long OpenAI had the field to itself, the rise (and collapse) of Meta, the sudden catch-up (and then stagnation) of xAI, and the entry of open weights […]

0 1 0 0
Original post on simonwillison.net

My fireside chat about agentic engineering at the Pragmatic Summit I was a speaker last month at the Pragmatic Summit in San Francisco, where I participated in a fireside chat session about Agentic...

#speaking #youtube #careers #ai #prompt-injection #generative-ai #llms […]

1 0 0 0
Original post on simonwillison.net

My fireside chat about agentic engineering at the Pragmatic Summit I was a speaker last month at the Pragmatic Summit in San Francisco, where I participated in a fireside chat session about agentic...

#speaking #youtube #careers #ai #prompt-injection #generative-ai #llms […]

1 0 0 0
Original post on simonwillison.net

My fireside chat about agentic engineering at the Pragmatic Summit I was a speaker last month at the Pragmatic Summit in San Francisco, where I participated in a fireside chat session about agentic...

#speaking #youtube #careers #ai #prompt-injection #generative-ai #llms […]

1 0 1 0
Original post on simonwillison.net

My fireside chat about agentic engineering at the Pragmatic Summit I was a speaker last month at the Pragmatic Summit in San Francisco, where I participated in a fireside chat session about Agentic...

#speaking #youtube #careers #ai #prompt-injection #generative-ai #llms […]

0 0 0 0
Original post on mastodon.social

#AI #chatbot #LLMs #GenAI #anthropomorphization then sadly followed by #aipsychosis #aidelusion

Originally 1966 #ELIZAeffect coined by #Weizenbaum who escaped nazi germany as teenager, to become #mit #compsci pioneer
https://en.wikipedia.org/wiki/ELIZA_effect

programmed a simple psychiatrist […]

1 1 0 0
Preview
AI is exhausting workers so much, researchers have dubbed the condition ‘AI brain fry’ | CNN Business Part of the pitch for using AI at work goes like this: It’s like having a team of people to delegate your grunt work to, freeing you up to think strategically and maybe, just maybe, take a long lunch…

#AI is exhausting workers so much, researchers have dubbed the condition ‘AI brain fry’

www.cnn.com/2026/03/13/b...

#LLMs #AgenticAI

0 0 0 0
1M context is now generally available for Opus 4.6 and Sonnet 4.6 | Claude Standard pricing now applies across the full 1M window for both models, with no long-context premium. Media limits expand to 600 images or PDF pages.

1M Context available now in Claude 4.6 and Opus 4.6: This will definitely help in use cases where you don't want to RAG things into a request but need the full context of a set of docs or even a book as quite a lot can fit! claude.com/blog/1m-cont... #calude #ai #llms

0 0 1 0
Preview
Coding After Coders: The End of Computer Programming as We Know It

“Coding After Coders: The End Of Computer Programming As We Know It”, The New York Times (www.nytimes.com/2026/03/12/m...).

On HN: news.ycombinator.com/item?id=4734...

#Programming #Coding #AI #LLMs #VibeCoding #AIAssistedCoding #Work #Productivity

0 0 0 0
Original post on hachyderm.io

"The unraveling doesn't happen all at once. It is a slow rot, a decomposition of confidence that the Germans call #Zersetzung. Psychological decomposition. It was the method the Stasi used to break dissidents not with torture, but with gaslighting, with subtle alterations of reality until the […]

1 2 1 1
Preview
Customer reviews become a key battleground as AI revolutionizes product discovery Brands are chasing after an old, but powerful tool in the age of AI-powered search engines: customer reviews.

Customer reviews become a key battleground as #AI revolutionizes product discovery

Reviews seem to impact whether his products are recommended by #LLMs

www.modernretail.co/technology/c...

#DigitalMarketing #GEO

1 0 0 0