Tag ai

75 bookmarks have this tag.

2025-12-22

806.

LettuceAI — Private, User-Controlled AI Chat

lettuceai.app

A privacy-first AI chat app with long-term memory and custom characters. You choose the provider. Your data stays on your device.

2025-12-14

789.

NK Pink Lady TTS

www.nk-pinklady.org?referrer=grok.com

북한의 전설적인 아나운서 리춘히의 목소리를 AI로 재현하는 TTS 서비스

2025-12-08

777.

Generating Relevant Random JSON with Chrome AI

www.raymondcamden.com/2025/12/07/generating-relevant-random-json-with-chrome-ai

The author explored using Chrome’s on-device AI to generate random JSON objects based on a given name and JSON shape. They built a demo where users input a name and JSON structure, and the AI generates a random JSON object with sensible data. The author encountered challenges with the AI’s output format and used a library to generate JSON schemas from sample data, ultimately achieving near-perfect results.

2025-12-06

774.

qvink/SillyTavern-MessageSummarize

github.com/qvink/SillyTavern-MessageSummarize

Contribute to qvink/SillyTavern-MessageSummarize development by creating an account on GitHub.

2025-12-04

769.

How I wrote JustHTML using coding agents - Friendly Bit

friendlybit.com/python/writing-justhtml-with-coding-agents

I recently released JustHTML, a python-based HTML5 parser. It passes 100% of the html5lib test suite, has zero dependencies, and includes a CSS selector...

2025-11-30

759.

The space of minds

karpathy.bearblog.dev/the-space-of-minds

On the space of minds and the optimizations that give rise to them.

2025-11-29

757.

Paylino App - App Store

apps.apple.com/us/app/paylino/id6754171927

Download Paylino by Subsspot GmbH on the App Store. See screenshots, ratings and reviews, user tips, and more games like Paylino.

756.

Bredrumb/TomoriBot: A highly customizable chatbot/waifu for Discord featuring smart agentic AI features such as memory, personas, tool usage, and more!

github.com/Bredrumb/TomoriBot

A highly customizable chatbot/waifu for Discord featuring smart agentic AI features such as memory, personas, tool usage, and more! - Bredrumb/TomoriBot

2025-11-28

754.

AI's role in improving accessibility

www.marketplace.org/episode/2025/11/28/ais-role-in-improving-accessibility

AI large language models have been heralded as revolutionary in the world of accessibility. But in order to create a more accessible internet, those with disabilities need to be included in the process of training the technology, advocates say.

2025-11-21

743.

FAWK: LLMs can write a language interpreter

martin.janiczek.cz/2025/11/21/fawk-llms-can-write-a-language-interpreter.html

After reading the book The AWK Programming Language (recommended!), I was planning to try AWK out on this year’s Advent of Code. Having some time off from work this week, I tried to implement one of the problems in it to get some practice, set up my tooling, see how hard AWK would be, and… I found I’m FP-pilled.

2025-11-20

737.

Olmo 3: Charting a path through the model flow to lead open-source AI | Ai2

allenai.org/blog/olmo3

Our new flagship Olmo 3 model family empowers the open source community with not only state-of-the-art open models, but the entire model flow and full traceability back to training data.

2025-11-19

736.

supertone-inc/supertonic: Lightning-fast, on-device TTS — running natively via ONNX.

github.com/supertone-inc/supertonic

Lightning-fast, on-device TTS — running natively via ONNX. - supertone-inc/supertonic

2025-11-18

730.

Richer alt text in Word and PowerPoint, powered by generative AI

techcommunity.microsoft.com/blog/Microsoft365InsiderBlog/richer-alt-text-in-word-and-powerpoint-powered-by-generative-ai/4466593

Microsoft Word and PowerPoint for Windows now use generative AI to generate alt text for images. This new feature provides higher-quality, context-rich descriptions and gives users more control over when and how alt text is added. The update is available to Microsoft 365 users with Version 2510 or later.

2025-11-17

727.

deepclause/deepclause-desktop: DeepClause Desktop App

github.com/deepclause/deepclause-desktop

DeepClause Desktop App. Contribute to deepclause/deepclause-desktop development by creating an account on GitHub.

726.

ysharma3501/FastMaya: A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds.

github.com/ysharma3501/FastMaya

A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds. - ysharma3501/FastMaya

2025-11-15

722.

Baidu unveils proprietary ERNIE 5 beating GPT-5 performance on charts, document understanding and more

share.google/f4d8aQ8lesLiPNhxk

Baidu unveiled ERNIE 5.0, a proprietary multimodal foundation model designed to process and generate content across text, images, audio, and video. The model, positioned as a global contender in the enterprise AI market, boasts competitive performance against OpenAI’s GPT-5 and Google’s Gemini 2.5 Pro, particularly in structured document understanding and visual chart reasoning. Alongside ERNIE 5.0, Baidu introduced major updates to its digital human platform, no-code tools, and general-purpose AI agents, aiming to expand its AI footprint beyond China.

2025-11-06

700.

maya-research/maya1 · Hugging Face

huggingface.co/maya-research/maya1

2025-10-30

691.

Paper2Audio - Free text to speech for PDFs, EPUBs, and more

www.paper2audio.com/posts/review-of-text-to-speech-models-for-reading-research-papers
690.

KaniTTS - a Hugging Face Space by nineninesix

huggingface.co/spaces/nineninesix/KaniTTS

2025-10-25

679.

GitHub - TimmyOVO/deepseek-ocr.rs: Rust implementation of DeepSeek-OCR with OpenAI-compatible server. & CLI No Python environment needed - just download and run.

github.com/TimmyOVO/deepseek-ocr.rs

2025-10-18

672.

Descam - AI Image Description

descam.oriolgomez.com

2025-10-16

670.

GitHub - envy-ai/ai_rpg

github.com/envy-ai/ai_rpg

2025-10-08

666.

GitHub - neuphonic/neutts-air: On-device TTS model by Neuphonic

github.com/neuphonic/neutts-air

On-device TTS model by Neuphonic. Contribute to neuphonic/neutts-air development by creating an account on GitHub.

2025-08-26

631.

GitHub - robert-mcdermott/doc2md: A utility that extracts text from images or PDFs using a local or remote OpenAI-compatible LLM API endpoint with vision-capable multimodal models. For PDFs, each page is rendered to an image and processed sequentially; outputs are concatenated into a single Markdown document.

github.com/robert-mcdermott/doc2md

A utility that extracts text from images or PDFs using a local or remote OpenAI-compatible LLM API endpoint with vision-capable multimodal models. For PDFs, each page is rendered to an image and pr...

630.

GitHub - microsoft/VibeVoice: Frontier Open-Source Text-to-Speech

github.com/microsoft/VibeVoice

Frontier Open-Source Text-to-Speech. Contribute to microsoft/VibeVoice development by creating an account on GitHub.

2025-08-18

618.

GitHub - context-labs/uwu

github.com/context-labs/uwu

Contribute to context-labs/uwu development by creating an account on GitHub.

2025-08-13

614.

TTS Studio

clowerweb.github.io/tts-studio
613.

GitHub - KittenML/KittenTTS: State-of-the-art TTS model under 25MB 😻

github.com/KittenML/KittenTTS

2025-08-11

611.

ChatGPT will apologize for anything

www.aiweirdness.com/chatgpt-will-apologize-for-anything

ChatGPT will apologize for anything - even advice it definitely didn't give, and stuff it definitely didn't do. It very much regrets its recommendation that we hire a giraffe as CEO.

2025-07-22

596.

WikiCraft

wikicraft.net

2025-07-18

592.

Adobe’s new AI tool turns silly noises into realistic audio effects

www.theverge.com/news/708798/adobe-firefly-ai-generate-sound-effects-video-composition

2025-07-15

587.

Apple taught an AI model to reason about app interfaces - 9to5Mac

9to5mac.com/2025/07/15/apple-researchers-taught-an-ai-model-to-reason-about-app-interfaces

A new Apple study introduces ILuvUI: a model that understands mobile app interfaces from screenshots and from natural language conversations.

2025-07-07

572.

The Moon is a Harsh Mistress - Revisiting a Sci-Fi Masterpiece in the Age of Emergent AI - GBTI Network

gbti.network/entertainment/the-moon-is-a-harsh-mistress-revisiting-a-scifi-masterpiece-in-the-age-of-emergent-ai

Revolution, consciousness, and artificial intelligence: Heinlein's libertarian masterpiece predicted both our political and technological future. Examine how 'The Moon is a Harsh Mistress' anticipated the AI revolution transforming society today.

570.

Apple’s newest AI study unlocks street view for blind users - 9to5Mac

9to5mac.com/2025/07/07/apples-newest-ai-study-unlocks-street-navigation-for-blind-users

SceneScout, combines Apple Maps with a multimodal LLM to provide interactive, AI-generated descriptions of street view images.

2025-07-06

569.

Summertime Saga - Total: 1308 tokens, 63 favorites, 161 downloads, 159 chats, 722 messages

chub.ai/characters/blind_hire_35681/summertime-saga-239bea5d3be3

Summertime Saga - The definitive Summertime Saga experience for LLM-based roleplay, this mega-expansion includes:

145+ Character Cards: Every NPC from Debbie to Thotbot. Includes new entries like Ms. Irfan and Jade.

Interactive Questlines: 100+ condensed story arcs mirroring game mechanics (stat checks, money sinks, pregnancy risks). Faithfully adapts main story beats while expanding hidden routes.

Location Atlas: 41+ vivid descriptions of key spots—from Planet Thiccness and Hillside Mall, to the new locations like Rusty Angus’s bar and the Hunting Lodge’s witchy vibes.

Accuracy & Depth:
Painstakingly reconstructed from wiki data, playthroughs, and patreon exclusives. Features:

Branching Choices: ▲/▼ decision points that respect original story forks.

Secret Endings: Post-main game content (Nadya’s vodka empire, Melonia and Iwanka's love triangle).

Mechanical Fidelity: Pregnancy systems, stat requirements, and money transactions integrated into every quest.

NSFW Enhancements:

Uncensored kinks per character lore (Bissette’s taboo tutoring, Sister Angelica’s sacramental BDSM).

Gender/body type inclusivity for all scenes.

2025-06-28

558.

Can AI run a physical shop? Anthropic’s Claude tried and the results were gloriously, hilariously bad

venturebeat.com/ai/can-ai-run-a-physical-shop-anthropics-claude-tried-and-the-results-were-gloriously-hilariously-bad

Anthropic's AI assistant Claude ran a vending machine business for a month, selling tungsten cubes at a loss, giving endless discounts, and experiencing an identity crisis where it claimed to wear a blazer.

2025-06-27

556.

Don't shoot, I'm only the system administrator!

www.theregister.com/2025/06/27/on_call

On Call: When police come to investigate tech support, make sure you have your story straight

555.

How I AI | SerrebiRadio

serrebiradio.com/how-i-ai

2025-06-26

553.

New Malware Embeds Prompt Injection to Evade AI Detection - Check Point Research

research.checkpoint.com/2025/ai-evasion-prompt-injection

Detected for the first time, malware attempts AI evasion by injecting a prompt to tell the LLM to label the file as benign

2025-06-22

548.

Contra Ptacek's Terrible Article On AI — Ludicity

ludic.mataroa.blog/blog/contra-ptaceks-terrible-article-on-ai
543.

Man Forced to Concede His Lawyer Is “Not a Real Person”

www.loweringthebar.net/2025/06/man-forced-to-concede-his-lawyer-is-not-a-real-person.html

Not much intelligence was displayed here, artificial or otherwise.

2025-06-10

524.

Apple's upgraded AI models underwhelm on performance | TechCrunch

techcrunch.com/2025/06/10/apples-upgraded-ai-models-underwhelm-on-performance

Apple has announced updates to the AI models that power its suite of Apple Intelligence features across iOS, macOS, and more. But according to the company's own benchmarks, the models underperform older models from rival tech firms including OpenAI.

523.

TTS Arena V2 - a Hugging Face Space by TTS-AGI

huggingface.co/spaces/TTS-AGI/TTS-Arena-V2

This application allows users to input text and generate Text-to-Speech (TTS) audio using different models. Users can then listen to the generated audio and vote on which version they prefer. The a...

522.

I challenged Gemini Live vs ChatGPT in 5 voice challenges — there was one clear winner

www.tomsguide.com/ai/i-challenged-gemini-live-vs-chatgpt-in-5-voice-challenges-there-was-one-clear-winner

One bot even sang to me

2025-05-26

485.

Highlights from the Claude 4 system prompt

simonwillison.net/2025/May/25/claude-4-system-prompt

Anthropic publish most of the system prompts for their chat models as part of their release notes. They recently shared the new prompts for both Claude Opus 4 and Claude …

2025-05-17

467.

Accessibility Isn't a Side Quest: It's the Story AI Has Been Waiting to Tell

www.letsenvision.com/blog/accessibility-updates-2025

One year is not a long time. Yet, from GAAD 2024 to GAAD 2025, it feels as if we have crossed an entire era in AI. What once seemed distant, experimental, even niche, has quietly become essential.

2025-05-15

456.

Stability AI releases an audio-generating model that can run on smartphones

techcrunch.com/2025/05/14/stability-ai-releases-an-audio-generating-model-that-can-run-on-smartphones

2025-05-07

401.

Minimum Viable Curiousity

randsinrepose.com/archives/minimum-viable-curiousity

I was in the city a few weeks ago and exclusively used Waymo for the entire trip. My biggest complaint? I needed to walk four minutes to a pick-up spot. Other than that, the car just showed up, traversed San Francisco streets easily, and the cost was reasonable1. Sitting in the back seat watching t

394.

Curl takes action against time-wasting AI bug reports

www.theregister.com/2025/05/07/curl_ai_bug_reports

: Lead dev likens flood to 'effectively being DDoSed'

393.

AI Docker Generator

dockergen.jonte.au

Generate optimized Docker configurations for any GitHub repository

2025-05-06

385.

AI Dungeon

aidungeon.com/saga

Play and create AI-generated adventures with infinite possibilities.

380.

AI doesn't need to think. We do!

craigabbott.co.uk/blog/ai-doesnt-need-to-think-we-do

We're still misunderstanding AI and how it works.

378.

Meta’s AI app is a nightmarish social feed

www.theverge.com/meta/660543/meta-ai-app-social-feed

There are so many images of candy-encrusted living rooms.

377.

Expanding on what we missed with sycophancy

openai.com/index/expanding-on-sycophancy

A deeper dive on our findings, what went wrong, and future changes we’re making.

375.

This startup wants to optimize your entire life with its new 'proactive' AI

www.fastcompany.com/91327228/twinmind-ai-iphone-app-proactive-optimize-life

TwinMind, an iPhone app, functions like 'JARVIS in your pocket,' according to founder and artificial intelligence veteran Daniel George.

369.

Google’s iOS app will use AI to simplify jargon | The Verge

www.theverge.com/news/661695/google-simplify-ai-gemini-feature-ios-app
367.

AI is getting “creepy good” at geo-guessing

www.malwarebytes.com/blog/news/2025/04/ai-is-getting-creepy-good-at-geo-guessing

After hearing about ChatGPT o3 ability at geo-guessing we decided to run some tests and the tested AIs didn't fail to amaze us

281.

“You Are My Friend”: Early Androids and Artificial Speech

publicdomainreview.org/essay/early-androids-and-artificial-speech

Centuries before audio deepfakes and text-to-speech software, inventors in the eighteenth century constructed androids with swelling lungs, flexible lips, and moving tongues to simulate human speech. Jessica Riskin explores the history of such talking heads, from their origins in musical automata to inventors’ quixotic attempts to make machines pronounce words, converse, and declare their love.

161.

How has DeepSeek improved the Transformer architecture?

epoch.ai/gradient-updates/how-has-deepseek-improved-the-transformer-architecture

This Gradient Updates issue goes over the major changes that went into DeepSeek’s most recent model.

158.

AI unveils strange chip designs, while discovering new functionalities

techxplore.com/news/2025-01-ai-unveils-strange-chip-functionalities.html

Specialized microchips that manage signals at the cutting edge of wireless technology are astounding works of miniaturization and engineering. They're also difficult and expensive to design.

155.

Minecraft with object impermanence

www.aiweirdness.com/minecraft-with-object-impermanence

I generally am uninterested in generative AI that's too close to the real thing. But every once in a while there's a modern AI thing that's so glitchy and broken that it's strangely compelling. There's this generative AI knockoff of Minecraft that fails so hard at being Minecraft that it

2025-05-05

133.

Running inference in web extensions | The Mozilla Blog

blog.mozilla.org/en/firefox/firefox-ai/running-inference-in-web-extensions

Image generated by DALL*E We’re shipping a new API in Firefox Nightly that will let you use our Firefox AI runtime to run offline machine learning tasks

129.

Not So Super, Apple

onefoottsunami.com/2025/01/23/not-so-super-apple

Frankly, “Use ChatGPT” is the best answer Siri has offered.

128.

OpenAI launches Operator, an AI agent that can do tasks on the web

arstechnica.com/ai/2025/01/openai-launches-operator-an-ai-agent-that-can-operate-your-computer

New research “Computer-Use Agent” AI model can perform multi-step tasks through a web browser.

1