<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>LLMOps Notes</title><description>Practical notes on LLMOps, AI agents, MCP, Langfuse, Cloudflare, and related experiments.</description><link>https://llm-lab.dev/</link><item><title>When You Build a Minimal API Loop, You Stop Designing Prompts and Start Designing Stop Conditions</title><link>https://llm-lab.dev/en/posts/llm-loop-engineering-minimal-api/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/llm-loop-engineering-minimal-api/</guid><description>I implemented a minimal generate-evaluate-feedback-regenerate loop in a verification script. This post organizes the stop conditions and evaluation units that actually matter when stabilizing AI output.</description><pubDate>Sun, 28 Jun 2026 00:00:00 GMT</pubDate></item><item><title>When Using OpenAI-Compatible APIs in Flue, Check the Model Specifier First</title><link>https://llm-lab.dev/en/posts/flue-openai-compatible-provider-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/flue-openai-compatible-provider-note/</guid><description>A note on getting stuck with &apos;Unknown model specifier&apos; in Flue 1.0 Beta by mixing up the actual model ID and the provider-id/model-id format.</description><pubDate>Sat, 27 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Before Growing Hermes Agent: Creating Synthetic Support Triage Scenarios</title><link>https://llm-lab.dev/en/posts/hermes-agent-002-support-triage-scenarios/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/hermes-agent-002-support-triage-scenarios/</guid><description>As a preparatory step before delegating support triage to Hermes Agent, I built three evaluation scenarios using synthetic data—fixing decision criteria and safety constraints in advance without relying on real customer data.</description><pubDate>Thu, 25 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Observing the Sakana Fugu API with Langfuse: Understanding Hidden Costs in Multi-Agent Systems</title><link>https://llm-lab.dev/en/posts/sakana-fugu-langfuse-experiment/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/sakana-fugu-langfuse-experiment/</guid><description>A hands-on report instrumenting Sakana Fugu&apos;s OpenAI-compatible API with Langfuse, measuring how latency, token consumption, and TTFT change across Level 1–3 tasks.</description><pubDate>Wed, 24 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Eve&apos;s TUI vs HTTP Event Streams: A Side-by-Side Look at Tool Calling</title><link>https://llm-lab.dev/en/posts/vercel-eve-http-stream-events/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/vercel-eve-http-stream-events/</guid><description>A hands-on log comparing how the same weather tool call looks in Vercel&apos;s Eve agent framework when observed from the TUI versus the HTTP API, separating the developer-friendly display from the integration-friendly event stream.</description><pubDate>Tue, 23 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Observing the Black Box of a Multi-Agent API with Sakana Fugu and Langfuse</title><link>https://llm-lab.dev/en/posts/sakana-fugu-langfuse-observability-plan/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/sakana-fugu-langfuse-observability-plan/</guid><description>I subscribed to Sakana Fugu to understand its nature as an OpenAI-compatible API and to plan how to observe its black-box cooperative reasoning from the outside.</description><pubDate>Tue, 23 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Streaming Flue Observe Events to Langfuse: Monitoring an Issue Triage Agent</title><link>https://llm-lab.dev/en/posts/flue-langfuse-observability-issue-triage/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/flue-langfuse-observability-issue-triage/</guid><description>An experiment log where I redact Flue 1.0 Beta observe events before sending them to Langfuse, tracking the issue triage workflow&apos;s runId, model, and results.</description><pubDate>Mon, 22 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Observing Eve TUI Execution and Tool Calls with Langfuse</title><link>https://llm-lab.dev/en/posts/vercel-eve-langfuse-observability/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/vercel-eve-langfuse-observability/</guid><description>A hands-on comparison of two ways to send Eve tool-calling executions to Langfuse as trace/span/generation data.</description><pubDate>Sun, 21 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Building a Tool-Calling Agent with Vercel&apos;s Eve and Running It from the TUI</title><link>https://llm-lab.dev/en/posts/vercel-eve-deep-dive/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/vercel-eve-deep-dive/</guid><description>A follow-up to my first look: adding tools and evals to Eve, configuring models via the Vercel AI Gateway, invoking tools from the TUI, and exploring the info and eval commands.</description><pubDate>Sat, 20 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Moving Issue Triage into CI: Running a Flue Workflow from GitHub Actions</title><link>https://llm-lab.dev/en/posts/flue-github-actions-issue-triage-workflow/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/flue-github-actions-issue-triage-workflow/</guid><description>A verification log of dry-running a GitHub Issue triage workflow built with Flue 1.0 Beta from GitHub Actions&apos; issues.opened, instead of a persistent webhook server.</description><pubDate>Sat, 20 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Building a GitHub Issue Triage Agent with Flue 1.0 Beta</title><link>https://llm-lab.dev/en/posts/flue-1-0-beta-issue-triage-agent/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/flue-1-0-beta-issue-triage-agent/</guid><description>An experimental log of building a triage agent with Flue 1.0 Beta&apos;s Agent, Skill, and Workflow features that returns structured severity, reproducibility, and label suggestions for GitHub issues.</description><pubDate>Fri, 19 Jun 2026 00:00:00 GMT</pubDate></item><item><title>A Quick Look at Eve, Vercel&apos;s New Agent Framework</title><link>https://llm-lab.dev/en/posts/vercel-eve-first-look/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/vercel-eve-first-look/</guid><description>A quick validation log of running Vercel&apos;s open-source agent framework Eve locally through init, dev startup, and the first session.</description><pubDate>Fri, 19 Jun 2026 00:00:00 GMT</pubDate></item><item><title>How to Use GLM-5.2 on Cloudflare Workers AI: Model ID, Pricing, and TypeScript Setup</title><link>https://llm-lab.dev/en/posts/cloudflare-worker-ai-glm-5-2/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/cloudflare-worker-ai-glm-5-2/</guid><description>A technical note covering the model ID, pricing, context length, Wrangler configuration, and TypeScript implementation for calling GLM-5.2 on Cloudflare Workers AI.</description><pubDate>Thu, 18 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Before Using Flue: Figuring Out What This Framework Actually Is</title><link>https://llm-lab.dev/en/posts/flue-framework-overview/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/flue-framework-overview/</guid><description>A rough summary of how Flue thinks about harnesses, agents, workflows, skills, tools, sandboxes, and persistence — before actually running anything.</description><pubDate>Thu, 18 Jun 2026 00:00:00 GMT</pubDate></item><item><title>What Is Flue 1.0 Beta? New Features and Quickstart Caveats from Local Testing</title><link>https://llm-lab.dev/en/posts/flue-1-0-beta-local-check/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/flue-1-0-beta-local-check/</guid><description>Hands-on notes on running the Astro team&apos;s agent framework Flue 1.0 Beta locally, covering init, build, and run behavior plus the rough edges I hit.</description><pubDate>Thu, 18 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Pre-Implementation Notes on Vercel&apos;s Agent Framework, eve</title><link>https://llm-lab.dev/en/posts/eve-vercel-agent-framework-survey/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/eve-vercel-agent-framework-survey/</guid><description>A summary of my pre-implementation research into Vercel&apos;s eve agent framework: directory-based design, the difference between tools and skills, sandboxing, durable execution, and more.</description><pubDate>Thu, 18 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Moving Cloudflare Pages Deploys to npm Scripts</title><link>https://llm-lab.dev/en/posts/cloudflare-pages-deploy-script-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/cloudflare-pages-deploy-script-note/</guid><description>A short operations note on adding deploy scripts to package.json so I don&apos;t have to remember wrangler commands every time I redeploy to Cloudflare Pages.</description><pubDate>Wed, 17 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Training Hermes Agent as a Business Decision Partner</title><link>https://llm-lab.dev/en/posts/hermes-agent-001-support-triage-agent-start/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/hermes-agent-001-support-triage-agent-start/</guid><description>An experiment in growing Hermes Agent into a business-ready support agent for decision-making.</description><pubDate>Sun, 14 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Natural Language Data Analysis with ClickHouse and Claude MCP — Is the Era of Writing SQL Coming to an End?</title><link>https://llm-lab.dev/en/posts/clickhouse-001-claude-mcp/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/clickhouse-001-claude-mcp/</guid><description>Building an environment where Claude Desktop can operate ClickHouse through natural language using the ClickHouse MCP.</description><pubDate>Fri, 12 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Why Cloudflare KV and a Small Admin Panel Fit a Parking Lot Site</title><link>https://llm-lab.dev/en/posts/local-site-admin-kv-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/local-site-admin-kv-note/</guid><description>A short design note on handling frequently changing public values—like parking availability and phone numbers—using Cloudflare KV and a simple admin panel instead of environment variables.</description><pubDate>Thu, 11 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Building a Langfuse Morning Briefing for Slack</title><link>https://llm-lab.dev/en/posts/langfuse-morning-briefing-trial/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/langfuse-morning-briefing-trial/</guid><description>A practical experiment that aggregates Langfuse traces in a Cloudflare Worker and sends only token, cost, and latency anomalies to Slack.</description><pubDate>Mon, 08 Jun 2026 00:00:00 GMT</pubDate></item><item><title>I Want to Build a Langfuse Morning Briefing That Pushes Only Anomalies to Slack</title><link>https://llm-lab.dev/en/posts/langfuse-morning-briefing/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/langfuse-morning-briefing/</guid><description>Manually checking traces every day is unsustainable. A personal GenAIOps plan to aggregate Langfuse failures, token spikes, and low-score outputs into a single Slack morning briefing.</description><pubDate>Sat, 06 Jun 2026 00:00:00 GMT</pubDate></item><item><title>Why I Stopped Polishing Prompts and Started Using Feedback Loops</title><link>https://llm-lab.dev/en/posts/llm-loop-engineering-first-step/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/llm-loop-engineering-first-step/</guid><description>I explain why output quality stays unstable even with careful prompt design, and how I switched to a generate-evaluate-feedback-regenerate loop. Includes the smallest manual steps to start today.</description><pubDate>Wed, 03 Jun 2026 00:00:00 GMT</pubDate></item><item><title>How Much of Your LLM Traffic Can Cloudflare AI Gateway Actually Log?</title><link>https://llm-lab.dev/en/posts/cloudflare-ai-gateway-llm-request-response-logging/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/cloudflare-ai-gateway-llm-request-response-logging/</guid><description>Using Cloudflare AI Gateway as an OpenAI-compatible endpoint, I walk through its logging, payload controls, metadata, cost estimation, and OTel integration to see how far it can serve as an entry point for LLM observability.</description><pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate></item><item><title>What to Know First When Introducing Cloudflare AI Gateway</title><link>https://llm-lab.dev/en/posts/cloudflare-ai-gateway-introduction/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/cloudflare-ai-gateway-introduction/</guid><description>The minimum steps to start using Cloudflare AI Gateway from Workers AI, and what to understand about its role, settings, and checkpoints before moving to the OpenAI-compatible API.</description><pubDate>Fri, 15 May 2026 00:00:00 GMT</pubDate></item><item><title>What Engineers Should Design After AI Makes Coding Faster</title><link>https://llm-lab.dev/en/posts/aidd-after-code-generation-design/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/aidd-after-code-generation-design/</guid><description>As tools like Claude Code and Codex accelerate code generation, the scope of what engineers must design expands from the code itself to problems, constraints, operations, and validation.</description><pubDate>Sun, 03 May 2026 00:00:00 GMT</pubDate></item><item><title>From Work Logs to Blog Notes: Extract One Decision at a Time</title><link>https://llm-lab.dev/en/posts/work-log-to-blog-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/work-log-to-blog-note/</guid><description>How I turn stuck points and decisions from Codex work logs into short blog notes.</description><pubDate>Fri, 01 May 2026 00:00:00 GMT</pubDate></item><item><title>Stop Repeating Yourself in AIDD: Turn Repeat Work into Standard Entry Points</title><link>https://llm-lab.dev/en/posts/aidd-standardization-repeat-work/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/aidd-standardization-repeat-work/</guid><description>How to move beyond personal AI prompt tricks by turning them into standard commands, templates, and review checklists that the whole team can reuse.</description><pubDate>Mon, 27 Apr 2026 00:00:00 GMT</pubDate></item><item><title>OpenUI: A Framework for Rapid Generative UI Development</title><link>https://llm-lab.dev/en/posts/generative-ui-fast-development-openui/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/generative-ui-fast-development-openui/</guid><description>Implementing Generative UI from scratch means dealing with messy component management and streaming control. OpenUI is a framework that neatly abstracts and hides all of that.</description><pubDate>Tue, 21 Apr 2026 00:00:00 GMT</pubDate></item><item><title>Taming Thumbnail White Space in an Astro Blog</title><link>https://llm-lab.dev/en/posts/astro-thumbnail-contain-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/astro-thumbnail-contain-note/</guid><description>A quick UI note on switching article card thumbnails from `object-cover` to `object-contain` to prevent cropping, and why I rolled the background back to near-white with a light border.</description><pubDate>Fri, 17 Apr 2026 00:00:00 GMT</pubDate></item><item><title>Publishing a Next.js Static Site on Cloudflare Pages for Free</title><link>https://llm-lab.dev/en/posts/cloudflare-pages-static-site-deploy/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/cloudflare-pages-static-site-deploy/</guid><description>An experiment log verifying deployment of a Next.js static site to Cloudflare Pages, from local verification and build to production deployment and custom domain setup.</description><pubDate>Sat, 11 Apr 2026 00:00:00 GMT</pubDate></item><item><title>Tips for Reliably Extracting Structured JSON from LLMs</title><link>https://llm-lab.dev/en/posts/generative-ui-correct-json-output/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/generative-ui-correct-json-output/</guid><description>I summarize the approaches and tips for extracting structured data from llms as reliably, accurately, and with as low latency as possible.</description><pubDate>Sun, 18 Jan 2026 00:00:00 GMT</pubDate></item><item><title>AgentOps Sounds New, but the Problems Are Familiar</title><link>https://llm-lab.dev/en/posts/agentops-old-automation-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/agentops-old-automation-note/</guid><description>How old automation failures — batch jobs, notification bots, admin UIs, outdated runbooks — raise the same questions for AI agent operations.</description><pubDate>Thu, 23 Oct 2025 00:00:00 GMT</pubDate></item><item><title>Renaming the Blog: Tsurezure Agent OPS</title><link>https://llm-lab.dev/en/posts/rename-blog-tsurezure-agentops/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/rename-blog-tsurezure-agentops/</guid><description>Why I moved from Field Ops Notes to Tsurezure Agent OPS — a space for operations, automation, and AI agent topics, rooted in small daily frictions.</description><pubDate>Mon, 18 Aug 2025 00:00:00 GMT</pubDate></item><item><title>What to Check When Asked &apos;Can AI Do This?&apos;</title><link>https://llm-lab.dev/en/posts/can-ai-do-this-check-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/can-ai-do-this-check-note/</guid><description>A note on evaluating whether a task can be handled by AI based on input variability, failure impact, and human review cost rather than model performance alone.</description><pubDate>Thu, 12 Jun 2025 00:00:00 GMT</pubDate></item><item><title>Why Success Logs Alone Aren&apos;t Enough for Operations</title><link>https://llm-lab.dev/en/posts/success-log-not-enough-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/success-log-not-enough-note/</guid><description>A short note on log design: clean success logs alone don&apos;t help you diagnose failures or improve recovery.</description><pubDate>Tue, 04 Mar 2025 00:00:00 GMT</pubDate></item><item><title>Write Down Manual Decision Criteria Before Delegating to an AI Agent</title><link>https://llm-lab.dev/en/posts/manual-decision-before-agent-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/manual-decision-before-agent-note/</guid><description>A note on why you need to audit what operators actually look at before you start agentifying a workflow.</description><pubDate>Thu, 16 Jan 2025 00:00:00 GMT</pubDate></item><item><title>Small Admin Panels Are Where Audit Logs Get Left Behind</title><link>https://llm-lab.dev/en/posts/admin-audit-log-later-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/admin-audit-log-later-note/</guid><description>Notes on how losing track of who changed what and when in a small internal admin panel makes later investigations surprisingly painful.</description><pubDate>Thu, 07 Nov 2024 00:00:00 GMT</pubDate></item><item><title>What to Decide Before Letting AI Summarize Inquiries</title><link>https://llm-lab.dev/en/posts/ai-inquiry-summary-before-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/ai-inquiry-summary-before-note/</guid><description>A note on why you should define who reviews AI summaries and how to trace back to the original message before prioritizing convenience when summarizing inquiry emails or Slack threads with AI.</description><pubDate>Wed, 18 Sep 2024 00:00:00 GMT</pubDate></item><item><title>GitLab Duo Goes Free for All Users: Overview and Impact on Development Workflows</title><link>https://llm-lab.dev/en/posts/gitlab-duo-release/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/gitlab-duo-release/</guid><description>GitLab announced something developers cannot afford to miss. The AI assistance feature &apos;GitLab Duo&apos;, previously limited to paid plans (Ultimate and Premium) or requiring an additional add-on license, will now be provided by default to all GitLab users, including those on the free plan.</description><pubDate>Thu, 22 Aug 2024 00:00:00 GMT</pubDate></item><item><title>Documentation Isn&apos;t Ignored — It&apos;s Left Unupdated</title><link>https://llm-lab.dev/en/posts/docs-are-not-updated-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/docs-are-not-updated-note/</guid><description>A note on why procedures lose trust on the ground: not because they go unread, but because operational gaps never make it back into the docs.</description><pubDate>Tue, 02 Jul 2024 00:00:00 GMT</pubDate></item><item><title>Building a Notification Bot Turned Me Into Its Support Desk</title><link>https://llm-lab.dev/en/posts/notification-bot-owner-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/notification-bot-owner-note/</guid><description>A note on how small notification bots blur the lines around accuracy and responsibility as they become more useful.</description><pubDate>Tue, 14 May 2024 00:00:00 GMT</pubDate></item><item><title>Why CSV Import Edge Cases Escape Your Runbook</title><link>https://llm-lab.dev/en/posts/csv-import-exception-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/csv-import-exception-note/</guid><description>A note on how small divergences in CSV imports—column shifts, encoding issues, end-of-month exceptions—gradually drift outside the runbook.</description><pubDate>Thu, 21 Mar 2024 00:00:00 GMT</pubDate></item><item><title>Remotion Fundamentals: A Field Note on Using React Components for Video</title><link>https://llm-lab.dev/en/posts/react-remotioni-movie-test/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/react-remotioni-movie-test/</guid><description>A write-up on how the powerful React and TypeScript ecosystem can be applied directly to video production through Remotion, and where this approach shines.</description><pubDate>Wed, 14 Feb 2024 00:00:00 GMT</pubDate></item><item><title>Rethinking Nightly Batch Failure Alerts as an Operations Entry Point</title><link>https://llm-lab.dev/en/posts/nightly-batch-alert-agentops-note/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/nightly-batch-alert-agentops-note/</guid><description>A short operations note on treating nightly batch failure alerts as more than simple warnings—breaking them down into detection, diagnosis, retry decisions, and human handoff.</description><pubDate>Thu, 08 Feb 2024 00:00:00 GMT</pubDate></item><item><title>Starting This Blog</title><link>https://llm-lab.dev/en/posts/start-blog-self-introduction/</link><guid isPermaLink="true">https://llm-lab.dev/en/posts/start-blog-self-introduction/</guid><description>The first post from an engineer who maintains business systems and builds small automations: documenting what gets stuck in operations and keeping technical notes.</description><pubDate>Fri, 12 Jan 2024 00:00:00 GMT</pubDate></item></channel></rss>