Weaxs in Nokstella

GEMINI-CLI Settings Parameter Details

2025-12-09·2236 words·11 mins

Gemini CLI Configuration AI Agent LLM A2A-Server DevTools

In-depth analysis of Gemini CLI core configuration parameters, covering key modules such as Checkpointing, Model Aliases, Context, and Tools. This article details the differences in parameter support between CLI mode and A2A-Server mode, helping developers accurately configure and optimize AI Agent workflows.

Using sing-box Tun Mode to Implement a Transparent Proxy for V2rayU

2025-11-09·1866 words·9 mins

MacOS Sing-Box V2rayU TUN Transparent Proxy Gemini CLI Troubleshooting

This post documents the troubleshooting process for a gemini-cli OAuth login failure on macOS. Since V2rayU lacks a native Tun mode, it cannot proxy the gemini-cli’s random port callback. This article introduces a clever solution: using sing-box to enable Tun mode for transparent proxying, intercepting all system traffic, and forwarding it back to V2rayU’s SOCKS port, perfectly solving the proxy challenge for CLI tools.

A Brief Analysis of Claude Code's Execution and Prompts

2025-10-12·12464 words·59 mins

Claude Code Agent Prompt Engineering Reverse Engineering LLM Tool Use Anthropic Agentic

Through reverse engineering, this article provides a deep dive into the internal architecture and working principles of Anthropic’s AI coding assistant, Claude Code. It breaks down the collaborative mechanisms of its Main and Sub-Agents, system prompts, toolset definitions, and context management strategies, helping you to fully understand the autonomous execution flow of this powerful AI tool.

How do Multimodal Models Process and Understand Images?

2025-06-15·4327 words·21 mins

AI Multimodal Machine Learning ViT CLIP Visual Encoding

From Vision Transformers to image-text alignment, exploring the core technical principles and implementation methods behind multimodal models, including CLIP, SigLIP, and visual encoding strategies of mainstream multimodal large models.

A Survey of Open Source DeepResearch Implementation Solutions

2025-03-06·4631 words·22 mins

DeepResearch DeepSearch Agent LLM Dify LangChain HuggingFace Zilliz Intelligent Agents Large Language Model Applications

Analyzing open source DeepResearch implementations based on source code, including the engineering architecture, Agent design, prompts, and core processes of solutions such as Dify, LangChain, HuggingFace, and Zilliz Cloud.

A Brief Look at Chain of Thought and Reinforcement Learning in DeepSeek-R1 and Kimi k1.5 Papers

2025-02-01·1292 words·7 mins

AI LLM CoT Reinforcement Learning DeepSeek Kimi Model Distillation Chain of Thought

A brief overview of the technical features in reasoning capabilities of DeepSeek-R1 and Kimi k1.5: DeepSeek employs GRPO algorithm and model distillation to enhance reasoning performance, while Kimi explores the integration of long-form Chain of Thought with reinforcement learning.

Building a LightRAG Knowledge Base with TiDB Vector

2024-12-22·1446 words·7 mins

RAG LLM AI TiDB Engineering Practice

After reviewing LightRAG, I found that its persistence support was still limited, missing the most important TiDB (not really). So I took some time to contribute and write about it.

From paper to source code: a detailed explanation of the RAG algorithm

2024-11-30·9744 words·46 mins

RAG LLM AI

This article aims to explore the architectural design and specific code implementation of the RAG algorithm through the interpretation of papers and source code. This article mainly discusses GraphRAG, LightRAG and RAPTOR RAG, and also mentions Contextual Retrieval proposed by Anthropic and the evaluation method of the RAG algorithm. In the end, it is recommended that different methods be selected according to the size of the knowledge base document.

Rerank Models

2024-10-20·2454 words·12 mins

Search AI RAG

With the popularity of the Transformer architecture, many Embedding and Rerank models are now based on this architecture. Taking this opportunity, we will sort out the process and history of the research, and take stock of the architectures adopted by several well-known Rerank models and the companies that developed them. Finally, we will return to the topic and briefly discuss whether Rerank should be used in RAG scenarios.

Weaxs

Recent