Skip to main content
Articles

Articles

2025

Using sing-box Tun Mode to Implement a Transparent Proxy for V2rayU

Using sing-box Tun Mode to Implement a Transparent Proxy for V2rayU

This post documents the troubleshooting process for a gemini-cli OAuth login failure on macOS. Since V2rayU lacks a native Tun mode, it cannot proxy the gemini-cli’s random port callback. This article introduces a clever solution: using sing-box to enable Tun mode for transparent proxying, intercepting all system traffic, and forwarding it back to V2rayU’s SOCKS port, perfectly solving the proxy challenge for CLI tools.
A Brief Analysis of Claude Code's Execution and Prompts

A Brief Analysis of Claude Code's Execution and Prompts

Through reverse engineering, this article provides a deep dive into the internal architecture and working principles of Anthropic’s AI coding assistant, Claude Code. It breaks down the collaborative mechanisms of its Main and Sub-Agents, system prompts, toolset definitions, and context management strategies, helping you to fully understand the autonomous execution flow of this powerful AI tool.

2024

From paper to source code: a detailed explanation of the RAG algorithm

From paper to source code: a detailed explanation of the RAG algorithm

·9744 words·46 mins
This article aims to explore the architectural design and specific code implementation of the RAG algorithm through the interpretation of papers and source code. This article mainly discusses GraphRAG, LightRAG and RAPTOR RAG, and also mentions Contextual Retrieval proposed by Anthropic and the evaluation method of the RAG algorithm. In the end, it is recommended that different methods be selected according to the size of the knowledge base document.
Rerank Models

Rerank Models

·2454 words·12 mins
With the popularity of the Transformer architecture, many Embedding and Rerank models are now based on this architecture. Taking this opportunity, we will sort out the process and history of the research, and take stock of the architectures adopted by several well-known Rerank models and the companies that developed them. Finally, we will return to the topic and briefly discuss whether Rerank should be used in RAG scenarios.
HTTP/2 and CONTINUATION Flood

HTTP/2 and CONTINUATION Flood

·2348 words·12 mins
This article mainly introduces the HTTP/2 protocol and its CONTINUATION Flood problem. The article shows how to parse the Frame structure in Http2-related code through the golang.org/x/net source code, and analyzes in detail the three security risks of the CONTINUATION Flood attack and the corresponding solutions.
Mixed Expert (MoE) Model Notes

Mixed Expert (MoE) Model Notes

·1388 words·7 mins
This article mainly sorts out the relevant concepts of the hybrid expert model (MoE), and introduces the architectures and optimization methods of several open source MoE models, such as GShard, Switch Transformers, DeepSeek-MoE, and LLaMA-MoE. The characteristics and optimization methods of these models are also introduced.

2023

Vector similarity search methods

Vector similarity search methods

·3244 words·7 mins
This paper provides a detailed introduction to various vector similarity search methods, such as KD trees, IVF inverted indexes, HNSW and LSH. It provides a detailed introduction from data structures to algorithm implementations by analyzing the specific implementations in the source codes of Annoy, Faiss, PGVector and FALCONN.
Java & Go thread model comparison

Java & Go thread model comparison

This paper compares in detail the threading models and scheduling mechanisms of the Java and Go programming languages. It analyzes their specific implementations and design ideas from a source code perspective, especially the 1:1 correspondence between Java’s Thread and operating system threads, and the n:m relationship between Go’s goroutine managed through the GPM model.
Hugo + umami Blog Statistics Panel

Hugo + umami Blog Statistics Panel

·1228 words·6 mins
This article describes in detail the specific configuration steps for multiple Umami deployment solutions, and also briefly explains the advantages, disadvantages and applicable scenarios of each solution. The article focuses on the method of configuring Umami under the Hugo framework, including the specific steps for adding Umami tracking code to different themes, as well as some advanced configuration options such as the TrackEvent and Tracker parameters.
Exploring the built-in Spring-Boot server

Exploring the built-in Spring-Boot server

This article explores in detail the principles and use of the web servers built into Spring Boot (including Tomcat, Jetty, Undertow, and Netty), with a particular focus on the differences between the Servlet and Reactive frameworks and their implementations in Spring Framework 5.0, including the WebServer interface and the concrete implementation of WebServerFactory.

2022

A brief introduction to blog building

A brief introduction to blog building

·764 words·4 mins
This article details the process of setting up a blog using Hugo and a Docker image, including the selection of different image versions, the use of the Blowfish theme, custom icons, the integration of comment plugins, the deployment of GitHub Pages, and the integration of the Umami statistics panel.