AI Library

AI Library

Books for Reading AI

Choose a book, then read it in order from the table of contents.

Leaving It to AI and Stepping Away cover

27 posts

Leaving It to AI and Stepping Away

Kim Kyung-jin

A Complete Beginner's Guide to YOLO Mode. Table of contents and 26 chapters

A beginner-friendly online book on YOLO mode in Claude Code and Codex. It explains how to let AI read files, write code, run commands, and finish work while keeping rollback, Docker sandboxing, and safety checks close at hand.

Share

Facebook X LinkedIn Threads
Artificial Intelligence Fighter, Artificial Intelligence Air Force book cover

43 posts available

Artificial Intelligence Fighter, Artificial Intelligence Air Force

Kim Kyung-jin

Table of Contents, Preface, 40 Chapters, Epilogue

Artificial Intelligence Fighter, Artificial Intelligence Air Force is an online AI Library book by Kim Kyung-jin. It covers AI fighters, autonomous air power, unmanned combat aircraft, CCA, MUM-T, sixth-generation fighters and is organized as Table of Contents, Preface, 40 Chapters, Epilogue.

Share

Artificial Intelligence on Trial book cover

26 posts available

Artificial Intelligence on Trial

Attorney Kyungjin Kim

Table of Contents, Preface, 21 Chapters, 3 Appendices

Artificial Intelligence on Trial is an online AI Library book by Attorney Kyungjin Kim. It covers artificial intelligence and law, AI liability, algorithmic judgment, courts and technology and is organized as Table of Contents, Preface, 21 Chapters, 3 Appendices.

Share

PALANTIR book cover

16 posts available

PALANTIR: War, Surveillance, Artificial Intelligence

Attorney Kyungjin Kim

Table of Contents, Preface, 14 Chapters

PALANTIR: War, Surveillance, Artificial Intelligence is an online AI Library book by Attorney Kyungjin Kim. It covers Palantir, war, surveillance, artificial intelligence, data analytics, national security and is organized as Table of Contents, Preface, 14 Chapters.

Share

Brain Readers: Neuralink and the Final Human Revolution book cover

21 posts available

Brain Readers: Neuralink and the Final Human Revolution

Kim Kyung-jin

Table of Contents, Prologue, 18 Chapters, Epilogue

Brain Readers: Neuralink and the Final Human Revolution is an online AI Library book by Kim Kyung-jin. It follows Neuralink, brain-computer interfaces, brain data, medicine, neurorights, and the future of human enhancement.

Facebook X LinkedIn Threads Telegram
Artificial Intelligence and the Reshaping of Society book cover

16 posts available

Artificial Intelligence and the Reshaping of Society

Kim Kyung-jin

Table of Contents, Preface, 13 Chapters, Epilogue

Artificial Intelligence and the Reshaping of Society is an online AI Library book by Kim Kyung-jin. It follows how artificial intelligence changes work, education, inequality, cities, democracy, and human relationships.

Facebook X LinkedIn Threads Telegram
Ten Questions AI Poses to Humanity book cover

12 posts available

Ten Questions AI Poses to Humanity

Kim Kyung-jin

Table of Contents, Preface, 10 Chapters

Ten Questions AI Poses to Humanity is an online AI Library book by Kim Kyung-jin. It asks how artificial intelligence changes truth, weapons, work, data, identity, and human control.

Facebook X LinkedIn Threads Telegram
Malaysia and the Malacca Strait book cover

23 posts available

Malaysia and the Malacca Strait: Whoever Controls It Controls the World

Kim Kyung-jin

Table of Contents, Preface, 20 Chapters, Epilogue

Malaysia and the Malacca Strait is an online AI Library book by Kim Kyung-jin. It covers Malaysia, the Malacca Strait, maritime logistics, geopolitics, global trade, and Southeast Asia’s strategic future.

Facebook X LinkedIn Threads Telegram
Georgia history and culture travel book cover

24 posts available

A Journey Through Georgia's History and Culture

Kim Kyung-jin

Table of Contents, Preface, 17 Chapters, 4 Appendices, Epilogue

A Journey Through Georgia’s History and Culture is an online AI Library book by Kim Kyung-jin. It covers Georgia’s history, culture, religion, politics, travel, and the Caucasus crossroads between Europe and Asia.

Facebook X LinkedIn Threads Telegram
Reading Armenia book cover

13 posts available

Reading Armenia: A Thousand Prayers, One Mountain

Kim Kyung-jin

Table of Contents, Preface, 10 Chapters, Epilogue

Reading Armenia: A Thousand Prayers, One Mountain is an online AI Library book by Kim Kyung-jin. It covers Armenian history, faith, Mount Ararat, cultural memory, travel, and the endurance of a small nation.

Facebook X LinkedIn Threads Telegram
Mastering Claude Code book cover

41 posts available

Mastering Claude Code

Kim Kyung-jin

Table of Contents, Preface, Chapters, Appendices

Mastering Claude Code is an online AI Library book by Kim Kyung-jin. It covers Claude Code setup, commands, workflows, automation, agents, and practical methods for using Claude Code in real work.

Facebook X LinkedIn Threads Telegram
Claude Cowork and Agent manual book cover

11 posts available

Claude Cowork and Agent Utilization Manual

Kim Kyung-jin

Table of Contents, Preface, 8 Chapters, Closing Note

Claude Cowork and Agent Utilization Manual is an online AI Library book by Kim Kyung-jin. It covers Claude Code, AI agents, coding automation, work automation, and practical agent-based collaboration.

Facebook X LinkedIn Threads Telegram
2026 U.S.-Iran War and the Global Energy Crisis book cover

39 posts available

The 2026 U.S.-Iran War and the Global Energy Crisis

Kim Kyung-jin

Table of Contents, Preface, Chapters and Appendices

The 2026 U.S.-Iran War and the Global Energy Crisis is an online AI Library book by Kim Kyung-jin. It covers war, oil, the Strait of Hormuz, maritime security, energy markets, and the global consequences of conflict.

Facebook X LinkedIn Threads Telegram
The Traces Han Dong-hoon Left on South Korea book cover

13 posts available

The Traces Han Dong-hoon Left on South Korea

Kim Kyung-jin

Table of Contents, Prologue, Chapters, Epilogue

The Traces Han Dong-hoon Left on South Korea is an online AI Library book by Kim Kyung-jin. It examines his record in justice policy, immigration reform, public institutions, and the structural questions facing South Korea.

Facebook X LinkedIn Threads Telegram
The Han Dong-hoon Story book cover

39 posts available

The Han Dong-hoon Story

Kim Kyung-jin

Table of Contents, Prologue, Chapters, Epilogue

The Han Dong-hoon Story is an online AI Library book by Kim Kyung-jin. It traces Han Dong-hoon’s life, public career, political choices, and the changing landscape of South Korean conservative politics.

Facebook X LinkedIn Threads Telegram
Beyond the Glass Ceiling cover

39 entries

Beyond the Glass Ceiling

Kim Kyung-jin

Table of contents, prologue, 31 chapters, epilogue, 5 appendices

A political biography tracing Sanae Takaichi's rise from Nara to Japan's premiership, through party struggles, security policy, diplomacy, and the meaning of Japan's first female prime minister.

Sam Altman Biography: Pioneer of the AI Revolution cover

22 posts

Sam Altman Biography: Pioneer of the AI Revolution

Kim Kyung-jin, Kim Kyung-ran

Table of contents, preface, 7 parts, 20 chapters

An online biography following Sam Altman's childhood, startups, Y Combinator, OpenAI, ChatGPT, the 2023 board crisis, and his sense of responsibility in the AI era.

Facebook X LinkedIn Threads Telegram
From Chaiwala to Prime Minister cover

13 entries

From Chaiwala to Prime Minister

Kim Kyung-jin

Table of contents, preface, 10 chapters, epilogue

A political biography tracing Narendra Modi from a chai-selling boy in Vadnagar to RSS organizer, Gujarat chief minister, and three-term prime minister, while reading modern India, Korea-India relations, and the risks of a rising power.

AI Classroom: Your Grades Will Change book cover

26 posts available

AI Classroom: Your Grades Will Change

Kim Kyung-jin

Table of Contents, Preface, 24 Sections

An online AI Library book by Kim Kyung-jin on how AI can support elementary, middle, and high school learning, teaching, assessment, and educational equity.

Facebook X LinkedIn Threads Telegram
Military Artificial Intelligence cover

17 entries

Military Artificial Intelligence

Kim Kyung-jin and Kim Won-tae

Table of contents, preface, 14 chapters, epilogue

A full-length study of military artificial intelligence, from autonomous weapons, drones, command systems, logistics, and cyber defense to the strategies of the United States, China, Israel, Korea, and global defense AI companies.

Global Case Studies in Introducing AI into Public Administration book cover

25 posts available

Global Case Studies in Introducing AI into Public Administration

Kim Kyung-jin

Table of Contents, 23 Chapters, Epilogue

An online AI Library book by Kim Kyung-jin on public-sector AI adoption, national strategies, administrative services, governance, and future policy tasks.

Facebook X LinkedIn Threads Telegram
Seven Misunderstandings About the Arctic Route book cover

10 posts available

Seven Misunderstandings About the Arctic Route

Kim Kyung-jin

Table of Contents, Preface, 7 Chapters, Epilogue

An online AI Library book by Kim Kyung-jin on seven common misunderstandings about the Arctic Route, including speed, liner service, insurance, safety rules, year-round access, carbon impact, and infrastructure.

Facebook X LinkedIn Threads Telegram
Artificial Intelligence Election cover

14 posts

Artificial Intelligence Election

Kim Kyung-jin

Table of contents, author preface, 11 chapters, closing essay

An online book on campaign messaging, publicity materials, digital campaigning, data analysis, campaign operations, disinformation defense, legal risk, and ready-to-use prompts.

Facebook X LinkedIn Threads Telegram
Demis Hassabis book cover

34 posts available

Demis Hassabis, Father of Google’s Artificial Intelligence

Kim Kyung-ran, Kim Kyung-jin

Table of Contents, Author’s Preface, 31 Chapters, Epilogue

Demis Hassabis, Father of Google’s Artificial Intelligence is an online AI Library book by Kim Kyung-ran, Kim Kyung-jin. It covers Demis Hassabis, Google DeepMind, artificial intelligence, AlphaGo, AI research and is organized as Table of Contents, Author’s Preface, 31 Chapters, Epilogue.

Share

The Dhammapada 423 Verses book cover

28 posts available

The Dhammapada: 423 Verses

Kim Kyung-jin

Table of Contents, Editor's Note, 26 Chapters, 423 Verses

An online AI Library book by Kim Kyung-jin. This edition arranges all 423 verses of the Dhammapada into 26 chapters for slow, poetic reading.

Facebook X LinkedIn Threads Telegram
Nano Banana Pro Practical Prompt Book cover

24 posts

Nano Banana Pro Practical Prompt Book

Kim Kyung-jin

6 parts, 22 chapters, classroom prompt appendix

An online book for using Nano Banana Pro in classes and real work, covering image generation, editing, text rendering, character consistency, business use cases, and monetization.

Facebook X LinkedIn Threads Telegram
Liberal Arts AI for College Students book cover

16 posts available

Liberal Arts AI for College Students

Kim Kyung-jin

Table of Contents, Preface, 13 Chapters, Closing Essay

An online AI Library textbook for college students. It introduces AI history, daily use, document work, research, images, presentations, video, productivity, learning, careers, copyright, and governance.

Facebook X LinkedIn Threads Telegram
Legal Practice and Artificial Intelligence book cover

16 posts available

Legal Practice and Artificial Intelligence

Kim Kyung-jin

Table of Contents, Preface, 14 Parts

An online AI Library book by Kim Kyung-jin on legal research, drafting, evidence analysis, contract review, NotebookLM, and practical generative AI workflows for legal practice.

Facebook X LinkedIn Threads Telegram
Hello, I Am Kim Kyung-jin book cover

10 posts available

Hello, I Am Kim Kyung-jin

Kim Kyung-jin

Table of Contents, Preface, Recommendations, 6 Chapters, Closing

An online AI Library book on Kim Kyung-jin's life, science and technology policy, parliamentary diplomacy, legislative battles, Dongdaemun vision, and proposals for Korea's demographic future.

Facebook X LinkedIn Threads Telegram
Politics and People book cover

25 posts available

Politics and People

Kim Kyung-jin

Table of Contents, Prologue, 22 Chapters, Epilogue

An online AI Library book by Kim Kyung-jin on how politics begins with reading people, winning trust, keeping relationships, and enduring seasons of crisis.

Facebook X LinkedIn Threads Telegram

[AI Library] Chapter 11: Hybrid Search and Semantic Storage: Building Long-Term Memory

Mastering Claude Code
Author
Kim Kyung-jin
Date
2026-05-06 09:09
Views
353

Mastering Claude Code

Chapter 11: Hybrid Search and Semantic Storage: Building Long-Term Memory

Kim Kyung-jin

Mastering Claude Code

Introduction

I handed a 68-page vacuum cleaner manual PDF to Claude Code and asked, "How do I clean the filter?" The agent performed a search, then showed me step-by-step instructions in text. Below that appeared a parts diagram image from the manual. The agent made its own judgment that for physical devices, pictures could be far clearer than words.

The 68-page document could not fit entirely within the agent's single read window. So how did it find the exact information on the exact page? The answer lies in Retrieval-Augmented Generation.

What is Retrieval-Augmented Generation?

AI agents face a fundamental constraint: they cannot know information absent from their training data. A company's internal manuals, meeting notes from last week, field photos taken this morning,such data never existed in the model's training. You could fit all documents into a single read window, but past a few dozen pages, you hit the token limit the model can process.

Retrieval-Augmented Generation is an architecture that sidesteps this problem. It works in three stages.

Retrieval: The agent searches an external data repository for information relevant to the user's question. Rather than reading the entire document, it extracts only the pieces that match the query.

Augmentation: The retrieved fragments are attached to the agent's prompt. The agent can now draw on information it originally did not possess, as if it had just read it.

Generation: The agent composes an answer based on the enriched context. The agent's reasoning fuses with external data to produce a response neither could deliver alone.

[Figure 11-1: Three stages of Retrieval-Augmented Generation: Retrieval → Augmentation → Generation flowchart]

Think of it as an open-book exam. The student (the agent) need not memorize every textbook. During the test, the student can flip to the relevant page. Yet the student must master two things: the ability to quickly decide which page to open, and the ability to combine that page's information with existing knowledge to construct an answer. These skills belong to the student.

In Retrieval-Augmented Generation, the technology that decides "which page to open" is embedding.

The Concept of Embedding

Embedding is the process of converting text into numerical vectors. The word "vector" may sound mathematical, but the core idea is simple: representing a sentence's meaning as a list of numbers.

The sentence "I want to drink a cup of coffee" and the sentence "I want to order a cappuccino" use different words. Yet their meanings are similar. An embedding model converts both sentences into comparable numerical vectors. By contrast, "The stock market fell today" transforms into a completely different vector. The meaning differs.

When these vectors are arranged in multidimensional space, semantically similar sentences cluster close together, while semantically different ones drift far apart. This is how semantic similarity-based search works. When a user asks "filter cleaning method," the system converts the question into a vector, then finds the closest vectors among pre-stored document fragment vectors.

Keywords need not match exactly. If the meaning aligns, the search returns it. A document labeled "dust cover washing procedure" can surface as an answer to "filter cleaning method."

[Figure 11-2: Visualization of similar sentences forming clusters in embedding space]

Summarizing embedding's role in Retrieval-Augmented Generation: Divide documents into small fragments, called chunks. Pass each chunk through an embedding model to convert it into a vector. Store the resulting vectors in a database. When a question arrives, convert it to a vector as well. Find the stored vectors closest to the question vector and deliver them to the agent.

Using Google Gemini Multimodal Embedding

So far, we have discussed text. But real-world data is not text alone. Manuals contain assembly diagrams. Field reports include photographs. Educational materials embed video.

Google's Gemini Embedding 2 is a multimodal embedding model that places text, images, video, and audio into the same vector space.

Let us examine the changes this model brings through concrete examples.

Vacuum cleaner manual example: Embed the entire 68-page PDF. Not only text fragments but diagram images are converted to vectors. When asked "filter cleaning method," the system returns the text explanation alongside the corresponding diagram image. You can now see component locations in the image that text alone would struggle to convey.

Roof repair company example: Embed thirteen past project photographs. Each photo carries metadata,cost, duration, workforce size. Upload a new roof photo, and the system returns five similar past projects with similarity scores. These serve as references for preparing estimates.

[Figure 11-3: Multimodal embedding space: 2D visualization of text, images, and video positioned by meaning]

The power of multimodal embedding lies in different data types coexisting in the same space. A smiley-face fries photograph lands in the "food" category, a dog playing guitar video in the "entertainment" category, a Claude Code tutorial in the "technology" category. Though data types differ, AI grasps the meaning and places each in the right location. If all data were roof photographs, the system auto-classifies them into subcategories: flood damage, age deterioration, structural defects.

As of now, video supports MP4 or MOV files up to 120 seconds long. Images handle up to 6 PNG or JPEG files per request. Audio is also supported; providing accurate descriptive metadata alongside audio improves search accuracy.

Hands-On Practice: Connecting Pinecone Semantic Data Storage

Embedded vectors must be stored and searchable somewhere. That repository is a vector database, and Pinecone ranks among the most widely used semantic data storage services.

The overall flow of this exercise is as follows.

Step 1: Environment Setup

In VS Code, create a new folder and open Claude Code. Switch to Plan Mode and give the agent this instruction.

The agent designs the project structure, catalogs dependencies, and presents a step-by-step plan.

Three API keys are needed. Pinecone accesses the vector store; Gemini calls the embedding model; OpenRouter accesses the chat model for answer generation. Pinecone offers a free starter plan, Gemini API keys come from Google AI Studio, and OpenRouter provides access to multiple models through a single API endpoint.

Step 2: Data Embedding

Enter the three keys into your .env file and save. Place the files you want to embed into a data folder. Mix text files, images, and video,no problem. Tell the agent, "The data is ready, put it in Pinecone." The agent creates a Pinecone index and embeds each file for storage.

During this process, the agent recognizes each file's type and applies the appropriate embedding method. Text is split into chunks and embedded. Images are converted to vectors capturing visual meaning. Videos are analyzed for frames and audio, then converted to vectors.

[Figure 11-4: Exercise pipeline: Original files → Embedding model → Pinecone index]

Step 3: Building a Chat Interface

Ask the agent, "Build me a chat web app I can test on my local machine." The agent constructs the web application and runs it on localhost. Type a question in the browser; Pinecone searches for relevant vectors, and the agent generates an answer based on what it finds.

In actual validation, when asked 'How should we procure a workflow client?', it finds the relevant content in a text file and answers. When requested 'Show me a video of a golden retriever playing guitar', it locates that video's metadata and plays it inline.

Step 4: Iterative Improvement

The first result may not be perfect. Images might not be returned, or video descriptions could be incomplete. When you describe the problem to the agent, it enriches the metadata or fixes the app. Request 'Add better descriptions to the videos and re-embed them', and it deletes the existing vectors, then saves them anew with improved metadata.

This entire process happens within 30 minutes. Building the same multimodal vector repository in a no-code tool like n8n would take hours to days. You must manually configure chunking strategy, image capture and storage methods, and search result formatting. Claude Code handles all of this with natural language instructions alone.

How Hybrid Search and Semantic Storage Transform Agent Workflows

An agent without hybrid search and semantic storage answers only within its training scope. It references only information that fits in its reading range at any moment. The world beyond that might as well not exist.

An agent equipped with hybrid search and semantic storage is different. It can read a company's 68-page manual. It can search hundreds of construction photographs. It can find the context of a specific decision in last quarter's meeting minutes. The agent's knowledge expands beyond its training data to include all data the organization holds.

Hybrid search and semantic storage serves multiple scenarios in agent workflows.

Customer support automation: Embed product manuals and FAQs, then generate answers that reference exact pages and images for customer questions. You can also search past ticket records to answer 'Have we received similar inquiries before?'

Internal knowledge management: Embed the team's project documents, decision logs, and brand guidelines. When a new team member asks 'What are our company's logo usage rules?', it finds the relevant section in the brand guidelines and answers.

Research assistance: Embed academic papers, reports, and market research materials. When asked 'What are recent trends in this field?', it searches relevant materials and generates a summary. It provides original sources and confidence scores, making fact-checking possible.

[Figure 11-5] Diagram comparing agent knowledge scope before and after implementing hybrid search and semantic storage.

What matters here is subject matter expertise. More than technical skill in building a hybrid search and semantic storage pipeline, result quality depends on how you describe the data and in what way. As the roof repair example showed, sparse metadata on a photo means sparse search results.

A photo described as 'This shows hail damage on a 10-year-old asphalt shingle roof; repair cost was 4.5 million won, took 3 days' and a photo tagged only as 'roof photo' have vastly different search utility.

The value of technical implementation skill is diminishing. We watched Claude Code build in 30 minutes what took days in n8n. Agents handle technical details,composing JSON, configuring HTTP requests. But clearly describing a process, precisely articulating what data means, spotting gaps and naming them remain human work.

Giving agents long-term memory meant connecting a database. Now let's look at how to give agents skilled techniques they can perform repeatedly,reusable patterns of behavior that, once taught, run at consistent quality any time.

AI Specialist Attorney Kim Kyung-jin

Specialist in AI law policy. Former member of the National Assembly. Author of multiple books.

If this book has been at your side, however briefly, support it so the next story can reach the world.

(Voluntary support account: NH Bank 302-1096-0948-81 Account holder: Kim Kyung-jin)

Kim Kyung-jin

Attorney · Former Member of the National Assembly · AI Policy Researcher

kimkj.com

© 2026 Kim Kyung-jin. All rights reserved.

#KimKyungjin #AILibrary #MasteringClaudeCode
Scroll to Top
kimkj.com Home