Fix/memory leaks, add functions to report context usage, add unittest #20

andinux · 2025-11-13T04:23:17Z

fix(chat): avoid memory leaks and unnecessary re-initialization in llm_chat_check_context
Adds a check to return early if the chat struct is already initialized, preventing memory leaks and unnecessary re-initialization and ensuring correct chat context handling.
fix(chat): remove the duplicate_content arg in llm_messages_append to avoid a crash on llm_chat_free()
Refactored llm_messages_append to always duplicate message content using sqlite_strdup, removing the duplicate_content parameter. This option was causing a double-free crash in llm_chat_free with non-duplicated content. Also, the ai->chat.response buffer is reset on new prompts in the same chat so we cannot rely on that buffer to store previous response messages.
test: add C unit tests
Introduces a new C unit test runner at tests/c/unittest.c with basic tests for the SQLite AI extension. The Makefile is updated to build and run these tests, including logic to download a test model if needed and link against the appropriate SQLite libraries.
ci: add GGUF model caching to workflow
Introduces a new 'download-model' job in the GitHub Actions workflow to cache and restore the GGUF model, reducing redundant downloads. Updates the Makefile to use GGUF model variables, simplifies test model handling, and ensures the test binary is built with the correct SQLite source.
Add SQLite amalgamation for testing.
feat: add llm_context_used and llm_context_size functions to report context usage
Add two functions: llm_context_size, which returns the total context size, and llm_context_used, which returns the number of tokens used.

…m_chat_check_context Adds a check to return early if the chat struct is already initialized, preventing memory leaks and unnecessary re-initialization and ensuring correct chat context handling.

… avoid a crash on llm_chat_free() Refactored llm_messages_append to always duplicate message content using sqlite_strdup, removing the duplicate_content parameter. This option was causing a double-free crash in llm_chat_free with non-duplicated content. Also, the ai->chat.response buffer is reset on new prompts in the same chat so we cannot rely on that buffer to store previous response messages.

Introduces the llm_context_usage SQLite function, which returns a JSON object with the current context size, tokens used, and usage ratio. This helps users monitor LLM context utilization and manage resources more effectively.

Introduces a new C unit test runner at tests/c/unittest.c with basic tests for the SQLite AI extension. The Makefile is updated to build and run these tests, including logic to download a test model if needed and link against the appropriate SQLite libraries.

Replaces the llm_context_usage function with two separate functions: llm_context_size, which returns the total context size, and llm_context_used, which returns the number of tokens used. Updates function registration accordingly for improved clarity and granularity.

Introduces a new test, test_llm_chat_vtab, to verify the llm_chat virtual table functionality and memory usage. Adds the exec_select_rows helper for row-counting and verbose output. Updates test_llm_chat_respond_repeated to use varied prompts and more detailed context usage queries. Removes redundant SQLITE_ENABLE_LOAD_EXTENSION macro definition.

Introduces a new 'download-model' job in the GitHub Actions workflow to cache and restore the GGUF model, reducing redundant downloads. Updates the Makefile to use GGUF model variables, simplifies test model handling, and ensures the test binary is built with the correct SQLite source. Add SQLite amalgamation for testing.

Replaces the use of the ETag header with a SHA-256 hash of the GGUF model URL for cache key generation in the workflow. This improves reliability by avoiding dependency on remote server ETag support.

…ndows and linux-musl Introduces a new output 'model-path' in the download-model job and updates subsequent steps to use this output. This improves maintainability by centralizing the model path reference.

Introduces a conditional test target for the Android platform in the Makefile. The Android test runs only the sqlite3 CLI smoke test, while other platforms also run additional tests. This prevents the Android job from trying to execute build/tests/sqlite_ai_tests (and other host-only tools) inside the emulator while keeping the rest of the test workflow unchanged. The commands on Android were trying to use the host’s prebuilt toolchain path inside an Android emulator, which doesn’t exist there

The step that verified the existence of the GGUF model file has been removed from the GitHub Actions workflow because on windows and linux-musl runners it was failing. If the file isn’t present, the Makefile’s $(GGUF_MODEL_PATH) rule will download it as before, so we no longer need to enforce the explicit test -f check that was failing in containerized environments.

- Added SKIP_UNITTEST flag to the Makefile so platforms lacking the necessary tooling (Android, Linux-musl, Windows) can run only the sqlite CLI smoke test while macOS/Linux keep running the full C harness. - Fixed platform-specific issues: removed -ldl on Windows builds, adjusted workflow verification steps, and ensured model paths/devices are handled cleanly.

danielebriggi · 2025-11-13T09:25:05Z

tests/c/unittest.c

+#include "sqlite-ai.h"
+#endif
+
+#define DEFAULT_MODEL_PATH "tests/models/unsloth/gemma-3-270m-it-GGUF/gemma-3-270m-it-UD-IQ2_M.gguf"


chore: I'd add a comment to say that // Just a lightweight model to use for testing

added: 054798d

danielebriggi · 2025-11-13T09:30:07Z

tests/c/unittest.c

+static const test_case TESTS[] = {
+    {"issue15_llm_chat_without_context", test_issue15_chat_without_context},
+    {"llm_chat_respond_repeated", test_llm_chat_respond_repeated},
+    {"llm_chat_vtab", test_llm_chat_vtab},
+};


question: maybe this array should be placed in the head of file so it's easier to remember to fill it with new tests

if I put the array above the function implementations, I would also need to add the functions prototypes before the array

danielebriggi · 2025-11-13T09:36:18Z

@andinux better to update the API.md as well with the new functions

Added comments clarifying the use of a lightweight model for testing in both the Makefile and unittest.c. Reordered and grouped related test variables in the Makefile for better readability.

Adds documentation for `llm_context_size()`, `llm_context_used()`, and `llm_token_count(text)` functions.

andinux · 2025-11-13T14:33:59Z

@andinux better to update the API.md as well with the new functions

API.md updated: 754e5d4

andinux added 13 commits November 12, 2025 11:33

fix(chat): avoid memory leaks and unnecessary re-initialization in ll…

9469601

…m_chat_check_context Adds a check to return early if the chat struct is already initialized, preventing memory leaks and unnecessary re-initialization and ensuring correct chat context handling.

chore: bump version to 0.7.58

79b8b3a

ci: switch GGUF model cache key to URL hash

deac4cd

Replaces the use of the ETag header with a SHA-256 hash of the GGUF model URL for cache key generation in the workflow. This improves reliability by avoiding dependency on remote server ETag support.

ci: refactor GGUF model path usage in workflow to fix execution on wi…

1b006bb

…ndows and linux-musl Introduces a new output 'model-path' in the download-model job and updates subsequent steps to use this output. This improves maintainability by centralizing the model path reference.

andinux requested a review from danielebriggi November 13, 2025 04:23

andinux self-assigned this Nov 13, 2025

danielebriggi approved these changes Nov 13, 2025

View reviewed changes

andinux added 2 commits November 13, 2025 07:46

chore: add comments for test model usage

054798d

Added comments clarifying the use of a lightweight model for testing in both the Makefile and unittest.c. Reordered and grouped related test variables in the Makefile for better readability.

chore(docs): document context and token utility functions

754e5d4

Adds documentation for `llm_context_size()`, `llm_context_used()`, and `llm_token_count(text)` functions.

andinux merged commit c3fd345 into main Nov 13, 2025
21 checks passed

andinux deleted the fix/memory-leaks-chat-usage-tests branch November 13, 2025 15:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix/memory leaks, add functions to report context usage, add unittest #20

Fix/memory leaks, add functions to report context usage, add unittest #20

Uh oh!

andinux commented Nov 13, 2025

Uh oh!

danielebriggi Nov 13, 2025 •

edited

Loading

Uh oh!

andinux Nov 13, 2025

Uh oh!

danielebriggi Nov 13, 2025

Uh oh!

andinux Nov 13, 2025

Uh oh!

danielebriggi commented Nov 13, 2025

Uh oh!

andinux commented Nov 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix/memory leaks, add functions to report context usage, add unittest #20

Fix/memory leaks, add functions to report context usage, add unittest #20

Uh oh!

Conversation

andinux commented Nov 13, 2025

Uh oh!

danielebriggi Nov 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

andinux Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

danielebriggi Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

andinux Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

danielebriggi commented Nov 13, 2025

Uh oh!

andinux commented Nov 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

danielebriggi Nov 13, 2025 •

edited

Loading