Add wrapper tool for FactReasoner #79

jbrry · 2025-07-23T15:16:57Z

Summary 📝

This PR adds FactCheckTool which is a wrapper to call the hosted FactReasoner service.

It also adds some helper tools which enable further downstream analysis: a VectorDBTool which is based on a Chroma vector database is added so that search artefacts can be stored for downstream analysis or so that FactReasoner can use it for context retrieval.

Details

VectorDBTool: akd/tools/vector_db_tool.py used for interacting with a vector database. It uses ChromaDB for persistent storage and sentence-transformers for creating embeddings. The .index() method is used for adding documents, and _arun runs the retrieval based on a query.

DeepLitSearchAgent Integration: The new functionality has been added as part of a DeepLitSearchAgent run. This shows how we can use the new tools to work along with the DeepLitSearchAgent.

Usage

Documentation for the backend deployment is available here.

Standalone example:

from akd.tools.fact_check import FactCheckInputSchema, FactCheckOutputSchema, FactCheckToolConfig, FactCheckTool

from pydantic import HttpUrl

fact_check_config = FactCheckToolConfig(base_url=HttpUrl("https://factreasoner-service-app.1yhbkn094k2v.us-south.codeengine.appdomain.cloud"), polling_interval_seconds=10)
fact_check_tool = FactCheckTool(config=fact_check_config)


sample_question = "What is the coriolis effect?"
sample_answer = "The Coriolis effect describes the pattern of deflection taken by objects not firmly connected to the ground as they travel long distances around the Earth."

fact_check_input = FactCheckInputSchema(
    question=sample_question,
    answer=sample_answer,
)

print(f"\n--- Running Fact-Check on sample answer ---\n{sample_answer}\n")
try:
    fact_check_result = await fact_check_tool.arun(params=fact_check_input)

    print("\n--- Fact-Check Complete ---")
    score = fact_check_result.fact_reasoner_score.get("factuality_score", 0)
    num_supported = len(fact_check_result.supported_atoms)
    num_not_supported = len(fact_check_result.not_supported_atoms)

    print(f"Factuality Score: {score:.2%}")
    print(f"Supported Atoms: {num_supported}")
    print(f"Not Supported Atoms: {num_not_supported}")
    print(f"Graph ID: {fact_check_result.graph_id}")
    
    # graph methods
    if fact_check_result.graph_id:
        graph_id = fact_check_result.graph_id

    print(f"\nFetching graph structure for ID: {graph_id}...")
    graph_json_url = f"{fact_check_tool.config.graph_json_endpoint}/{graph_id}?format=full"
    graph_response = await fact_check_tool.api_client.get(graph_json_url)

    if graph_response.status_code == 200:
        graph_data = graph_response.json()
        print(f"Successfully fetched graph JSON. Found {len(graph_data.get('nodes', []))} nodes and {len(graph_data.get('edges', []))} edges.")
        # print(graph_data) # uncomment for full object.
    else:
        print(f"Failed to fetch graph JSON. Status: {graph_response.status_code}")

except Exception as e:
    print(f"\nExample failed. Ensure the fact-checking service is running.")
    print(f"Error: {e}")

It returns the factuality score based on the question and response pair:

--- Fact-Check Complete ---
Factuality Score: 100.00%
Supported Atoms: 3
Not Supported Atoms: 0
Graph ID: 92ed5024-ad09-48f6-b296-357bdc3e38f7

Fetching graph structure for ID: 92ed5024-ad09-48f6-b296-357bdc3e38f7...
Successfully fetched graph JSON. Found 15 nodes and 36 edges.

I am also listing the test files for each of these tools, which cover initialization and running of the tools and can serve as documentation:

VectorDBTool: test script example
FactCheckTool: test script example.

NISH1001 · 2025-07-23T15:31:22Z

@jbrry Could you please add usage section in the PR as well.

Some sample PRs:

etc

akd/agents/litsearch.py

NISH1001

[Initial superficial ocmments]

akd/tools/text_splitter.py

…vector_database_tool

… testing

movinam

Thanks for the PR! I think the text splitter tool is not needed - we can just use the langchain implementation/or any existing implementation and pass the documents to the vector db to be indexed. Please see comments.

tests/tools/semantic_scholar_search_test.py

akd/tools/text_splitter.py

akd/tools/vector_db_tool.py

akd/tools/fact_check.py

akd/tools/vector_db_tool.py

tests/tools/vector_db_tool_test.py

NISH1001 · 2025-09-03T18:16:48Z

@jbrry There's merge conflict with develop. Could you pull in develop latest changes and resolve?

github-actions · 2025-09-03T19:21:38Z

❌ Tests failed (exit code: )

📊 Test Results

Passed: 0
Failed: 0
Warnings: 0
Coverage: 0%

⚠️ Note: Test counts are 0, which may indicate parsing issues or early test failure. Check the workflow logs for details.

Branch: feature/vector_database_tool
PR: #79
Commit: 982ae7e

📋 Full coverage report and logs are available in the workflow run.

github-actions · 2025-09-03T19:36:19Z

❌ Tests failed (exit code: 1)

📊 Test Results

Passed: 221
Failed: 6
Warnings: 109
Coverage: 72%

Branch: feature/vector_database_tool
PR: #79
Commit: cc83fbe

📋 Full coverage report and logs are available in the workflow run.

github-actions · 2025-09-03T20:00:27Z

❌ Tests failed (exit code: )

📊 Test Results

Passed: 0
Failed: 0
Warnings: 0
Coverage: 0%

⚠️ Note: Test counts are 0, which may indicate parsing issues or early test failure. Check the workflow logs for details.

Branch: feature/vector_database_tool
PR: #79
Commit: 9ed9312

📋 Full coverage report and logs are available in the workflow run.

github-actions · 2025-09-03T20:18:12Z

❌ Tests failed (exit code: 1)

📊 Test Results

Passed: 221
Failed: 6
Warnings: 109
Coverage: 72%

Branch: feature/vector_database_tool
PR: #79
Commit: 03cea67

📋 Full coverage report and logs are available in the workflow run.

github-actions · 2025-09-03T20:25:04Z

❌ Tests failed (exit code: 1)

📊 Test Results

Passed: 221
Failed: 6
Warnings: 109
Coverage: 72%

Branch: feature/vector_database_tool
PR: #79
Commit: 6a87351

📋 Full coverage report and logs are available in the workflow run.

github-actions · 2025-09-03T20:46:42Z

❌ Tests failed (exit code: 1)

📊 Test Results

Passed: 221
Failed: 6
Warnings: 108
Coverage: 72%

Branch: feature/vector_database_tool
PR: #79
Commit: 6a87351

📋 Full coverage report and logs are available in the workflow run.

NISH1001 · 2025-09-04T16:49:43Z

akd/tools/vector_db_tool.py

+        description="Path to the persistent ChromaDB directory.",
+    )
+    collection_name: str = Field(
+        default="litagent_demo",


Can we rename the default name to something else? like akd_vdb or something?

NISH1001

Nitpick

github-actions · 2025-09-04T17:13:40Z

❌ Tests failed (exit code: 1)

📊 Test Results

Passed: 249
Failed: 5
Warnings: 113
Coverage: 73%

Branch: feature/vector_database_tool
PR: #79
Commit: 5f7c9b6

📋 Full coverage report and logs are available in the workflow run.

… testing

github-actions · 2025-09-04T17:14:33Z

❌ Tests failed (exit code: 1)

📊 Test Results

Passed: 221
Failed: 6
Warnings: 108
Coverage: 72%

Branch: feature/vector_database_tool
PR: #79
Commit: 69533eb

📋 Full coverage report and logs are available in the workflow run.

jbrry added 5 commits July 23, 2025 10:54

Add VectorDBTool based on Chroma

727d5f3

Merge branch 'develop' into feature/vector_database_tool

1c75c1a

Adopt config strategy and add text splitter and vector db tools

d85bf70

Update litsearch agent with new indexing capabilities

63e5cbe

Remove comment line

f1f6a76

jbrry requested review from NISH1001 and movinam July 23, 2025 15:16

NISH1001 reviewed Jul 23, 2025

View reviewed changes

akd/agents/litsearch.py Outdated Show resolved Hide resolved

NISH1001 reviewed Jul 23, 2025

View reviewed changes

akd/tools/text_splitter.py Outdated Show resolved Hide resolved

akd/tools/text_splitter.py Outdated Show resolved Hide resolved

jbrry added 14 commits August 7, 2025 11:30

Changes to standalone script for vectordb

2e78325

Add test to run SemanticScholarSearchTool

2586621

Merge changes from develop

21727e0

Add test for text splitter tool

25ccec3

Merge branch 'develop' into feature/add_semantic_scholar_search_test

8d3e4b0

Test the from_params method of the class

384c9d3

Add tests for vector db and text splitter tools

231e6ff

Fix chunking test

c4a57c9

Leave config initialisation to super class

d947cb7

Add FactReasoner fact-check to deep research pipeline

2220443

Add test for FactCheck tool

7a2c13d

Merge branch 'feature/add_semantic_scholar_search_test' into feature/…

a492e6b

…vector_database_tool

Updated test file

595cb38

Wrap string in HttpUrl

8d9ed47

jbrry changed the title ~~Feature/vector database tool~~ Add wrapper tool for FactReasoner Aug 14, 2025

Update demo script with fact check

fce0cfb

github-actions bot added a commit that referenced this pull request Aug 20, 2025

Auto-merge PR #79 (feature/vector_database_tool) into integration for…

b97722d

… testing

movinam reviewed Aug 20, 2025

View reviewed changes

movinam mentioned this pull request Aug 20, 2025

Add test to run SemanticScholarSearchTool #110

Closed

Enable polling for long running processes

6c50cb7

NISH1001 approved these changes Sep 3, 2025

View reviewed changes

Fix conflict from develop

03f0f7f

jbrry had a problem deploying to integration September 3, 2025 19:21 — with GitHub Actions Failure

Fix conflict from develop

e7ef733

jbrry temporarily deployed to integration September 3, 2025 19:26 — with GitHub Actions Inactive

github-actions bot mentioned this pull request Sep 3, 2025

Integration branch merge conflict #175

Closed

jbrry had a problem deploying to integration September 3, 2025 19:56 — with GitHub Actions Failure

jbrry force-pushed the feature/vector_database_tool branch from de16045 to e7ef733 Compare September 3, 2025 20:09

jbrry temporarily deployed to integration September 3, 2025 20:09 — with GitHub Actions Inactive

Sync version of pyproject.toml from integration branch

8dbdc07

jbrry temporarily deployed to integration September 3, 2025 20:17 — with GitHub Actions Inactive

github-actions bot mentioned this pull request Sep 3, 2025

Integration branch merge conflict #176

Closed

github-actions bot mentioned this pull request Sep 3, 2025

Integration branch merge conflict #177

Closed

jbrry temporarily deployed to integration September 3, 2025 20:39 — with GitHub Actions Inactive

NISH1001 reviewed Sep 4, 2025

View reviewed changes

Change vector db collection name to something more generic

2a50d76

jbrry temporarily deployed to integration September 4, 2025 17:04 — with GitHub Actions Inactive

Merge branch 'develop' into feature/vector_database_tool

39c8b8e

jbrry temporarily deployed to integration September 4, 2025 17:05 — with GitHub Actions Inactive

github-actions bot added a commit that referenced this pull request Sep 4, 2025

Auto-merge PR #79 (feature/vector_database_tool) into integration for…

b7adb32

… testing

Add wrapper tool for FactReasoner #79

Are you sure you want to change the base?

Add wrapper tool for FactReasoner #79

Uh oh!

Conversation

jbrry commented Jul 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary 📝

Details

Usage

Uh oh!

NISH1001 commented Jul 23, 2025

Uh oh!

Uh oh!

NISH1001 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

movinam left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

NISH1001 commented Sep 3, 2025

Uh oh!

github-actions bot commented Sep 3, 2025

📊 Test Results

Uh oh!

github-actions bot commented Sep 3, 2025

📊 Test Results

Uh oh!

github-actions bot commented Sep 3, 2025

📊 Test Results

Uh oh!

github-actions bot commented Sep 3, 2025

📊 Test Results

Uh oh!

github-actions bot commented Sep 3, 2025

📊 Test Results

Uh oh!

github-actions bot commented Sep 3, 2025

📊 Test Results

Uh oh!

NISH1001 Sep 4, 2025

Choose a reason for hiding this comment

Uh oh!

NISH1001 left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Sep 4, 2025

📊 Test Results

Uh oh!

github-actions bot commented Sep 4, 2025

📊 Test Results

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jbrry commented Jul 23, 2025 •

edited

Loading