16a6c7ac3-linux

total: 63, pass: 58, fail: 5

add error handling to `tests/tests.py`

There was an edge case in the cleanup process of the test harness. When a
test dies, the harness removes the tmp directories and changes directory
back to the root. But sometimes a test case spawns children and dies while
the children are still running. The children keep writing to the tmp
directories, which can make `shutil.rmtree()` fail. So I wrapped the
cleanup code in another `try`/`except` and added the error message to the
result.

The error messages from the cleanup code are not visible on the CI
dashboard, but we can still see them when we open the result file.
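
The cleanup behavior described above can be sketched roughly as follows. This is a minimal illustration, not the actual code in `tests/tests.py`: the names `clean_up`, `run_case`, and the `cleanup_errors` field are hypothetical, and the real harness may structure its result differently. The idea is only that a failure in `shutil.rmtree()` (for example, because an orphaned child is still writing to the directory) gets recorded in the result instead of crashing the harness.

```python
import os
import shutil
import traceback


def clean_up(tmp_dirs, root):
    """Remove tmp directories and return to `root`.

    Hypothetical helper: errors are collected and returned, not raised,
    so that a failing cleanup cannot take down the whole test run.
    """
    errors = []
    for d in tmp_dirs:
        try:
            shutil.rmtree(d)
        except Exception:
            # e.g. a child process is still writing into `d`
            errors.append(traceback.format_exc())
    try:
        os.chdir(root)
    except OSError:
        errors.append(traceback.format_exc())
    return errors


def run_case(name, test, tmp_dirs, root):
    """Run one test case and always attempt cleanup afterwards."""
    result = {"name": name, "pass": True, "error": None, "cleanup_errors": []}
    try:
        test()
    except Exception:
        result["pass"] = False
        result["error"] = traceback.format_exc()
    finally:
        # Cleanup errors are stored next to the test error (assumed layout).
        result["cleanup_errors"] = clean_up(tmp_dirs, root)
    return result
```

Keeping cleanup errors in a separate field means a test's pass/fail status still reflects only the test itself, while the extra messages remain available in the result file.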

commit: 16a6c7ac3c520d2cc0a4e96bd6cfdb4472971a19
platform: Linux-5.15.0-136-generic-x86_64-with-glibc2.35
ragit version: ragit 0.4.0-dev
rustc version: rustc 1.87.0-nightly (78948ac25 2025-03-20)
cargo version: cargo 1.87.0-nightly (6cf826701 2025-03-14)
python version: 3.10.12
tested at: 2025-05-24T20:40:39.036733Z
total elapsed time: 5,605,441 ms

cargo_tests
add_and_rm
add_and_rm2
ignore
recover
clone
clone_empty
pull
server
server_permission
cli
outside
archive
many_chunks
many_jobs
ls
meta
symlink
gh_issue_20
ii
cat_file
generous_file_reader
clean_up_erroneous_chunk
images
markdown_reader
csv_reader
real_repos
real_repos_regression
subdir
tfidf
merge
external_bases
end_to_end dummy
end_to_end llama3.3-70b
audit llama3.3-70b
logs llama3.3-70b
prompts dummy
prompts gpt-4o-mini
prompts gemini-2.0-flash
prompts claude-3.5-sonnet
empty dummy
empty llama3.3-70b
server_chat llama3.3-70b
server_chat gemini-2.0-flash
images2 gpt-4o-mini
images3 gpt-4o-mini
pdl gpt-4o-mini
pdf gpt-4o-mini
svg gpt-4o-mini
web_images gpt-4o-mini
images2 claude-3.5-sonnet
extract_keywords dummy
extract_keywords gpt-4o-mini
orphan_process llama3.3-70b
write_lock llama3.3-70b
ragit_api command-r
query_options llama3.3-70b
query_with_schema llama3.3-70b
models_init
test_home_config_override
config
migrate
migrate2

Cases

cargo_tests

elapsed time: 291,758 ms

history

add_and_rm

elapsed time: 34,533 ms

history

add_and_rm2

elapsed time: 11,640 ms

history

ignore

elapsed time: 3,196 ms

history

recover

elapsed time: 3,099 ms

history

clone

elapsed time: 126,926 ms

history

clone_empty

elapsed time: 5,382 ms

history

pull

elapsed time: 8,979 ms

history

server

elapsed time: 137,829 ms

history

server_permission

elapsed time: 34,286 ms

history

cli

elapsed time: 5,605 ms

history

outside

elapsed time: 3,027 ms

history

meta

elapsed time: 1,717 ms

history

symlink

elapsed time: 3,083 ms

history

gh_issue_20

elapsed time: 2,732 ms

history

ii

elapsed time: 154,964 ms

Error

tfidf result on term 'let bitxor' is not close enough. error: `answer[2] not in approximation`, answer: ['549a87567bd9ec4c3145df3f3db3c7f285e9b1551269c1720000000100000200', 'faae85ed9ecfe34ecc388f777a6adabbde32f4628b65cd820000000100000200', '445d453adb1948c8f9cce13eb8f0974869152c2f1e9e4b95000000010000016e', 'ddd47326a625b60b374ab71c97a87a8e3de207a73288da380000000100000204', '78973af7ecb7137f86b7e126889bdb947505308e35fad8b50000000100000200', '5b252d0f47ac8d0bee53b3c3c0387897869e9834d9e43edf000000010000020f', 'f28f8995ed5aad84ba253b191d1257e234e37f8f1ad572240000000100000202', '74a9821e38482554bd3e78b0f1c474a927848122a54223420000000100000212', '63cb62bb8a33a334f049b70be4cdd592d385215ade59690c0000000100000207', '5d6c16dc1991936cb1b2a3d821c43a8bf648d298a24992a70000000100000200'], approximation: ['549a87567bd9ec4c3145df3f3db3c7f285e9b1551269c1720000000100000200', 'faae85ed9ecfe34ecc388f777a6adabbde32f4628b65cd820000000100000200', '78973af7ecb7137f86b7e126889bdb947505308e35fad8b50000000100000200', '5b252d0f47ac8d0bee53b3c3c0387897869e9834d9e43edf000000010000020f', '74a9821e38482554bd3e78b0f1c474a927848122a54223420000000100000212', '63cb62bb8a33a334f049b70be4cdd592d385215ade59690c0000000100000207', 'd2178deab15f668fd917a5bc4fc39360e3c3d3d87a353a920000000100000200', 'b82a8fb6160d01b1c8a289e0af1b714cf92d89aaced6cbdb0000000100000202', 'b7f52a1abcdae26519257214f1837e3885856589649424170000000100000203', 'fa10425e1445e6bd8b59f7c047762787e114d2a8d4d1ef5b0000000100000200']
Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/ii.py", line 100, in ii_worker
    raise AssertionError(f"answer[{i}] not in approximation")
AssertionError: answer[2] not in approximation

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 672, in <module>
    test()
  File "/home/baehyunsol/Documents/ragit/tests/ii.py", line 36, in ii
    ii_worker()
  File "/home/baehyunsol/Documents/ragit/tests/ii.py", line 116, in ii_worker
    raise AssertionError(f"tfidf result on term '{term}' is not close enough. error: `{e}`, answer: {answer}, approximation: {approximation}")
AssertionError: tfidf result on term 'let bitxor' is not close enough. error: `answer[2] not in approximation`, answer: ['549a87567bd9ec4c3145df3f3db3c7f285e9b1551269c1720000000100000200', 'faae85ed9ecfe34ecc388f777a6adabbde32f4628b65cd820000000100000200', '445d453adb1948c8f9cce13eb8f0974869152c2f1e9e4b95000000010000016e', 'ddd47326a625b60b374ab71c97a87a8e3de207a73288da380000000100000204', '78973af7ecb7137f86b7e126889bdb947505308e35fad8b50000000100000200', '5b252d0f47ac8d0bee53b3c3c0387897869e9834d9e43edf000000010000020f', 'f28f8995ed5aad84ba253b191d1257e234e37f8f1ad572240000000100000202', '74a9821e38482554bd3e78b0f1c474a927848122a54223420000000100000212', '63cb62bb8a33a334f049b70be4cdd592d385215ade59690c0000000100000207', '5d6c16dc1991936cb1b2a3d821c43a8bf648d298a24992a70000000100000200'], approximation: ['549a87567bd9ec4c3145df3f3db3c7f285e9b1551269c1720000000100000200', 'faae85ed9ecfe34ecc388f777a6adabbde32f4628b65cd820000000100000200', '78973af7ecb7137f86b7e126889bdb947505308e35fad8b50000000100000200', '5b252d0f47ac8d0bee53b3c3c0387897869e9834d9e43edf000000010000020f', '74a9821e38482554bd3e78b0f1c474a927848122a54223420000000100000212', '63cb62bb8a33a334f049b70be4cdd592d385215ade59690c0000000100000207', 'd2178deab15f668fd917a5bc4fc39360e3c3d3d87a353a920000000100000200', 'b82a8fb6160d01b1c8a289e0af1b714cf92d89aaced6cbdb0000000100000202', 'b7f52a1abcdae26519257214f1837e3885856589649424170000000100000203', 'fa10425e1445e6bd8b59f7c047762787e114d2a8d4d1ef5b0000000100000200']

history

cat_file

elapsed time: 14,142 ms

history

generous_file_reader

elapsed time: 297,970 ms

history

clean_up_erroneous_chunk

elapsed time: 1,421 ms

history

images

elapsed time: 4,389 ms

history

markdown_reader

elapsed time: 5,193 ms

history

csv_reader

elapsed time: 2,809 ms

history

real_repos

elapsed time: 2,320,500 ms

Error

Command '['cargo', 'run', '--release', '--', 'init']' returned non-zero exit status 101.
Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 672, in <module>
    test()
  File "/home/baehyunsol/Documents/ragit/tests/real_repos.py", line 64, in real_repos
    cargo_run(["init"])
  File "/home/baehyunsol/Documents/ragit/tests/utils.py", line 74, in cargo_run
    result = subprocess.run(args, **kwargs)
  File "/usr/lib/python3.10/subprocess.py", line 526, in run
    raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['cargo', 'run', '--release', '--', 'init']' returned non-zero exit status 101.

history

real_repos_regression

elapsed time: 2,543 ms

Error


Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 672, in <module>
    test()
  File "/home/baehyunsol/Documents/ragit/tests/real_repos_regression.py", line 157, in real_repos_regression
    assert count_files() == (len(reproductions), 1, len(reproductions) - 1)  # (total, staged, processed)
AssertionError

history

subdir

elapsed time: 9,002 ms

history

tfidf

elapsed time: 10,134 ms

history

merge

elapsed time: 14,380 ms

history

external_bases

elapsed time: 70,458 ms

history

end_to_end dummy

elapsed time: 42,427 ms

history

end_to_end llama3.3-70b

elapsed time: 54,228 ms

history

audit llama3.3-70b

elapsed time: 5,978 ms

history

logs llama3.3-70b

elapsed time: 3,843 ms

history

prompts dummy

elapsed time: 4,576 ms

history

prompts gpt-4o-mini

elapsed time: 44,748 ms

history

prompts gemini-2.0-flash

elapsed time: 26,573 ms

history

prompts claude-3.5-sonnet

elapsed time: 61,614 ms

history

empty dummy

elapsed time: 2,797 ms

history

empty llama3.3-70b

elapsed time: 3,791 ms

history

server_chat llama3.3-70b

elapsed time: 21,599 ms

history

server_chat gemini-2.0-flash

elapsed time: 44,996 ms

history

images2 gpt-4o-mini

elapsed time: 6,645 ms

history

images3 gpt-4o-mini

elapsed time: 8,662 ms

history

pdl gpt-4o-mini

elapsed time: 18,365 ms

history

pdf gpt-4o-mini

elapsed time: 142,277 ms

Error


Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 672, in <module>
    test()
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 622, in <lambda>
    ("pdf gpt-4o-mini", lambda: pdf(test_model="gpt-4o-mini")),
  File "/home/baehyunsol/Documents/ragit/tests/pdf.py", line 51, in pdf
    assert any([pdf["name"] in r["source"] for r in search_result])
AssertionError

history

svg gpt-4o-mini

elapsed time: 7,873 ms

Error


Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 672, in <module>
    test()
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 623, in <lambda>
    ("svg gpt-4o-mini", lambda: svg(test_model="gpt-4o-mini")),
  File "/home/baehyunsol/Documents/ragit/tests/svg.py", line 123, in svg
    assert "ragit" in cargo_run(["pdl", "test1.pdl"], stdout=True).lower()
AssertionError

history

web_images gpt-4o-mini

elapsed time: 62,873 ms

history

images2 claude-3.5-sonnet

elapsed time: 9,756 ms

history

extract_keywords dummy

elapsed time: 1,632 ms

history

extract_keywords gpt-4o-mini

elapsed time: 9,060 ms