


16a6c7ac3-linux

result.json

total: 63, pass: 58, fail: 5

add error handling to `tests/tests.py`

The cleanup process of the test harness had an unhandled edge case. When a
test dies, the harness removes the tmp directories and changes directory
back to root. But sometimes a test case spawns children and dies while the
children are still running. The children keep writing to the tmp
directories, which can make `shutil.rmtree()` fail. So I wrapped the
cleanup code in another `try`/`except` and added the error message to the
result.

The error messages from the cleanup code are not visible on the CI
dashboard, but we can still see them when we open the result file.
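
The cleanup guard described above might look roughly like the sketch below. It is not the actual code in `tests/tests.py`: the function name `clean_up`, the `cleanup_errors` key, and the shape of the result dict are all assumptions made for illustration. The idea is only that a failing `shutil.rmtree()` is caught and recorded instead of aborting the run.

```python
import os
import shutil
import tempfile


def clean_up(tmp_dirs, root_dir, result):
    """Remove tmp directories and go back to root_dir.

    Hypothetical helper: cleanup failures are appended to
    result["cleanup_errors"] instead of being raised.
    """
    for tmp_dir in tmp_dirs:
        try:
            shutil.rmtree(tmp_dir)
        except OSError as e:
            # A child process that outlived the test may still be writing
            # into tmp_dir, so the removal can fail midway.
            result.setdefault("cleanup_errors", []).append(
                f"failed to remove {tmp_dir}: {e!r}"
            )

    try:
        os.chdir(root_dir)
    except OSError as e:
        result.setdefault("cleanup_errors", []).append(
            f"failed to chdir to {root_dir}: {e!r}"
        )

    return result


if __name__ == "__main__":
    existing = tempfile.mkdtemp()
    missing = os.path.join(existing, "does_not_exist")

    # Removing `missing` fails and is recorded; `existing` is still removed.
    print(clean_up([missing, existing], os.getcwd(), {"name": "demo"}))
```

Because the errors only live in the result dict, they end up in the result file without changing the pass/fail status of the test itself.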

TOC

  1. ignore  
  2. recover  
  3. clone  
  4. pull  
  5. server  
  6. cli  
  7. outside  
  8. archive  
  9. ls  
  10. meta  
  11. symlink  
  12. ii  
  13. images  
  14. subdir  
  15. tfidf  
  16. merge  
  17. config  
  18. migrate  

Cases

cargo_tests
 

elapsed time: 291,758 ms

history

add_and_rm
 

elapsed time: 34,533 ms

history

add_and_rm2
 

elapsed time: 11,640 ms

history

ignore
 

elapsed time: 3,196 ms

history

recover
 

elapsed time: 3,099 ms

history

clone
 

elapsed time: 126,926 ms

history

clone_empty
 

elapsed time: 5,382 ms

history

pull
 

elapsed time: 8,979 ms

history

server
 

elapsed time: 137,829 ms

history

server_permission
 

elapsed time: 34,286 ms

history

cli
 

elapsed time: 5,605 ms

history

outside
 

elapsed time: 3,027 ms

history

archive
 

elapsed time: 181,736 ms

history

many_chunks
 

elapsed time: 568,013 ms

history

many_jobs
 

elapsed time: 154,813 ms

history

ls
 

elapsed time: 77,940 ms

history

meta
 

elapsed time: 1,717 ms

history

symlink
 

elapsed time: 3,083 ms

history

gh_issue_20
 

elapsed time: 2,732 ms

history

ii
 

elapsed time: 154,964 ms

Error

tfidf result on term 'let bitxor' is not close enough. error: `answer[2] not in approximation`, answer: ['549a87567bd9ec4c3145df3f3db3c7f285e9b1551269c1720000000100000200', 'faae85ed9ecfe34ecc388f777a6adabbde32f4628b65cd820000000100000200', '445d453adb1948c8f9cce13eb8f0974869152c2f1e9e4b95000000010000016e', 'ddd47326a625b60b374ab71c97a87a8e3de207a73288da380000000100000204', '78973af7ecb7137f86b7e126889bdb947505308e35fad8b50000000100000200', '5b252d0f47ac8d0bee53b3c3c0387897869e9834d9e43edf000000010000020f', 'f28f8995ed5aad84ba253b191d1257e234e37f8f1ad572240000000100000202', '74a9821e38482554bd3e78b0f1c474a927848122a54223420000000100000212', '63cb62bb8a33a334f049b70be4cdd592d385215ade59690c0000000100000207', '5d6c16dc1991936cb1b2a3d821c43a8bf648d298a24992a70000000100000200'], approximation: ['549a87567bd9ec4c3145df3f3db3c7f285e9b1551269c1720000000100000200', 'faae85ed9ecfe34ecc388f777a6adabbde32f4628b65cd820000000100000200', '78973af7ecb7137f86b7e126889bdb947505308e35fad8b50000000100000200', '5b252d0f47ac8d0bee53b3c3c0387897869e9834d9e43edf000000010000020f', '74a9821e38482554bd3e78b0f1c474a927848122a54223420000000100000212', '63cb62bb8a33a334f049b70be4cdd592d385215ade59690c0000000100000207', 'd2178deab15f668fd917a5bc4fc39360e3c3d3d87a353a920000000100000200', 'b82a8fb6160d01b1c8a289e0af1b714cf92d89aaced6cbdb0000000100000202', 'b7f52a1abcdae26519257214f1837e3885856589649424170000000100000203', 'fa10425e1445e6bd8b59f7c047762787e114d2a8d4d1ef5b0000000100000200']
Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/ii.py", line 100, in ii_worker
    raise AssertionError(f"answer[{i}] not in approximation")
AssertionError: answer[2] not in approximation

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 672, in <module>
    test()
  File "/home/baehyunsol/Documents/ragit/tests/ii.py", line 36, in ii
    ii_worker()
  File "/home/baehyunsol/Documents/ragit/tests/ii.py", line 116, in ii_worker
    raise AssertionError(f"tfidf result on term '{term}' is not close enough. error: `{e}`, answer: {answer}, approximation: {approximation}")
AssertionError: tfidf result on term 'let bitxor' is not close enough. error: `answer[2] not in approximation`, answer: ['549a87567bd9ec4c3145df3f3db3c7f285e9b1551269c1720000000100000200', 'faae85ed9ecfe34ecc388f777a6adabbde32f4628b65cd820000000100000200', '445d453adb1948c8f9cce13eb8f0974869152c2f1e9e4b95000000010000016e', 'ddd47326a625b60b374ab71c97a87a8e3de207a73288da380000000100000204', '78973af7ecb7137f86b7e126889bdb947505308e35fad8b50000000100000200', '5b252d0f47ac8d0bee53b3c3c0387897869e9834d9e43edf000000010000020f', 'f28f8995ed5aad84ba253b191d1257e234e37f8f1ad572240000000100000202', '74a9821e38482554bd3e78b0f1c474a927848122a54223420000000100000212', '63cb62bb8a33a334f049b70be4cdd592d385215ade59690c0000000100000207', '5d6c16dc1991936cb1b2a3d821c43a8bf648d298a24992a70000000100000200'], approximation: ['549a87567bd9ec4c3145df3f3db3c7f285e9b1551269c1720000000100000200', 'faae85ed9ecfe34ecc388f777a6adabbde32f4628b65cd820000000100000200', '78973af7ecb7137f86b7e126889bdb947505308e35fad8b50000000100000200', '5b252d0f47ac8d0bee53b3c3c0387897869e9834d9e43edf000000010000020f', '74a9821e38482554bd3e78b0f1c474a927848122a54223420000000100000212', '63cb62bb8a33a334f049b70be4cdd592d385215ade59690c0000000100000207', 'd2178deab15f668fd917a5bc4fc39360e3c3d3d87a353a920000000100000200', 'b82a8fb6160d01b1c8a289e0af1b714cf92d89aaced6cbdb0000000100000202', 'b7f52a1abcdae26519257214f1837e3885856589649424170000000100000203', 'fa10425e1445e6bd8b59f7c047762787e114d2a8d4d1ef5b0000000100000200']

history

cat_file
 

elapsed time: 14,142 ms

history

generous_file_reader
 

elapsed time: 297,970 ms

history

clean_up_erroneous_chunk
 

elapsed time: 1,421 ms

history

images
 

elapsed time: 4,389 ms

history

markdown_reader
 

elapsed time: 5,193 ms

history

csv_reader
 

elapsed time: 2,809 ms

history

real_repos
 

elapsed time: 2,320,500 ms

Error

Command '['cargo', 'run', '--release', '--', 'init']' returned non-zero exit status 101.
Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 672, in <module>
    test()
  File "/home/baehyunsol/Documents/ragit/tests/real_repos.py", line 64, in real_repos
    cargo_run(["init"])
  File "/home/baehyunsol/Documents/ragit/tests/utils.py", line 74, in cargo_run
    result = subprocess.run(args, **kwargs)
  File "/usr/lib/python3.10/subprocess.py", line 526, in run
    raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['cargo', 'run', '--release', '--', 'init']' returned non-zero exit status 101.

history

real_repos_regression
 

elapsed time: 2,543 ms

Error


Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 672, in <module>
    test()
  File "/home/baehyunsol/Documents/ragit/tests/real_repos_regression.py", line 157, in real_repos_regression
    assert count_files() == (len(reproductions), 1, len(reproductions) - 1)  # (total, staged, processed)
AssertionError

history

subdir
 

elapsed time: 9,002 ms

history

tfidf
 

elapsed time: 10,134 ms

history

merge
 

elapsed time: 14,380 ms

history

external_bases
 

elapsed time: 70,458 ms

history

end_to_end dummy
 

elapsed time: 42,427 ms

history

end_to_end llama3.3-70b
 

elapsed time: 54,228 ms

history

audit llama3.3-70b
 

elapsed time: 5,978 ms

history

logs llama3.3-70b
 

elapsed time: 3,843 ms

history

prompts dummy
 

elapsed time: 4,576 ms

history

prompts gpt-4o-mini
 

elapsed time: 44,748 ms

history

prompts gemini-2.0-flash
 

elapsed time: 26,573 ms

history

prompts claude-3.5-sonnet
 

elapsed time: 61,614 ms

history

empty dummy
 

elapsed time: 2,797 ms

history

empty llama3.3-70b
 

elapsed time: 3,791 ms

history

server_chat llama3.3-70b
 

elapsed time: 21,599 ms

history

server_chat gemini-2.0-flash
 

elapsed time: 44,996 ms

history

images2 gpt-4o-mini
 

elapsed time: 6,645 ms

history

images3 gpt-4o-mini
 

elapsed time: 8,662 ms

history

pdl gpt-4o-mini
 

elapsed time: 18,365 ms

history

pdf gpt-4o-mini
 

elapsed time: 142,277 ms

Error


Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 672, in <module>
    test()
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 622, in <lambda>
    ("pdf gpt-4o-mini", lambda: pdf(test_model="gpt-4o-mini")),
  File "/home/baehyunsol/Documents/ragit/tests/pdf.py", line 51, in pdf
    assert any([pdf["name"] in r["source"] for r in search_result])
AssertionError

history

svg gpt-4o-mini
 

elapsed time: 7,873 ms

Error


Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 672, in <module>
    test()
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 623, in <lambda>
    ("svg gpt-4o-mini", lambda: svg(test_model="gpt-4o-mini")),
  File "/home/baehyunsol/Documents/ragit/tests/svg.py", line 123, in svg
    assert "ragit" in cargo_run(["pdl", "test1.pdl"], stdout=True).lower()
AssertionError

history

web_images gpt-4o-mini
 

elapsed time: 62,873 ms

history

images2 claude-3.5-sonnet
 

elapsed time: 9,756 ms

history

extract_keywords dummy
 

elapsed time: 1,632 ms

history

extract_keywords gpt-4o-mini
 

elapsed time: 9,060 ms

history

orphan_process llama3.3-70b
 

elapsed time: 99,776 ms

history

write_lock llama3.3-70b
 

elapsed time: 83,057 ms

history

ragit_api command-r
 

elapsed time: 1,049 ms

history

query_options llama3.3-70b
 

elapsed time: 5,559 ms

history

query_with_schema llama3.3-70b
 

elapsed time: 1,624 ms

history

models_init
 

elapsed time: 283 ms

history

test_home_config_override
 

elapsed time: 134 ms

history

config
 

elapsed time: 32,730 ms

history

migrate
 

elapsed time: 143,731 ms

history

migrate2
 

elapsed time: 43,784 ms

history

