prompts gpt-4o-mini

total: 86, pass: 82, fail: 4

    prompts [model=dummy]       run `prompts` test
                                It's the smallest set of commands that parses and executes all
                                the `.pdl` files in the `prompts/` directory.
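
The failures recorded on this page come in two shapes: a `subprocess.CalledProcessError` raised when a `cargo run` command exits with a non-zero status, and a plain `Exception` raised when the chunk count falls outside 12~15. The sketch below is a rough illustration of both, not the project's actual test code. `run_checked` and `check_chunk_count` are hypothetical names, `check=True` is an assumption about how the real `cargo_run` helper calls `subprocess.run`, and a small Python child process stands in for `cargo` so the example runs anywhere.

```python
import subprocess
import sys


def run_checked(args, **kwargs):
    """Hypothetical stand-in for the `cargo_run` helper seen in the tracebacks."""
    # Assumption: with check=True, subprocess.run raises CalledProcessError
    # whenever the child exits with a non-zero status.
    return subprocess.run(args, check=True, **kwargs)


def check_chunk_count(chunks, low=12, high=15):
    """Hypothetical range check that mirrors the 'Expected 12~15 chunks' message."""
    if not low <= chunks <= high:
        raise Exception(f"Expected {low}~{high} chunks, got {chunks}.")


# A child process that exits with status 1 stands in for a failing `cargo run`.
try:
    run_checked([sys.executable, "-c", "import sys; sys.exit(1)"])
    status = 0
except subprocess.CalledProcessError as e:
    status = e.returncode

print(f"exit status: {status}")
```

Under these assumptions, a very short elapsed time next to an error (for example 754 ms or 3,148 ms) would be consistent with an early command failing and aborting the rest of the test.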

Cases

4c3b9e864-linux
 

tested at: 2025-01-31T13:19:57.917499Z (321 days ago)

elapsed time: 79,468 ms

suite

c256e7191-linux
 

tested at: 2025-02-01T13:33:17.370823Z (320 days ago)

elapsed time: 63,567 ms

suite

96a9da6f8-linux
 

tested at: 2025-02-01T14:43:58.869373Z (320 days ago)

elapsed time: 73,262 ms

suite

9bbffcadd-linux
 

tested at: 2025-02-02T09:05:10.457957Z (319 days ago)

elapsed time: 58,743 ms

suite

ed30e8e33-linux
 

tested at: 2025-02-02T14:10:34.499863Z (319 days ago)

elapsed time: 76,422 ms

suite

ea4dd6340-linux
 

tested at: 2025-02-02T16:15:48.629665Z (319 days ago)

elapsed time: 75,896 ms

suite

11bcd4263-linux
 

tested at: 2025-02-03T17:04:26.540585Z (318 days ago)

elapsed time: 81,185 ms

suite

bbc20ccd9-linux
 

tested at: 2025-02-04T15:10:26.917676Z (317 days ago)

elapsed time: 57,938 ms

Error

Command '['cargo', 'run', '--release', '--', 'query', "You're looking at a source code of a command line utility. What does the main function do?"]' returned non-zero exit status 1.
Traceback (most recent call last):
  File "/home/ubuntu/Documents/ci/ragit/tests/tests.py", line 332, in <module>
    test()
  File "/home/ubuntu/Documents/ci/ragit/tests/tests.py", line 291, in <lambda>
    ("prompts gpt-4o-mini", lambda: prompts(test_model="gpt-4o-mini")),
                                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/Documents/ci/ragit/tests/prompts.py", line 46, in prompts
    cargo_run(["query", "You're looking at a source code of a command line utility. What does the main function do?"])
  File "/home/ubuntu/Documents/ci/ragit/tests/utils.py", line 61, in cargo_run
    result = subprocess.run(args, **kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.12/subprocess.py", line 571, in run
    raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['cargo', 'run', '--release', '--', 'query', "You're looking at a source code of a command line utility. What does the main function do?"]' returned non-zero exit status 1.

suite

6b1add466-linux
 

tested at: 2025-02-05T15:29:51.351113Z (316 days ago)

elapsed time: 69,239 ms

suite

c3b2d5dc0-linux
 

tested at: 2025-02-05T16:21:42.336285Z (316 days ago)

elapsed time: 3,148 ms

Error

Command '['cargo', 'run', '--release', '--', 'build']' returned non-zero exit status 101.
Traceback (most recent call last):
  File "/home/ubuntu/Documents/ci/ragit/tests/tests.py", line 332, in <module>
    test()
  File "/home/ubuntu/Documents/ci/ragit/tests/tests.py", line 291, in <lambda>
    ("prompts gpt-4o-mini", lambda: prompts(test_model="gpt-4o-mini")),
                                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/Documents/ci/ragit/tests/prompts.py", line 34, in prompts
    cargo_run(["build"])
  File "/home/ubuntu/Documents/ci/ragit/tests/utils.py", line 61, in cargo_run
    result = subprocess.run(args, **kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.12/subprocess.py", line 571, in run
    raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['cargo', 'run', '--release', '--', 'build']' returned non-zero exit status 101.

suite

5d6414fd8-linux
 

tested at: 2025-02-06T15:55:46.906731Z (315 days ago)

elapsed time: 61,978 ms

suite

b014ba9c9-linux
 

tested at: 2025-02-07T16:11:19.427628Z (314 days ago)

elapsed time: 65,271 ms

suite

936dff1b8-linux
 

tested at: 2025-02-08T07:18:57.662601Z (313 days ago)

elapsed time: 55,347 ms

suite

e049b3670-linux
 

tested at: 2025-02-08T09:14:50.625391Z (313 days ago)

elapsed time: 52,338 ms

suite

e1bd987f2-linux
 

tested at: 2025-02-09T13:52:50.577849Z (312 days ago)

elapsed time: 58,673 ms

suite

119cb72f0-mac
 

tested at: 2025-02-13T11:03:41.308826Z (308 days ago)

elapsed time: 66,780 ms

suite

e38845246-mac
 

tested at: 2025-02-14T10:49:06.096222Z (307 days ago)

elapsed time: 85,525 ms

suite

470ab07da-linux
 

tested at: 2025-02-15T18:21:48.654767Z (306 days ago)

elapsed time: 54,542 ms

suite

c6f1d1ec3-linux
 

tested at: 2025-02-15T19:25:27.035811Z (306 days ago)

elapsed time: 51,634 ms

suite

a282bee7d-mac
 

tested at: 2025-02-16T22:19:46.278551Z (304 days ago)

elapsed time: 59,431 ms

suite

3e76f8b4f-linux
 

tested at: 2025-02-17T01:09:23.586875Z (304 days ago)

elapsed time: 57,468 ms

suite

0bfcdc3e5-linux
 

tested at: 2025-02-17T23:02:16.263362Z (303 days ago)

elapsed time: 53,754 ms

suite

5d15f3ade-linux
 

tested at: 2025-02-19T01:05:38.745266Z (302 days ago)

elapsed time: 69,474 ms

suite

d283fd55c-mac
 

tested at: 2025-02-20T10:48:05.861169Z (301 days ago)

elapsed time: 72,061 ms

suite

7912db142-linux
 

tested at: 2025-02-20T21:08:53.813099Z (300 days ago)

elapsed time: 57,956 ms

suite

319b5e4fe-mac
 

tested at: 2025-02-21T11:22:37.525002Z (300 days ago)

elapsed time: 72,350 ms

suite

632949da6-linux
 

tested at: 2025-02-22T00:04:30.493006Z (299 days ago)

elapsed time: 59,991 ms

suite

5b3ae7fd7-linux
 

tested at: 2025-02-22T13:23:38.813172Z (299 days ago)

elapsed time: 37,905 ms

suite

a53323dbc-windows
 

tested at: 2025-02-23T12:06:32.581533Z (298 days ago)

elapsed time: 51,671 ms

suite

499147737-linux
 

tested at: 2025-02-23T19:52:59.344108Z (298 days ago)

elapsed time: 39,719 ms

suite

6ae9fcff5-linux
 

tested at: 2025-02-24T21:11:27.969515Z (296 days ago)

elapsed time: 48,400 ms

suite

976387664-mac
 

tested at: 2025-02-25T10:21:57.114315Z (296 days ago)

elapsed time: 52,921 ms

suite

8eb57e98b-linux
 

tested at: 2025-02-26T00:11:13.124361Z (295 days ago)

elapsed time: 54,737 ms

suite

8071c3063-linux
 

tested at: 2025-02-26T20:31:03.132225Z (295 days ago)

elapsed time: 38,443 ms

suite

b08604d5b-linux
 

tested at: 2025-02-27T22:17:10.555252Z (293 days ago)

elapsed time: 55,128 ms

suite

5387e6dec-mac
 

tested at: 2025-02-28T11:26:54.429209Z (293 days ago)

elapsed time: 52,040 ms

suite

fec3323ea-linux
 

tested at: 2025-03-01T12:48:39.169037Z (292 days ago)

elapsed time: 62,628 ms

suite

c5b003a21-linux
 

tested at: 2025-03-02T01:39:46.519770Z (291 days ago)

elapsed time: 42,658 ms

suite

7811aeb05-linux
 

tested at: 2025-03-03T19:29:08.807193Z (290 days ago)

elapsed time: 44,218 ms

suite

81df26c11-linux
 

tested at: 2025-03-03T21:49:32.573405Z (289 days ago)

elapsed time: 30,861 ms

suite

85ae32984-linux
 

tested at: 2025-03-04T22:05:09.327920Z (288 days ago)

elapsed time: 39,752 ms

suite

56f0686db-linux
 

tested at: 2025-03-06T00:32:55.162195Z (287 days ago)

elapsed time: 39,132 ms

suite

0d837ef41-linux
 

tested at: 2025-03-07T01:10:52.766008Z (286 days ago)

elapsed time: 38,130 ms

suite

98a44e386-linux
 

tested at: 2025-03-08T13:37:45.309833Z (285 days ago)

elapsed time: 44,685 ms

suite

bf63890c0-linux
 

tested at: 2025-03-09T00:38:56.569526Z (284 days ago)

elapsed time: 42,022 ms

suite

bf63890c0-windows
 

tested at: 2025-03-09T14:13:04.841105Z (284 days ago)

elapsed time: 44,417 ms

suite

97b4dd02c-mac
 

tested at: 2025-03-10T11:01:50.525266Z (283 days ago)

elapsed time: 55,936 ms

suite

0550e1646-linux
 

tested at: 2025-03-10T22:06:31.618023Z (282 days ago)

elapsed time: 31,914 ms

suite

1ad288e5e-mac
 

tested at: 2025-03-12T11:49:54.685992Z (281 days ago)

elapsed time: 45,027 ms

suite

ee72580d7-linux
 

tested at: 2025-03-15T01:23:08.009762Z (278 days ago)

elapsed time: 46,586 ms

suite

ec6d09311-linux
 

tested at: 2025-03-21T01:05:42.480054Z (272 days ago)

elapsed time: 43,356 ms

suite

bac550a12-linux
 

tested at: 2025-03-23T22:32:14.898790Z (269 days ago)

elapsed time: 43,550 ms

suite

585c4f8ba-linux
 

tested at: 2025-03-31T22:46:05.278555Z (261 days ago)

elapsed time: 38,656 ms

suite

239d2df2c-linux
 

tested at: 2025-04-01T22:38:25.777417Z (260 days ago)

elapsed time: 44,803 ms

suite

90d1f221f-mac
 

tested at: 2025-04-02T11:15:28.201699Z (260 days ago)

elapsed time: 50,668 ms

suite

ca1d9b482-mac
 

tested at: 2025-04-03T11:32:16.038744Z (259 days ago)

elapsed time: 57,734 ms

suite

51578f8d5-mac
 

tested at: 2025-04-04T13:59:04.442731Z (258 days ago)

elapsed time: 62,748 ms

suite

376ca3ee4-linux
 

tested at: 2025-04-11T21:47:12.055124Z (250 days ago)

elapsed time: 44,192 ms

suite

04ee6286f-mac
 

tested at: 2025-05-09T11:29:30.226975Z (223 days ago)

elapsed time: 54,046 ms

suite

027a29d16-mac
 

tested at: 2025-05-14T14:26:51.020259Z (218 days ago)

elapsed time: 55,682 ms

suite

adcf25624-linux
 

tested at: 2025-05-15T23:00:24.413178Z (216 days ago)

elapsed time: 48,355 ms

suite

7cebb43a6-linux
 

tested at: 2025-05-17T00:48:25.067654Z (215 days ago)

elapsed time: 47,931 ms

suite

ba3b6b026-linux
 

tested at: 2025-05-18T09:48:01.799303Z (214 days ago)

elapsed time: 44,970 ms

suite

5ba0a12a6-mac
 

tested at: 2025-05-19T10:47:10.063082Z (213 days ago)

elapsed time: 49,523 ms

suite

92df3569e-linux
 

tested at: 2025-05-20T00:59:40.946018Z (212 days ago)

elapsed time: 39,197 ms

suite

3aa665352-linux
 

tested at: 2025-05-23T00:18:04.093493Z (209 days ago)

elapsed time: 94,332 ms

Error

Expected 12~15 chunks, got 1.
Traceback (most recent call last):
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 670, in <module>
    test()
  File "/home/baehyunsol/Documents/ragit/tests/tests.py", line 612, in <lambda>
    ("prompts gpt-4o-mini", lambda: prompts(test_model="gpt-4o-mini")),
  File "/home/baehyunsol/Documents/ragit/tests/prompts.py", line 40, in prompts
    raise Exception(f"Expected 12~15 chunks, got {chunks}.")
Exception: Expected 12~15 chunks, got 1.

suite

16a6c7ac3-linux
 

tested at: 2025-05-24T20:40:39.036733Z (208 days ago)

elapsed time: 44,748 ms

suite

774e5c41d-linux
 

tested at: 2025-05-29T21:31:38.721609Z (202 days ago)

elapsed time: 59,941 ms

suite

3c6b6ebe1-linux
 

tested at: 2025-05-31T05:38:30.224392Z (201 days ago)

elapsed time: 50,451 ms

suite

f0327eaea-linux
 

tested at: 2025-06-01T18:34:31.254192Z (200 days ago)

elapsed time: 52,379 ms

suite

bb69badeb-linux
 

tested at: 2025-06-02T00:08:14.271670Z (199 days ago)

elapsed time: 47,181 ms

suite

29a3bf1cb-linux
 

tested at: 2025-06-03T01:16:18.813157Z (198 days ago)

elapsed time: 49,831 ms

suite

7f979aa5e-mac
 

tested at: 2025-06-05T12:05:40.934810Z (196 days ago)

elapsed time: 56,663 ms

suite

0526d3e20-linux
 

tested at: 2025-06-05T16:46:27.585326Z (196 days ago)

elapsed time: 55,102 ms

suite

0526d3e20-windows
 

tested at: 2025-06-07T08:13:40.365332Z (194 days ago)

elapsed time: 754 ms

Error

Command '['cargo', 'run', '--release', '--no-default-features', '--', 'config', '--set', 'strict_file_reader', 'true']' returned non-zero exit status 1.
Traceback (most recent call last):
  File "C:\Users\baehy\ragit\tests\tests.py", line 720, in <module>
    test()
    ~~~~^^
  File "C:\Users\baehy\ragit\tests\tests.py", line 658, in <lambda>
    ("prompts gpt-4o-mini", lambda: prompts(test_model="gpt-4o-mini")),
                                    ~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\baehy\ragit\tests\prompts.py", line 31, in prompts
    cargo_run(["config", "--set", "strict_file_reader", "true"])
    ~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\baehy\ragit\tests\utils.py", line 87, in cargo_run
    result = subprocess.run(args, **kwargs)
  File "C:\Program Files\WindowsApps\PythonSoftwareFoundation.Python.3.13_3.13.1008.0_x64__qbz5n2kfra8p0\Lib\subprocess.py", line 577, in run
    raise CalledProcessError(retcode, process.args,
                             output=stdout, stderr=stderr)
subprocess.CalledProcessError: Command '['cargo', 'run', '--release', '--no-default-features', '--', 'config', '--set', 'strict_file_reader', 'true']' returned non-zero exit status 1.

suite

869568e0b-linux
 

tested at: 2025-06-09T20:06:53.014494Z (192 days ago)

elapsed time: 48,251 ms

suite

a5cdea9c6-mac
 

tested at: 2025-06-20T11:50:38.004891Z (181 days ago)

elapsed time: 63,005 ms

suite

3e136fdeb-mac
 

tested at: 2025-06-27T11:25:09.159801Z (174 days ago)

elapsed time: 54,345 ms

suite

a24618552-mac
 

tested at: 2025-07-09T12:03:26.241876Z (162 days ago)

elapsed time: 94,020 ms

suite

a0c02bb2e-linux
 

tested at: 2025-07-21T08:03:09.691900Z (150 days ago)

elapsed time: 114,731 ms

suite

a0c02bb2e-mac
 

tested at: 2025-07-21T11:53:02.138116Z (150 days ago)

elapsed time: 65,773 ms

suite

45d75cf09-linux
 

tested at: 2025-07-25T08:20:45.661982Z (146 days ago)

elapsed time: 117,005 ms

suite

ad17ef1a2-linux
 

tested at: 2025-08-01T14:20:52.926510Z (139 days ago)

elapsed time: 104,917 ms

suite

2f93225e4-mac
 

tested at: 2025-08-26T11:51:14.366171Z (114 days ago)

elapsed time: 115,164 ms

suite

ffc0eca96-mac
 

tested at: 2025-09-15T16:35:10.288703Z (94 days ago)

elapsed time: 95,096 ms

suite

adbbbfc43-linux
 

tested at: 2025-09-23T17:41:46.610984Z (86 days ago)

elapsed time: 132,458 ms

suite