Actions: b4rtaz/distributed-llama

Showing runs from all workflows
285 workflow runs

added option --run to launch.py.
main #302: Commit d10699f pushed by b4rtaz
October 14, 2024 18:57 6m 1s main
update README.md.
main #301: Commit 4587f55 pushed by b4rtaz
October 13, 2024 21:48 5m 55s main
socket-test.
main #300: Commit cc39725 pushed by b4rtaz
October 13, 2024 21:09 6m 27s main
socket-test.
main #299: Commit 6b4b3ca pushed by b4rtaz
October 13, 2024 21:03 6m 10s main
socket-test.
main #298: Commit 5bfe0d8 pushed by b4rtaz
October 13, 2024 20:49 5m 57s main
socket-test.
main #297: Commit ef3b53f pushed by b4rtaz
October 13, 2024 20:49 6m 0s main
socket-test.
main #296: Commit 3ea4556 pushed by b4rtaz
October 13, 2024 20:48 6m 3s main
llama 3.2 3b instruct q40 model.
main #295: Commit 140f8a2 pushed by b4rtaz
October 13, 2024 14:21 6m 35s main
llama 3.2 1b instruct q40 model.
main #294: Commit bf2de45 pushed by b4rtaz
October 13, 2024 13:48 5m 47s main
feat: reduction of writeMany/readMany calls. (#118)
main #293: Commit 3353d56 pushed by b4rtaz
August 10, 2024 22:24 5m 34s main
feat: reduction of writeMany/readMany calls.
main #292: Pull request #118 opened by b4rtaz
August 10, 2024 22:18 5m 37s feat/improve-socket-io
update readme.md. (#117)
main #291: Commit 668ea98 pushed by b4rtaz
August 9, 2024 12:36 7m 4s main
update readme.md.
main #290: Pull request #117 opened by b4rtaz
August 9, 2024 12:35 10m 20s feat/llama-3.1-405b-launch
feat: add llama3_1_405b_instruct_q40 to launch.py. (#112)
main #288: Commit 5244daa pushed by b4rtaz
July 31, 2024 14:22 8m 25s main
feat: dev mode.
main #286: Commit ee2c689 pushed by b4rtaz
July 31, 2024 08:57 11m 5s main
feat: improved performance of quantization to q40. (#111)
main #285: Commit f18ac63 pushed by b4rtaz
July 30, 2024 19:28 6m 11s main
feat: improved performance of quantization to q40.
main #284: Pull request #111 opened by b4rtaz
July 30, 2024 19:28 5m 33s feat/q40-speed-up
feat: --max-seq-len argument. (#109)
main #283: Commit 71135e6 pushed by b4rtaz
July 29, 2024 12:22 6m 6s main
feat: --max-seq-len argument.
main #282: Pull request #109 opened by b4rtaz
July 29, 2024 12:14 6m 47s feat/max-seq-len
convert-hf.py, skipping hidden files.
main #281: Commit 57e3807 pushed by b4rtaz
July 28, 2024 19:51 6m 49s main
update readme.md.
main #280: Commit 2339746 pushed by b4rtaz
July 28, 2024 14:25 6m 16s main
feat: fallback implementation for matmulQ40vQ80.
main #279: Commit 755cdf2 pushed by b4rtaz
July 28, 2024 14:12 6m 13s main
fix: set bufferSize.
main #278: Commit 9a729c9 pushed by b4rtaz
July 27, 2024 09:14 5m 34s main
update readme.md.
main #277: Commit dc0e94f pushed by b4rtaz
July 25, 2024 19:49 9m 37s main