
Pull requests: pytorch/torchtitan


Pull requests list

[WIP] Adding OBELICS DataLoader CLA Signed This label is managed by the Meta Open Source bot.
#663 opened Oct 30, 2024 by TJ-Solergibert
[not for land] torch.compile individual linears CLA Signed
#661 opened Oct 29, 2024 by vkuzo
empty_cache before barrier CLA Signed
#660 opened Oct 29, 2024 by carmocca
Fix PP clip_grad_norm CLA Signed
#649 opened Oct 24, 2024 by zijian-hu
Use enable_gqa in place of repeat_kv CLA Signed
#641 opened Oct 22, 2024 by awgu Draft
single-gpu generation for integration testing CLA Signed
#640 opened Oct 22, 2024 by jaysonfrancis Draft
Add script to convert pickled Llama weights to DCP CLA Signed
#634 opened Oct 19, 2024 by rlrs
Init weights only if not loading a checkpoint CLA Signed
#628 opened Oct 18, 2024 by carmocca Draft
[DO NOT REVIEW] gaps to enable FDSP2 cpu offloading CLA Signed
#622 opened Oct 16, 2024 by weifengpy
[Not for land] Settings to make Llama3-8B on 8 GPUs faster CLA Signed
#615 opened Oct 14, 2024 by awgu Draft
[not for land] TE experiments, take 2 CLA Signed
#614 opened Oct 14, 2024 by vkuzo
[DO NOT REVIEW] --experimental.fsdp_sharding_on_largest_dim CLA Signed
#607 opened Oct 9, 2024 by weifengpy
ensure reproducible determinsitc numerics CLA Signed
#597 opened Oct 2, 2024 by weifengpy
fix mixed precision for replicate / pure DDP CLA Signed
#591 opened Sep 29, 2024 by 152334H
[not for land yet] hack max and abs out of ops eligible for AC CLA Signed
#580 opened Sep 17, 2024 by vkuzo
add pp validation for schedule CLA Signed
#568 opened Sep 5, 2024 by H-Huang
3d with fp8 in test runner CLA Signed
#564 opened Aug 29, 2024 by H-Huang Draft
[WIP] zero bubble CLA Signed
#546 opened Aug 20, 2024 by H-Huang Draft
[DO NOT REVIEW] Runtime estimation with FakeTensor + TorchDispatchMode CLA Signed
#536 opened Aug 20, 2024 by weifengpy
[Not for land] Added changes for GPT-2 perf CLA Signed
#533 opened Aug 19, 2024 by awgu Draft
[Not for land] Added GPT-2-like config CLA Signed
#532 opened Aug 19, 2024 by awgu Draft
[Not for land] GaLore example CLA Signed
#488 opened Jul 29, 2024 by awgu Draft
[torchtitan][debug] integrated CommDebugMode into TorchTitan CLA Signed
#480 opened Jul 24, 2024 by sinhaanshul
[not for land] TE experiments CLA Signed
#477 opened Jul 23, 2024 by vkuzo
[Do not review] Activation offloading CLA Signed
#467 opened Jul 18, 2024 by awgu Draft