-
🌄 I like training interesting transformers.
-
🌠 The maximum model size I have trained from scratch is 3B, using 128 A100 GPUs. Looking forward to the opportunity to use more GPUs to train larger models in the future!
-
📫 How to reach me: [email protected]
| Tool | All-time | 30d | 7d | Top models | Last seen |
|---|---|---|---|---|---|
| Codex | 11.99B |
5.76B |
1.57B |
gpt-5.5 gpt-5.4 gpt-5.3-codex |
2026-05-29 |
| Claude Code | 404.1M |
0 |
0 |
claude-opus-4-5 claude-opus-4-6 gemini-3-pro-high |
2026-04-23 |




