Загрузка...
@torch.inference_mode() def generate(self, idx, max_new_tokens, temperature=1.0, top_k=None): for _ in range(max_new_tokens): idx_cond = idx[:, -self.max_seq_len:] logits = self(idx_cond)[:, -1, :] / temperature if top_k is not None: v, _ = torch.topk(logits, min(top_k, logits.size(-1))) logits[logits < v[:, [-1]]] = -float('Inf')
The model takes integer token IDs and passes them through two embedding layers: build a large language model from scratch github
git clone https://github.com/yourusername/llm-from-scratch.git cd llm-from-scratch pip install -r requirements.txt @torch
Building from scratch on consumer hardware requires efficiency techniques: @torch.inference_mode() def generate(self