Commit 4e64901

Author: Anivar A Aravind
Remove claims from code comments - keep purely factual
1 parent b288c06 commit 4e64901

2 files changed: +3 -3 lines changed

language/deepseek-r1/PREPROCESSING.md

Lines changed: 1 addition & 1 deletion
@@ -19,7 +19,7 @@ Let's think about this step by step.<|im_end|>

 ## Answer Extraction
 ```python
-# Remove reasoning, extract final answer
+# Extract final answer
 output = full_output.split('<|/thinking|>')[-1] if '<|/thinking|>' in full_output else full_output
 answer = re.search(r'####\s*(\d+)', output)
 final_answer = answer.group(1) if answer else output.strip()

language/llama3.1-8b/PREPROCESSING.md

Lines changed: 2 additions & 2 deletions
@@ -2,7 +2,7 @@

 ## Tokenization
 ```python
-tokenizer = AutoTokenizer.from_pretrained(model_path)  # Use model's tokenizer
+tokenizer = AutoTokenizer.from_pretrained(model_path)
 max_length = 2048
 ```

@@ -20,6 +20,6 @@ Summarize this article:

 ## Verification
 ```bash
-# Check first token is 128000 (begin_of_text)
+# Verify first token
 python -c "print(tokenizer.encode('<|begin_of_text|>')[0])"
 ```
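
Note on the Verification hunk: the `python -c` one-liner refers to `tokenizer` without creating it, so in a fresh interpreter it raises `NameError`. Below is a minimal sketch of a self-contained first-token check. `first_token_id` and `StubTokenizer` are hypothetical names introduced here for illustration; the stub stands in for the object returned by `AutoTokenizer.from_pretrained(model_path)` so the snippet runs without model files, and 128000 is simply the ID the removed comment gave for `begin_of_text`.

```python
from typing import List


class StubTokenizer:
    """Hypothetical stand-in for a real tokenizer; not a real vocabulary."""

    BEGIN_OF_TEXT = "<|begin_of_text|>"
    BEGIN_OF_TEXT_ID = 128000  # value named in the removed comment

    def encode(self, text: str) -> List[int]:
        ids: List[int] = []
        if text.startswith(self.BEGIN_OF_TEXT):
            ids.append(self.BEGIN_OF_TEXT_ID)
            text = text[len(self.BEGIN_OF_TEXT):]
        # Map the remaining characters to code points, purely for illustration.
        ids.extend(ord(ch) for ch in text)
        return ids


def first_token_id(tokenizer, text: str) -> int:
    """Return the first token ID that `tokenizer.encode` produces for `text`."""
    ids = tokenizer.encode(text)
    if not ids:
        raise ValueError("tokenizer produced no tokens")
    return ids[0]


tokenizer = StubTokenizer()
first_id = first_token_id(tokenizer, "<|begin_of_text|>")
print(first_id)
```

To run the check against the actual model, `StubTokenizer()` would presumably be swapped for `AutoTokenizer.from_pretrained(model_path)` from `transformers`, with `model_path` pointing at the checkpoint.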

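Note on the Answer Extraction hunk in language/deepseek-r1/PREPROCESSING.md: the snippet uses `re` and `full_output` without defining them in the visible lines. The sketch below wraps the same logic in a function so it can be run on its own. The function name `extract_final_answer` and the sample strings are assumptions made for illustration, not part of the commit.

```python
import re

THINKING_END = "<|/thinking|>"  # delimiter that appears in the hunk


def extract_final_answer(full_output: str) -> str:
    """Return the digits after '####', or the stripped post-reasoning text."""
    # str.split returns the whole string as one element when the delimiter
    # is absent, so taking [-1] covers both branches of the original line.
    output = full_output.split(THINKING_END)[-1]
    answer = re.search(r"####\s*(\d+)", output)
    return answer.group(1) if answer else output.strip()


# Hypothetical model outputs, for illustration only.
samples = [
    "Add the two numbers.<|/thinking|>The total is 42.\n#### 42",
    "#### 7",
    "<|/thinking|>  no marker here  ",
]
results = [extract_final_answer(s) for s in samples]
for result in results:
    print(repr(result))
```

The pattern `(\d+)` only captures a run of digits, so an answer such as `#### 3.5` yields `3`, and `#### -4` does not match at all and falls back to the stripped text. Whether that matters depends on the answer format of the dataset, which the hunk does not state.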