Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
edg5000
46 days ago
|
parent
|
context
|
favorite
| on:
Show HN: Duplicate 3 layers in a 24B LLM, logical ...
Which types of tasks, in your experience, show negligable improvement when using larger models? And for what types of tasks do you feel even the best models deliver mediocre results?
Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: