Mockup for reviewTech-stack demonstration. Not affiliated with Nebius and not the live Builders Network.About this build →
← Library
REPO
ADVANCED

TF Cookbook · Post-Training

Post-training techniques for open-source models on Nebius — RLHF, DPO, instruction tuning. End-to-end working examples.
tokenfactory
aicloud
Read on the original site ↗

About this entry

Post-training techniques for open-source models on Nebius — RLHF, DPO, instruction tuning. End-to-end working examples.