

As a point of reference, I have a 5070 ti oc (300W tdp, suggested PSU 700W according to techpowerup) with a ryzen 7 7700 (65W tdp) and I use a Silverstone SFX 700 W 80+ platinum and it works great. I’ve monitored the GPU wattage and it generally doesn’t go above 200ish in practical usage.
Open webUI connected to ollama can do this. In openwebui, if you edit any one of your responses, it forks the conversation. You can flip between each branch using the arrows below any of your responses. If you click the 3 dot menu and click overview, it opens a graph view that shows the branches of the conversation visually.