One-click model liberation toolkit for removing refusal behaviors from LLMs via surgical abliteration techniques
OBLITERATUS — LLM Abliteration Toolkit Skill by ara.so — Daily 2026 Skills collection. OBLITERATUS is an open-source toolkit for identifying and surgically removing refusal behaviors from large language models using mechanistic interpretability techniques (abliteration). It locates refusal directions in a model's hidden states via SVD/PCA, projects them out of the weights, and preserves core language capabilities. Ships with a Gradio UI, CLI, Python API, and Colab notebook. Installation # Core install pip install obliteratus # With Gradio UI support pip install "obliteratus[spaces]" # With all optional analysis modules pip install "obliteratus[full]"
don't have the plugin yet? install it then click "run inline in claude" again.