The author demonstrates that local models, specifically Qwen 3.6 27B, can perform end-to-end document redaction when optimized with a higher quantization level and an agentic harness using the PI framework.
- The system uses Qwen 3.6 27B quantized to Q6_K_XL with a 114k token context window, requiring 40GB VRAM.
- A Gradio-based UI allows users to upload documents and provide custom redaction instructions to the agent.
- The backend employs OCR tools like Tesseract and PaddleOCR alongside spaCy for PII identification.
- VLM capabilities are utilized for detecting faces and signatures within scanned documents.
- Testing on emails, partnership agreements, and government policies showed acceptable initial redaction results.
This approach offers a significant time-saving alternative for users needing to perform contextual-aware redaction tasks locally without relying on proprietary cloud APIs.