DOC-2026-05 / SECURITY RESEARCH
AI Safety Security Research CVE-2023-32233

Gemini 2.5 Pro — Safety Alignment & Refusal Evolution on CVE-2023-32233

Active Tracking · May 2026 · Niranj-coder / Destawell
// Researcher Profile

Niranj-coder

Security Developer and Vulnerability Researcher affiliated with Destawell. Currently pursuing a degree in Computer Engineering with a focus on ethical hacking, Open Source Intelligence (OSINT), and technical network auditing.

Research specialization: advanced AI safety alignment parameters, analyzing how modern LLM guardrails process complex Linux kernel exploit primitives, kernel memory corruption risks, and edge-case boundary restrictions under public vulnerability models.

Ethical Hacking OSINT Linux Kernel Security LLM Alignment Network Auditing
Introduction

Evaluating the boundaries of modern frontier Large Language Models (LLMs) requires continuous monitoring of how defensive safety filters evolve against known public exploits. This technical case study maps the behavior shifts of Google's Gemini 2.5 Pro when processing structural requests regarding CVE-2023-32233.

CVE-2023-32233 is a prominent Use-After-Free vulnerability within the Linux kernel netfilter/nf_tables subsystem, capable of enabling local privilege escalation. This research explicitly tracks how model responses changed over a multi-week observation cycle, providing a reproducible behavioral dataset.

Responsible Disclosure Notice

This project contains zero functional exploit code, weaponized vectors, or bypass frameworks. It is strictly a benchmarking dataset logging LLM boundary adjustments, guardrail mechanics, and the practical application of refusal alignment policies over time.

Research Focus Areas
01
Contextual Processing Timeline
Evaluating model output variance between early April test runs and mid-May post-patch cycles to identify measurable behavioral drift.
02
Refusal Vector Mapping
Analyzing structural patterns in how the model triggers safe-harbor responses when interacting with legacy Linux kernel subsystem primitives.
03
Defensive Integrity Tracking
Documenting changes in model alignment behavior following rolling server-side updates, providing a longitudinal view of guardrail evolution.

Full dataset, structured testing templates, log history & analytical conclusions

github.com / Destawell / gemini-2.5-pro-nf-tables-red-teaming

View Full Repository