Search Results

Refine Results
  1. Fine-Tuning Small Language Models for Solution-Oriented Windows Event Log Analysis

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  2. Gaming the Metric, Not the Harm: Certifying Safety Audits against Strategic Platform Manipulation

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  3. Trade-off Functions for DP-SGD with Subsampling based on Random Shuffling: Tight Upper and Lower Bounds

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  4. Profiling for Pennies: Unveiling the Privacy Iceberg of LLM Agents

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  5. ClawGuard: Out-of-Band Detection of LLM Agent Workflow Hijacking via EM Side Channel

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  6. Stateful Agent Backdoor

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  7. Secure Seed-Based Multi-bit Watermarking for Diffusion Models from First Principles

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  8. Safety Anchor: Defending Harmful Fine-tuning via Geometric Bottlenecks

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  9. PragLocker: Protecting Agent Intellectual Property in Untrusted Deployments via Non-Portable Prompts

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  10. Heimdallr: Characterizing and Detecting LLM-Induced Security Risks in GitHub CI Workflows

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  11. CAN ARTIFICIAL INTELLIGENCE PREDICT A TSUNAMI?

    Published in Computer Science (2025)
    Get full text
    Online Article RSS Article
  12. Backdoor Mitigation in Object Detection via Adversarial Fine-Tuning

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  13. ActiveFlowMark: Assessing Tor Anonymity under Active Bandwidth Watermarking

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  14. SkillScope: Toward Fine-Grained Least-Privilege Enforcement for Agent Skills

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  15. LoopTrap: Termination Poisoning Attacks on LLM Agents

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  16. LeakDojo: Decoding the Leakage Threats of RAG Systems

    Published in ArXiv cs.CR Recent Papers (2026)
    Get full text
    Online Article RSS Article
  17. EMO: Pretraining Mixture of Experts for Emergent Modularity

    Published in ArXiv cs.CL Recent Papers (2026)
    Get full text
    Online Article RSS Article
  18. Beyond Negative Rollouts: Positive-Only Policy Optimization with Implicit Negative Gradients

    Published in ArXiv cs.CL Recent Papers (2026)
    Get full text
    Online Article RSS Article
  19. StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

    Published in ArXiv cs.CL Recent Papers (2026)
    Get full text
    Online Article RSS Article
  20. Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

    Published in ArXiv cs.CL Recent Papers (2026)
    Get full text
    Online Article RSS Article