Wiki
Machine Learning Glossary: Explore definitions and explanations of key AI and ML concepts
Skills are reusable capability modules that encapsulate knowledge and processes, enabling AI to transform from general-purpose models into specialized intelligent agents.
SoCE is a model optimization paradigm based on an automatic category-aware expert selection mechanism and combined with multiple benchmark tasks.
DePass is used to interpret the Transformer model by decomposing the forward pass.
A file format for storing medical imaging data
iSeal achieves a 100% fingerprint success rate (FSR) against more than 10 attacks on 12 LLMs.
It effectively solves the key challenges in LVLM secure alignment.
A VLM (vision-language model) performs cross-modal understanding, reasoning, and generation by aligning and fusing image and text information.
A VLA (vision-language-action) model generates robot actions directly from visual images and language instructions.
The NSG statistic quantifies the ratio of spatial probability gradient to temporal density change.
Mem-I has achieved significant improvements over existing memory-enhanced agent baselines in multiple benchmark tests.
SSP demonstrates the potential of self-play as a scalable and data-efficient training paradigm for agentic LLMs.
CudaForge is a simple, effective, and low-cost multi-agent workflow for CUDA kernel generation and optimization.
FractalForensics is robust to common image processing operations and fragile to Deepfake manipulations.
ScaleNet is a novel approach that extends pre-trained Vision Transformers (ViTs) through weight sharing.
FlashMoBA makes the theoretically optimal block size practical, achieving up to 14.7x speedup on GPUs.
CoT Hijacking is a novel jailbreak attack in which benign reasoning systematically weakens the model's refusal behavior.
InstanceAssemble enables high-quality and controllable image generation under multimodal conditions.
Layout-to-Image provides a flexible control mechanism for image generation.
HiPO is a framework for adaptive LLM reasoning that combines hybrid data construction with hybrid reinforcement learning.
As a novel semantic-aware framework, it is used to reconstruct 3D models from sparse views.
AEPO focuses on balancing and rationalizing rollout branching and policy updates under the guidance of high-entropy tool calls.
SDAR establishes a new practical language modeling paradigm that unifies the complementary advantages of autoregression and diffusion.
C2C enables direct semantic communication by transforming and fusing key-value (KV) caches between models.
CapRL can effectively train models to generate more general and accurate image descriptions.