Machine Learning Glossary: Explore definitions and explanations of key AI and ML concepts
The model approximates a Gödel Machine in a coding-agent environment and guides the search expansion through Thompson sampling with adaptive scheduling.
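As background on the sampling strategy mentioned above, here is a minimal Beta-Bernoulli sketch of Thompson sampling — a toy illustration of the general technique, not the paper's adaptive-scheduling variant; the arm names are hypothetical:

```python
import random

def thompson_sample(arms):
    """Pick the arm with the highest draw from its Beta posterior.

    `arms` maps arm name -> (successes, failures). Sampling from the
    posterior naturally balances exploration and exploitation.
    """
    draws = {name: random.betavariate(s + 1, f + 1)
             for name, (s, f) in arms.items()}
    return max(draws, key=draws.get)

# An arm with a strong success record is chosen far more often
# than one with a strong failure record.
arms = {"a": (50, 2), "b": (2, 50)}
picks = [thompson_sample(arms) for _ in range(1000)]
```

Because each choice is a posterior draw rather than a fixed argmax, the weaker arm is still occasionally tried, which is what lets the sampler keep exploring.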
This is the first framework to successfully apply distribution matching distillation to MDM-based text generation, setting a new record in few-step language sequence generation.
MultiPL-MoE is an effective method for extending support to low-resource programming languages in the post-pre-training stage.
The Tongyi Qianwen team systematically studied the role of gating mechanisms in standard softmax attention.
The Lancelot framework incorporates fully homomorphic encryption into BRFL to achieve robust privacy protection.
By jointly aligning global and local features, the method guides adversarial examples toward the target feature distribution, enhancing their transferability.
The receptive field is an important concept for understanding visual information processing and provides a reference for designing, analyzing and optimizing visual models.
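To make the receptive-field concept concrete, here is a generic sketch of the standard recurrence for stacked convolutions (receptive field grows by `(k - 1) * jump` per layer, where `jump` is the product of strides so far); it is not tied to any particular model:

```python
def receptive_field(layers):
    """Receptive field of a stack of conv layers.

    Each layer is (kernel_size, stride). Uses the recurrence
    r_l = r_{l-1} + (k_l - 1) * jump_{l-1}, with jump_l the
    cumulative product of strides up to layer l.
    """
    rf, jump = 1, 1
    for k, s in layers:
        rf += (k - 1) * jump
        jump *= s
    return rf

# Two stacked 3x3 stride-1 convs see a 5x5 region,
# like a single 5x5 conv but with fewer parameters.
rf = receptive_field([(3, 1), (3, 1)])  # → 5
```

This kind of calculation is the "reference for design" the summary alludes to: it tells you how many layers (or what strides) are needed before a unit can see a region of a given size.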
SVG enables faster diffusion training, efficient few-step sampling, and improved generation quality.
RewardMap enhances the capabilities of multimodal large language models in structured vision tasks.
A novel principle-based discriminative constraint optimization framework avoids difficulty bias and training instability.
ReinFlow features a lightweight implementation, built-in exploration capabilities, and broad applicability to various flow-policy variants.
FHE is widely used in scenarios such as cloud computing security, federated learning, medical data analysis, and financial data collaboration.
BRFL is designed to address the Byzantine attack problem that occurs during model aggregation.
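As a minimal sketch of the Byzantine-robust aggregation idea behind BRFL (using coordinate-wise median, one common robust aggregator, as an illustrative choice rather than the framework's actual rule):

```python
import statistics

def coordinate_median(updates):
    """Aggregate client updates by per-coordinate median.

    Unlike the mean, the median cannot be shifted arbitrarily far
    by a minority of malicious (Byzantine) updates.
    `updates` is a list of equal-length parameter vectors.
    """
    return [statistics.median(coord) for coord in zip(*updates)]

honest = [[1.0, 2.0], [1.1, 1.9], [0.9, 2.1]]
byzantine = [[100.0, -100.0]]  # one attacker sends extreme values
agg = coordinate_median(honest + byzantine)
```

Even with the attacker included, `agg` stays close to the honest updates, whereas a plain mean would be dragged toward the outlier.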
EGMN successfully captures latent interaction effects between user preferences and video features.
SAC Flow achieves state-of-the-art performance on continuous control and robotic manipulation benchmarks.
UserBench aims to assess and enhance an agent’s ability to understand, interact with, and adapt to real-world user communication.
PLACER is fast and stochastic, making it straightforward to generate ensembles of predictions and map conformational heterogeneity.
With its significant advantages, RAE is poised to become the new default choice for training diffusion Transformers.
Given the limitations of existing fine-tuning techniques such as GRPO, GVPO has emerged as a reliable and versatile post-training paradigm.
ReCA generalizes across application scenarios and system scales, improving task success rates by 4.3%.
DexFlyWheel is a scalable, self-improving data generation paradigm for dexterous manipulation.
NovaFlow is able to handle rigid, articulated, and deformable objects across different robot embodiments.
TreeSynth demonstrates exceptional robustness and scalability in large-scale data synthesis.
GTA significantly outperforms standard SFT baselines and state-of-the-art RL methods in multiple text classification benchmarks.