Command Palette
Search for a command to run...
Wiki
Machine Learning Glossary: Explore definitions and explanations of key AI and ML concepts
Intersection over Union (IOU) is a performance metric used to evaluate the accuracy of annotation, segmentation, and object detection algorithms. It quantifies the overlap between the predicted bounding box or segmented area in the dataset and the ground truth bounding box or annotated area.
Instance segmentation is a computer vision technique that identifies and segments individual objects in an image; unlike semantic segmentation, which groups pixels based on semantic meaning (e.g., road, sky, person), instance segmentation distinguishes between multiple instances of the same object class.
In computer vision, a grayscale image represents a scene or object using a range of grayscale shades rather than a full spectrum. Grayscale images are usually created by converting a full-color image into a single-channel image, where the intensity of each pixel is represented by a single value between 0 (black) and 255 (white).
In machine learning, features are the input variables or attributes used to train a model. These features are used to represent the characteristics or attributes of the data being analyzed and are used by the model to make predictions or classifications.
Frames per second (fps) is a measure of how many still images or frames are displayed in one second of a video or animation.
HITL is an iterative feedback process by which a person (or team) interacts with an algorithmically generated system (e.g., computer vision, machine learning, or artificial intelligence).
In machine learning, hyperparameters are given in advance to control the parameters of the learning process, while the values of other parameters (such as node weights) are obtained through training.
In terms of computer vision, diffusion models can be applied to a variety of tasks including image denoising, inpainting, super-resolution, and image generation.
In the field of deep learning, Ground Truth (commonly used in English, meaning "ground truth" or "benchmark truth" in Chinese, simply understood as the true value) refers to the accurate labels or data used to train and evaluate models.
Image Annotation is the process of tagging or annotating images with metadata, or additional information about the image content.
Human Pose Estimation (HPE) is a task in computer vision that involves detecting and estimating the positions of various body parts in images or videos of people.
An epoch in machine learning means the process of passing the entire training dataset through the neural network once (i.e., performing one forward propagation and one back propagation). For example, if the dataset consists of 1,000 samples and the model is trained using a batch size of 100, it will take 1 […]
False Positive Rate is a measure of the accuracy of a machine learning model in predicting positive outcomes. It is the proportion of instances where the model predicted a positive outcome but the actual outcome was negative.
A class boundary is the dividing line between two adjacent classes or categories in a dataset.
Concept drift refers to the phenomenon that the statistical properties of a data stream change over time, causing the learning model to not match the current data distribution.
Proximal Policy Optimization (PPO) is an algorithm in the field of reinforcement learning that is used to train the decision-making functions of computer agents to complete difficult tasks.
The Confusion Matrix is a performance evaluation tool used in machine learning that summarizes the performance of a classification model by listing the true positive, true negative, false positive, and false negative predictions.
Calibration curves are a useful tool in machine learning and predictive modeling to understand and fine-tune the reliability of a classification model's predicted probabilities.
Edge detection is a fundamental problem in image processing and computer vision. The purpose of edge detection is to identify points in digital images where brightness changes significantly.
In image processing and computer vision, the Laplacian operator has been used for various tasks such as blob detection and edge detection.
Differentiable Programming is a programming paradigm in which digital computer programs can be made fully differentiable via automatic differentiation.
Aspect-level sentiment analysis is a task to detect the sentiment of a specific aspect in a text.
Hallucination refers to the phenomenon that model-generated content is inconsistent with real-world facts or user input.
Foundation Agent is a general agent model that can be generalized in both the virtual world and the real world.
Intersection over Union (IOU) is a performance metric used to evaluate the accuracy of annotation, segmentation, and object detection algorithms. It quantifies the overlap between the predicted bounding box or segmented area in the dataset and the ground truth bounding box or annotated area.
Instance segmentation is a computer vision technique that identifies and segments individual objects in an image; unlike semantic segmentation, which groups pixels based on semantic meaning (e.g., road, sky, person), instance segmentation distinguishes between multiple instances of the same object class.
In computer vision, a grayscale image represents a scene or object using a range of grayscale shades rather than a full spectrum. Grayscale images are usually created by converting a full-color image into a single-channel image, where the intensity of each pixel is represented by a single value between 0 (black) and 255 (white).
In machine learning, features are the input variables or attributes used to train a model. These features are used to represent the characteristics or attributes of the data being analyzed and are used by the model to make predictions or classifications.
Frames per second (fps) is a measure of how many still images or frames are displayed in one second of a video or animation.
HITL is an iterative feedback process by which a person (or team) interacts with an algorithmically generated system (e.g., computer vision, machine learning, or artificial intelligence).
In machine learning, hyperparameters are given in advance to control the parameters of the learning process, while the values of other parameters (such as node weights) are obtained through training.
In terms of computer vision, diffusion models can be applied to a variety of tasks including image denoising, inpainting, super-resolution, and image generation.
In the field of deep learning, Ground Truth (commonly used in English, meaning "ground truth" or "benchmark truth" in Chinese, simply understood as the true value) refers to the accurate labels or data used to train and evaluate models.
Image Annotation is the process of tagging or annotating images with metadata, or additional information about the image content.
Human Pose Estimation (HPE) is a task in computer vision that involves detecting and estimating the positions of various body parts in images or videos of people.
An epoch in machine learning means the process of passing the entire training dataset through the neural network once (i.e., performing one forward propagation and one back propagation). For example, if the dataset consists of 1,000 samples and the model is trained using a batch size of 100, it will take 1 […]
False Positive Rate is a measure of the accuracy of a machine learning model in predicting positive outcomes. It is the proportion of instances where the model predicted a positive outcome but the actual outcome was negative.
A class boundary is the dividing line between two adjacent classes or categories in a dataset.
Concept drift refers to the phenomenon that the statistical properties of a data stream change over time, causing the learning model to not match the current data distribution.
Proximal Policy Optimization (PPO) is an algorithm in the field of reinforcement learning that is used to train the decision-making functions of computer agents to complete difficult tasks.
The Confusion Matrix is a performance evaluation tool used in machine learning that summarizes the performance of a classification model by listing the true positive, true negative, false positive, and false negative predictions.
Calibration curves are a useful tool in machine learning and predictive modeling to understand and fine-tune the reliability of a classification model's predicted probabilities.
Edge detection is a fundamental problem in image processing and computer vision. The purpose of edge detection is to identify points in digital images where brightness changes significantly.
In image processing and computer vision, the Laplacian operator has been used for various tasks such as blob detection and edge detection.
Differentiable Programming is a programming paradigm in which digital computer programs can be made fully differentiable via automatic differentiation.
Aspect-level sentiment analysis is a task to detect the sentiment of a specific aspect in a text.
Hallucination refers to the phenomenon that model-generated content is inconsistent with real-world facts or user input.
Foundation Agent is a general agent model that can be generalized in both the virtual world and the real world.