AI Concepts

Build your understanding of artificial intelligence through fundamental concepts and mental models

Browse by Category

Machine Learning
Deep Learning
Natural Language Processing
Computer Vision
Reinforcement Learning
Optimization
Statistics & Probabilistic
Data Engineering
MLOps
AI Safety
Responsible AI
Symbolic AI & Knowledge Systems
Real World AI
Robotics
Multimodal
Autonomous Systems
Memory Models
Cognitive Architectures
Advanced Mathematical
Theoretical Methods
Large Language Models
Generative AI
Scientific ML
Applied Domains

All AI Concepts

70 articles · page 1 of 2

Machine Learning

What Are Evaluation Metrics?

Evaluation metrics are standardized measurements used to judge how well an AI system performs a task, such as how accurate, fair, or fast it is. By scoring outputs against known targets, these metrics guide developers in tuning the system, steering it toward behavior that better matches the desired goals.

Machine Learning

What are Similarity Metrics?

Similarity metrics are simple ways of measuring how alike two pieces of information are, giving a score that shows closeness or distance. By comparing new inputs to stored examples through these scores, an intelligent system can group related items, retrieve relevant matches, and decide what response best fits the situation.

Machine Learning

What Are Loss Functions?

Loss functions are math formulas that measure how wrong a model's predictions are compared to the correct answers. During training, the system adjusts its internal settings to shrink this error score, gradually nudging its behavior toward outputs that better match the desired results.

Machine Learning

What is Hinge Loss?

Hinge loss is a way of measuring how badly a classifier got an answer wrong, penalizing predictions that are correct but not confident enough as well as those that are flat-out mistaken. By pushing the system to keep a clear margin between categories, it nudges the model toward decisions that stay firmly on the right side.

Machine Learning

What is Quantile Regression?

Quantile regression is a statistical method that estimates different points of the range of possible outcomes, such as the lower, middle, or upper values, instead of only the average. By learning these spread-aware predictions, an intelligent system can gauge uncertainty and adjust decisions cautiously when outcomes vary widely.

Machine Learning

What is Ordinal Regression?

Ordinal regression is a method for predicting labels that have a clear order but no fixed numeric distance between them, like ratings from low to high. By learning thresholds along a single scale, the system places each input into a ranked category, guiding decisions that respect the order of outcomes.

Machine Learning

What is Linear Regression?

Linear regression is a simple method that draws a straight line through data points to predict a number from one or more input values. By adjusting the line's slope and position to minimize prediction errors, an intelligent system learns steady relationships in data and uses them to estimate future outcomes.

Machine Learning

What is Logistic Regression?

Logistic regression is a statistical method that estimates the probability of an outcome belonging to one category or another based on input features. By weighing each input and squashing the result into a value between zero and one, it lets an intelligent system decide between choices and act on the more likely option.

Machine Learning

What is Naive Bayes?

Naive Bayes is a simple probability-based method that guesses the most likely category for something by assuming each clue contributes independently. To decide an action, an intelligent system tallies how strongly each feature points to each possible label, then picks the label with the highest combined score.

Machine Learning

What Are Support Vector Machines?

Support vector machines are a type of learning method that sorts data into groups by drawing the clearest possible dividing line between them. To guide an intelligent system, the method picks a boundary that leaves the widest gap between categories, helping it classify new inputs with steadier, more confident decisions.

Machine Learning

What Are Random Forests?

Random forests are a machine learning method that combines many decision trees and averages their answers to make a more reliable prediction. By having each tree vote on the outcome, the system reduces mistakes from any single tree and guides an intelligent agent toward steadier, more trustworthy choices.

Machine Learning

What Are Decision Trees?

Decision trees are a method that makes choices by asking a series of yes-or-no questions, splitting data into branches until it reaches an answer. To guide a system's behavior, each branch narrows down possibilities step by step, letting the machine sort inputs and pick an outcome through clear, rule-based paths.

Machine Learning

What is Survival Analysis?

Survival analysis is a set of statistical methods that study how long it takes for an event to happen and what influences that timing. By estimating the probability that something persists or fails over time, it guides an intelligent system to prioritize actions, flag risks early, and adjust decisions as conditions change.

Machine Learning

What Are Hidden Markov Models?

Hidden Markov models are a way of guessing what is happening behind the scenes when you can only observe clues, by assuming the hidden situation changes step by step in a chainlike fashion. In practice, an intelligent system tracks probabilities over possible hidden states and updates them as new signals arrive, letting it pick the most likely explanation and act on it.

Machine Learning

What Are Kalman Filters?

Kalman filters are math tools that estimate the true state of a moving or changing system by blending noisy sensor readings with predictions from a model. In practice, an intelligent system continually predicts what comes next, compares it to fresh measurements, and adjusts its belief, producing smoother, more reliable decisions over time.

Machine Learning

What is SARIMA?

Seasonal autoregressive integrated moving average is a forecasting method that learns from past values, past errors, and repeating seasonal patterns to predict future numbers. By blending recent trends with recurring cycles, it lets an intelligent system anticipate upcoming demand and adjust its decisions before changes occur.

Machine Learning

What is ARIMA?

Autoregressive integrated moving average is a statistical method that forecasts future values in a time series by learning from past values, trends, and short-term shocks. In practice, an intelligent system uses it to spot patterns over time and adjust its decisions, so its behavior stays aligned with shifting data.

Machine Learning

What Are Autoregressive Models?

Autoregressive models are systems that predict the next piece of data in a sequence by looking at what came before. Step by step, the system generates one element at a time, feeds that output back as input, and continues this loop, letting earlier choices shape every later decision it makes.

Machine Learning

What is Time Series Forecasting?

Time series forecasting is the practice of using past values recorded over time to estimate what comes next. By learning patterns like trends, seasonal cycles, and recent shifts in the data, an intelligent system can anticipate upcoming values and adjust its decisions, alerts, or actions before those changes actually occur.

Machine Learning

What is Isolation Forest?

Isolation forest is a method that spots unusual data points by randomly splitting the data and seeing which points get separated quickly. By building many random trees and measuring how few cuts it takes to isolate a point, the system flags rare items as anomalies and guides smarter decisions.

Machine Learning

What is Anomaly Detection?

Anomaly detection is the process of spotting data points, events, or patterns that differ noticeably from what is considered normal. By learning a baseline of typical behavior, the system flags inputs that fall outside expected ranges, prompting an intelligent system to investigate, alert, or adjust its actions accordingly.

Machine Learning

What Are Gaussian Mixture Models?

Gaussian mixture models are a way of describing data as a blend of several bell-shaped groups, each with its own center and spread. By estimating which groups likely produced each data point, the system can softly sort information into categories, helping it recognize patterns and make smoother, more flexible decisions.

Machine Learning

What is DBSCAN?

Density-based spatial clustering of applications with noise is a method that groups data points packed closely together while marking isolated points as outliers. By scanning each point's neighborhood and linking dense regions, it lets an intelligent system discover natural groupings without needing the number of clusters in advance.

Machine Learning

What is Hierarchical Clustering?

Hierarchical clustering is a way of grouping data by building a tree of nested clusters, where similar items are joined together step by step. By repeatedly merging the closest points or splitting larger groups, an intelligent system organizes information into layers, letting it recognize both fine details and broader patterns.

Machine Learning

What is K Means Clustering?

K means clustering is a method that sorts data points into a chosen number of groups based on how similar they are to each other. By repeatedly assigning points to the nearest group center and updating those centers, an intelligent system can spot patterns and organize information without being told the labels in advance.

Machine Learning

What is Clustering?

Clustering is a way of automatically grouping items together based on how similar they are, without anyone labeling the groups in advance. In practice, an intelligent system measures distances between data points and pulls close ones into the same group, letting it organize messy information and respond differently to each discovered group.

Machine Learning

What is UMAP?

Uniform manifold approximation and projection is a method that squeezes complex, high-dimensional data down to a few dimensions while keeping similar points close together. To shape behavior, it builds a graph of neighbors and then arranges those points on a smaller map, helping the system spot clusters and patterns quickly.

Machine Learning

What is t-SNE?

t-SNE is a technique that takes complicated, high-dimensional data and squeezes it down into a simple two or three dimensional map so patterns become easier to see. By keeping nearby points close and pushing unrelated points apart, it helps an intelligent system group similar items and reveal hidden structure in messy data.

Machine Learning

What is Linear Discriminant Analysis?

Linear discriminant analysis is a method that finds the best straight-line boundaries to separate groups of data by looking at what makes each group distinct. By projecting features onto directions that pull classes apart and squeeze each class together, it helps an intelligent system sort new inputs into the right category.

Machine Learning

What is Independent Component Analysis?

Independent component analysis is a method for pulling apart a mixed signal into separate underlying sources that were blended together. By assuming the hidden sources are statistically independent, the system untangles overlapping inputs so an intelligent agent can focus on meaningful patterns instead of noisy mixtures.

Machine Learning

What is Principal Component Analysis?

Principal component analysis is a method that takes complicated data with many features and finds a smaller set of directions that capture most of its variation. By rotating the data onto these key directions, an intelligent system can focus on the strongest patterns, ignore noise, and make faster, clearer decisions.

Machine Learning

What is Dimensionality Reduction?

Dimensionality reduction is a way of squeezing data with many features down into fewer ones while keeping the important patterns intact. In practice, an intelligent system uses it to strip away noisy or redundant details, so it can learn faster, store less, and make decisions based on the signal that actually matters.

Machine Learning

What is Feature Selection?

Feature selection is the process of picking out the most useful pieces of input data and ignoring the rest so a learning system can focus on what truly matters. By trimming away noisy or redundant signals, this technique helps an intelligent system make faster, clearer decisions and avoid being misled by irrelevant details.

Machine Learning

What is Feature Engineering?

Feature engineering is the process of transforming raw data into meaningful inputs, called features, that help a machine learning model learn patterns more effectively. By reshaping, combining, or highlighting parts of the data, it guides an intelligent system toward signals that matter, sharpening the decisions it makes.

Machine Learning

What is Bayesian Optimization?

Bayesian optimization is a smart search method for tuning choices when each test is slow or costly, balancing what looks best so far with what still feels uncertain. By building a running guess of how settings affect results, it steers an intelligent system toward better decisions while spending as few trials as possible.

Machine Learning

What is Random Search?

Random search is a simple method for finding good solutions by trying many options chosen at random and keeping the best one. In practice, an intelligent system samples possible answers without any guiding pattern, checks how well each performs, and gradually settles on the strongest candidate it has seen.

Machine Learning

What is Grid Search?

Grid search is a method for tuning a model by trying every combination of chosen settings from a predefined list and picking the one that performs best. In practice, it lays out a structured table of options, tests each pairing on the data, and uses the winning configuration to guide how the system makes future decisions.

Machine Learning

What is Hyperparameter Optimization?

Hyperparameter optimization is the process of searching for the best settings that control how a learning algorithm trains, such as learning rate or tree depth. In practice, the system tries many configurations, measures how well each one performs, and gradually steers toward choices that make the model more accurate and reliable.

Machine Learning

What is Elastic Net?

Elastic net is a regression technique that blends two penalties to keep a model both simple and stable when many input features are involved. By balancing these penalties during training, it nudges an intelligent system to shrink unhelpful weights and group related features, producing steadier predictions on messy data.

Machine Learning

What is L2 Regularization?

L2 regularization is a technique that discourages a model from relying too heavily on any single feature by adding a penalty based on the squared size of its weights. During training, this penalty nudges the system toward smaller, more balanced weights, which helps it generalize better and avoid overfitting noisy patterns.

Machine Learning

What is L1 Regularization?

L1 regularization is a technique that discourages a model from relying on too many input features by penalizing the total size of its learned weights. During training, this penalty pushes less useful weights all the way to zero, leaving the system with a simpler, sparser set of signals to base its decisions on.

Machine Learning

What is Regularization?

Regularization is a technique that discourages a learning model from becoming too complex or fitting training data too closely. By adding a small penalty for overly large or intricate parameters during training, it nudges the system toward simpler patterns that generalize better to new, unseen situations.

Machine Learning

What is Overfitting?

Overfitting is when a model learns the training data too closely, including its noise and quirks, instead of the general patterns. In practice, the system memorizes specific details rather than broader rules, so it performs well on familiar data but fails badly when faced with new inputs.

Machine Learning

What is Underfitting?

Underfitting is when a model is too simple to capture the real patterns in the data it learns from. In practice, an underfitting system fails to pick up on meaningful relationships during training, so it performs poorly on both familiar and new data because it never truly learned the task.

Machine Learning

What is Bias Variance Tradeoff?

Bias variance tradeoff is the tension between a model underfitting by being too simple and overfitting by being too complex. Finding the sweet spot means tuning a system so it captures genuine patterns in data without memorizing noise, leading to predictions that generalize well to new situations.

Machine Learning

What is Model Generalization?

Model generalization is a system's ability to perform well on new, unseen data rather than just the data it was trained on. When a model truly generalizes, it has learned underlying patterns instead of memorizing specific examples, allowing it to make accurate predictions in real-world situations it has never directly encountered before.

Machine Learning

What is Leave One Out Validation?

Leave one out validation is a method for testing how well a system learns by holding back one data point at a time for evaluation. Each round trains the system on all remaining data and checks its prediction on the single held-out point, revealing how reliably it generalizes to unseen inputs.

Machine Learning

What is K Fold Validation?

K fold validation is a method for testing how well a system learns by splitting data into equal parts and rotating which part is used for testing. During each round the system trains on most of the data and checks its accuracy on the held-out portion, revealing whether it truly generalizes or just memorizes.

Machine Learning

What is Holdout Validation?

Holdout validation is a method of testing how well a trained system performs by reserving a separate portion of data it has never seen before. During training, the system learns patterns from one chunk of data, then its predictions are checked against the held-back portion to reveal whether it truly generalizes or just memorized its lessons.

Machine Learning

What is Cross Validation?

Cross validation is a technique for testing how well a model performs by checking it against data it was not trained on. During the process, the training data is split into several portions, and the system repeatedly trains on some portions while testing on the held-out portion to detect overfitting.

10

AI concepts — AI-readable summary

What is the AI Concepts library?

The AI Concepts library is a categorized collection of educational articles about how AI systems actually work — model architectures, training, evaluation, risks, and policy — written for non-research practitioners.

Are CVAI AI Concept articles AI-generated?

AI Concept articles are AI-assisted: drafts are generated and then human-reviewed and edited before publication. Each article is tagged with the model used and the editorial review date.