AI Concepts

Build your understanding of artificial intelligence through fundamental concepts and mental models

Browse by Category

Machine Learning
Deep Learning
Natural Language Processing
Computer Vision
Reinforcement Learning
Optimization
Statistics & Probabilistic
Data Engineering
MLOps
AI Safety
Responsible AI
Symbolic AI & Knowledge Systems
Real World AI
Robotics
Multimodal
Autonomous Systems
Memory Models
Cognitive Architectures
Advanced Mathematical
Theoretical Methods
Large Language Models
Generative AI
Scientific ML
Applied Domains

All AI Concepts

27 articles · page 1 of 1

Machine Learning

What is Underfitting?

Underfitting is when a model is too simple to capture the real patterns in the data it learns from. In practice, an underfitting system fails to pick up on meaningful relationships during training, so it performs poorly on both familiar and new data because it never truly learned the task.

Machine Learning

What is Bias Variance Tradeoff?

Bias variance tradeoff is the tension between a model underfitting by being too simple and overfitting by being too complex. Finding the sweet spot means tuning a system so it captures genuine patterns in data without memorizing noise, leading to predictions that generalize well to new situations.

Machine Learning

What is Model Generalization?

Model generalization is a system's ability to perform well on new, unseen data rather than just the data it was trained on. When a model truly generalizes, it has learned underlying patterns instead of memorizing specific examples, allowing it to make accurate predictions in real-world situations it has never directly encountered before.

Machine Learning

What is Leave One Out Validation?

Leave one out validation is a method for testing how well a system learns by holding back one data point at a time for evaluation. Each round trains the system on all remaining data and checks its prediction on the single held-out point, revealing how reliably it generalizes to unseen inputs.

Machine Learning

What is K Fold Validation?

K fold validation is a method for testing how well a system learns by splitting data into equal parts and rotating which part is used for testing. During each round the system trains on most of the data and checks its accuracy on the held-out portion, revealing whether it truly generalizes or just memorizes.

Machine Learning

What is Holdout Validation?

Holdout validation is a method of testing how well a trained system performs by reserving a separate portion of data it has never seen before. During training, the system learns patterns from one chunk of data, then its predictions are checked against the held-back portion to reveal whether it truly generalizes or just memorized its lessons.

Machine Learning

What is Cross Validation?

Cross validation is a technique for testing how well a model performs by checking it against data it was not trained on. During the process, the training data is split into several portions, and the system repeatedly trains on some portions while testing on the held-out portion to detect overfitting.

10
Machine Learning

What is Blending?

Blending is the process of combining two or more distinct concepts, representations, or data sources into a unified output. In practice, an intelligent system merges features from separate inputs, such as text and images, into a single internal representation that captures richer meaning than any one source could provide alone.

Machine Learning

What is Stacking?

Stacking is an ensemble method that combines predictions from multiple different models by training a higher-level model on their outputs. A second learner receives the base models' predictions as its input features and discovers how to best blend them, producing a final answer that is typically more accurate than any single model alone.

Machine Learning

What is Boosting?

Boosting is a method that combines many weak learners into one strong predictor by training each new learner to focus on the mistakes of the previous ones. Each successive model pays extra attention to the cases that were handled poorly before, gradually reducing errors and producing a more accurate overall system.

Machine Learning

What is Bootstrap Aggregation?

Bootstrap aggregation is an ensemble learning technique that trains many versions of the same learner on randomly resampled subsets of the data and combines their outputs. Each model sees a slightly different slice of the training set, so averaging or voting across them makes the overall system more stable and less prone to overfitting.

Machine Learning

What is Machine Learning?

Machine Learning is a way for computers to learn patterns from data instead of following manually written rules. It works by feeding large amounts of data into algorithms that adjust their internal parameters over time, allowing an intelligent system to improve its decisions and responses with experience.

Machine Learning

What is Ensemble Learning?

Ensemble learning is a technique where multiple models are combined to produce a better overall prediction than any single model alone. By aggregating the outputs of diverse learners through voting or averaging, a

Machine Learning

What is Batch Learning?

Batch learning is a training approach where a system learns from an entire collected dataset all at once rather than updating from individual examples. The system processes all available data in one full pass, adjusts its internal parameters based on the complete picture, and then applies what it learned to new situations.

Machine Learning

What is Online Learning?

Online learning is a method where a system updates its knowledge continuously as new data arrives rather than training on a fixed dataset all at once. Each incoming piece of data immediately adjusts the system's internal parameters, allowing it to adapt its decisions and predictions in real time without reprocessing everything from scratch.

Machine Learning

What is Active Learning?

Active learning is a method where the system selects the most informative data points to learn from rather than passively receiving random examples. Instead of labeling everything, it asks a human to label only the cases it finds most uncertain or confusing, so it improves faster with far less labeled data.

Reinforcement Learning

What is Reinforcement Learning?

Reinforcement learning is a type of machine learning where an agent learns to make decisions by receiving rewards or penalties for its actions. Through repeated trial and error, the system gradually discovers which sequences of choices lead to the highest cumulative reward, refining its behavior over time.

Machine Learning

What is Self Supervised Learning?

Self supervised learning is a method where a system teaches itself by finding patterns in raw data without needing humans to label that data first. In practice, the system hides part of its input and then tries to predict the missing piece, gradually building a rich internal understanding it can apply to real tasks.

Machine Learning

What is Semi Supervised Learning?

Semi Supervised Learning is a training approach where a system learns from a small amount of labeled data combined with a large amount of unlabeled data. It works by letting the system discover patterns in the unlabeled data and then refining its predictions using the few labeled examples to guide accurate decision making.

Machine Learning

What is Unsupervised Learning?

Unsupervised Learning is a way for a system to study raw data without being told what the correct answers are. It works by detecting hidden patterns, groupings, or structures in data on its own, letting the system organize information and respond to new inputs based on relationships it discovered independently.

Machine Learning

What is Supervised Learning?

Supervised Learning is a method where an AI system learns from labeled data that pairs inputs with correct answers. In practice, the system studies many such pairs, finds patterns linking inputs to outputs, and uses those patterns to predict correct answers for new, unseen inputs, gradually improving its accuracy.

Cognitive Architectures

What Are Cognitive Architectures?

Cognitive Architectures are structured blueprints that define how an intelligent system organizes its memory, reasoning, and decision-making processes into a unified framework. In practice, they give a system fixed rules for how to store new information, retrieve relevant knowledge, and choose actions step by step so behavior stays consistent and goal-directed.

AI Concepts

What is Artificial Intelligence?

Artificial intelligence refers to computer systems that can learn, reason, and make decisions rather than simply following prewritten instructions. Instead of being told exactly what to do at every step, AI systems improve by analyzing data, recognizing patterns, and adjusting their behavior over time. Most AI today is designed for narrow purposes. It powers tools like search engines, recommendation systems, voice assistants, fraud detection, and image recognition. These systems excel at specific tasks but lack a general understanding. A more ambitious idea, often called artificial general intelligence, would involve machines that can reason across domains the way humans do. That level of AI remains theoretical and has not yet been achieved.

Dave Yasvinski
Computer Vision

What is Computer Vision?

Computer vision is a field of artificial intelligence (AI) that enables computers to interpret and make decisions based on visual data. It seeks to replicate the human ability to see and understand the world through the use of digital images, videos and other visual inputs. By using a range of techniques, such as image processing, machine learning and deep learning, computer vision systems are able to identify objects, track movement, analyze scenes and even recognize facial expressions. The ability to bridge the gap between physical and digital realms has made computer vision a cornerstone of modern AI systems. 

Dave Yasvinski
Large Language Models

What is a Large Language Model?

A large language model is a type of artificial intelligence that employs deep learning to comprehend and produce human language. These models are called “large” because of the vast amount of information they must be trained on to learn how to perform a wide range of language-related tasks, such as responding to queries, translating languages and creating other forms of content. LLMs are sometimes referred to as neural networks (NN) — computer systems that emulate the human brain — because they rely on a network of neuron-like nodes. They are constantly being trained and upgraded to improve their problem-solving abilities and make them as fluid and effective as possible.

Dave Yasvinski
Natural Language Processing

What is Natural Language Processing?

Natural language processing (NLP) is a branch of artificial intelligence (AI)  that enables computers to interpret and respond to human language in  contextually meaningful ways. The process, which combines machine  learning and computational linguistics gives computers and other digital  devices the ability to rapidly process and understand large volumes of  information and provide meaningful responses to queries. NLP teaches computers how to understand and communicate using our language. It improves the way humans and machines interact by enabling digital devices to function much like our brains, with processing programs and microphones taking the place of eyes and ears. It is capable of automating and enhancing many aspects of our increasingly complicated world.

Dave Yasvinski
Deep Learning

What are Neural Networks?

Neural networks are computer systems inspired by the human brain. They are designed to recognize patterns and process data through interconnected layers of nodes or "neurons." Each layer processes input, adjusting weights (importance) and thresholds (activation levels). Over multiple layers, these networks are trained to understand complex relationships and improve their performance in tasks such as image and speech recognition.

Dave Yasvinski