Glossary - AI Terminology

Bagging

An ensemble learning technique that combines multiple weak models by training them on different random subsets of data and averaging their predictions.

Types of Bagging

Bootstrap Aggregation - Uses bootstrapped samples to reduce variance.
Random Subspace Method - Uses different feature subsets for training.

Example

Used in Random Forests to improve decision tree performance.

Bandit Algorithms

Algorithms used for decision-making under uncertainty, balancing exploration and exploitation.

Types of Bandit Algorithms

Multi-Armed Bandit - Selects the best action from multiple options.
Contextual Bandit - Uses contextual information for decision-making.

Example

Used in online advertising and recommendation systems.

Batch Normalization

A technique used to stabilize and accelerate training in deep neural networks by normalizing activations.

Types of Batch Normalization

Full Batch Normalization - Normalizes across the entire dataset.
Mini-Batch Normalization - Normalizes within each batch during training.

Example

Used in convolutional neural networks (CNNs) to improve training stability.

Bayesian Optimization

A probabilistic approach to optimizing expensive-to-evaluate functions using Gaussian processes.

Types of Bayesian Optimization

Expected Improvement (EI) - Selects points that maximize expected improvement.
Upper Confidence Bound (UCB) - Balances exploration and exploitation.

Example

Used in hyperparameter tuning for machine learning models.

Bellman Equation

A recursive formula used in reinforcement learning to compute the optimal value function.

Types of Bellman Equations

Deterministic Bellman Equation - Assumes known state transitions.
Stochastic Bellman Equation - Handles uncertainty in transitions.

Example

Used in Q-learning and policy iteration methods.

Bias-Variance Tradeoff

A fundamental concept in machine learning that describes the tradeoff between bias (error due to oversimplification) and variance (error due to sensitivity to small fluctuations in training data).

Types of Bias-Variance Tradeoff

High Bias - Underfitting due to overly simplistic models.
High Variance - Overfitting due to excessive model complexity.

Example

Seen in polynomial regression where low-degree models underfit and high-degree models overfit.

Binary Cross-Entropy

A loss function used for binary classification problems that measures the dissimilarity between predicted and actual class labels.

Types of Binary Cross-Entropy

Log Loss - Computes log-based penalty for incorrect predictions.
Sigmoid Cross-Entropy - Uses a sigmoid activation function.

Example

Used in logistic regression and neural networks.

Boosting

An ensemble learning technique that improves weak models by sequentially adjusting their weights.

Types of Boosting

AdaBoost - Focuses on misclassified instances.
Gradient Boosting - Uses gradient descent to optimize model weights.

Example

Used in XGBoost and LightGBM.

Backpropagation

An optimization algorithm used in training neural networks to adjust weights by computing gradients of the loss function.

Types of Backpropagation

Stochastic Backpropagation - Uses one sample per iteration.
Batch Backpropagation - Computes gradients using the full dataset.

Example

Used in deep learning frameworks like TensorFlow and PyTorch.

Bag-of-Words (BoW)

A text representation technique that converts text into a collection of word frequencies, ignoring grammar and word order.

Types of Bag-of-Words

Unigram BoW - Counts individual words.
N-gram BoW - Considers sequences of words.

Example

Used in spam detection and sentiment analysis.

Bayesian Classifier

A probabilistic classification model based on Bayes' Theorem.

Types of Bayesian Classifiers

Naïve Bayes - Assumes feature independence.
Bayesian Networks - Captures dependencies between features.

Example

Used in spam filtering and text classification.

Beam Search

A search algorithm that expands the most promising nodes to find the optimal path.

Types of Beam Search

Fixed Beam Search - Uses a constant number of candidate paths.
Adaptive Beam Search - Adjusts beam width dynamically.

Example

Used in natural language processing for text generation.

Beta Distribution

A probability distribution used in Bayesian statistics to model uncertainty.

Types of Beta Distribution

Symmetric Beta - Equal shape parameters.
Skewed Beta - Unequal shape parameters.

Example

Used in A/B testing for conversion rate analysis.

Bidirectional LSTM

A variant of LSTMs that processes input sequences in both forward and backward directions.

Types of Bidirectional LSTM

Standard BiLSTM - Uses separate forward and backward layers.
Attention-Based BiLSTM - Enhances focus on key information.

Example

Used in speech recognition and machine translation.

Binary Classification

A classification problem where there are only two possible output labels.

Types of Binary Classification

Hard Binary Classification - Definite labels (e.g., spam vs. not spam).
Soft Binary Classification - Probability-based predictions.

Example

Used in fraud detection and sentiment analysis.

Binomial Distribution

A discrete probability distribution that models the number of successes in a fixed number of independent trials.

Types of Binomial Distribution

Bernoulli Distribution - Special case with one trial.
General Binomial - Multiple trials with constant probability.

Example

Used in predictive modeling for success/failure outcomes.

Bitwise Operations

Operations that manipulate binary representations of numbers at the bit level.

Types of Bitwise Operations

AND - Returns 1 if both bits are 1.
OR - Returns 1 if at least one bit is 1.

Example

Used in cryptographic hashing and compression algorithms.

Black Box Model

A machine learning model whose internal workings are not interpretable.

Types of Black Box Models

Neural Networks - Complex structures with hidden layers.
Random Forests - Multiple decision trees that obscure interpretability.

Example

Used in deep learning applications like facial recognition.

Boltzmann Machine

A stochastic neural network that learns probability distributions over data.

Types of Boltzmann Machines

Restricted Boltzmann Machine (RBM) - Has only two layers.
Deep Boltzmann Machine - Multiple layers for deep learning.

Example

Used in collaborative filtering for recommendation systems.

Bootstrap Sampling

A resampling technique that generates multiple datasets by sampling with replacement.

Types of Bootstrap Sampling

Parametric Bootstrap - Assumes a known distribution.
Non-Parametric Bootstrap - Makes no distributional assumptions.

Example

Used in estimating confidence intervals in statistics.

Bottleneck Layer

A neural network layer with fewer neurons than the previous and next layers, used to reduce dimensionality.

Types of Bottleneck Layers

Linear Bottleneck - Uses linear transformations for dimensionality reduction.
Non-Linear Bottleneck - Uses activation functions for non-linearity.

Example

Used in autoencoders for feature compression.

Boundary Detection

A technique used in computer vision to identify object edges in images.

Types of Boundary Detection

Sobel Filter - Uses gradients to detect edges.
Canny Edge Detector - Uses multi-stage processing for accuracy.

Example

Used in self-driving cars for road segmentation.

Box Plot

A statistical visualization used to display the distribution, central tendency, and variability of data.

Types of Box Plots

Standard Box Plot - Displays median, quartiles, and outliers.
Notched Box Plot - Includes confidence intervals for comparison.

Example

Used in data preprocessing to detect outliers.

Brute Force Search

A problem-solving technique that checks all possible solutions to find the best one.

Types of Brute Force Search

Exhaustive Search - Tests all possible combinations.
Heuristic Search - Uses rules to reduce search space.

Example

Used in password cracking algorithms.

Bucketization

A data preprocessing technique that divides continuous values into discrete buckets.

Types of Bucketization

Fixed-Width Buckets - Buckets have equal size.
Quantile-Based Buckets - Buckets contain equal proportions of data.

Example

Used in credit scoring models.

Bayesian Neural Networks

Neural networks that incorporate Bayesian inference for uncertainty estimation.

Types of Bayesian Neural Networks

Variational Bayesian Networks - Uses variational inference.
Monte Carlo Dropout - Uses dropout layers for uncertainty estimation.

Example

Used in medical diagnostics where confidence in predictions is critical.

Biased Dataset

A dataset that does not accurately represent the real-world population, leading to skewed model predictions.

Types of Data Bias

Selection Bias - Certain groups are over/underrepresented.
Measurement Bias - Data collection method skews results.

Example

Seen in facial recognition models trained on limited demographics.

BIC (Bayesian Information Criterion)

A model selection criterion that penalizes complex models to prevent overfitting.

Types of Information Criteria

AIC (Akaike Information Criterion) - Focuses on goodness of fit.
BIC - Stronger penalty for model complexity.

Example

Used in choosing the best regression model.

Big Data

A term referring to massive datasets that require advanced processing techniques.

Types of Big Data

Structured Data - Organized and stored in databases.
Unstructured Data - Text, images, and videos without predefined format.

Example

Used in social media analytics.

Bias Correction

A technique used to adjust models trained on biased data to improve fairness and accuracy.

Types of Bias Correction

Reweighting - Adjusts sample importance.
Adversarial Debiasing - Uses adversarial training to mitigate bias.

Example

Used in hiring models to ensure fairness.

Bias-Variance Tradeoff

A fundamental tradeoff in machine learning between bias (error due to overly simple models) and variance (error due to overly complex models).

Types of Bias-Variance Tradeoff

High Bias - Underfitting, leads to poor accuracy.
High Variance - Overfitting, fails on new data.

Example

Observed in polynomial regression models.

Binning

A data preprocessing technique that groups continuous values into discrete bins to simplify analysis.

Types of Binning

Equal-Width Binning - Bins have equal size.
Equal-Frequency Binning - Bins contain equal numbers of observations.

Example

Used in discretizing age groups in datasets.

Bio-Inspired Computing

Machine learning techniques inspired by biological systems such as neural networks and genetic algorithms.

Types of Bio-Inspired Computing

Artificial Neural Networks - Modeled after the human brain.
Genetic Algorithms - Inspired by evolution and natural selection.

Example

Used in robotics for adaptive learning.

Bit Error Rate (BER)

A metric that measures the number of erroneous bits received in a data transmission.

Types of BER

Raw BER - Before error correction.
Net BER - After error correction.

Example

Used in evaluating communication systems in IoT devices.

Bloom Filter

A space-efficient probabilistic data structure used for membership queries.

Types of Bloom Filters

Standard Bloom Filter - Uses multiple hash functions.
Counting Bloom Filter - Supports element deletion.

Example

Used in web browsers for safe browsing.

Bootstrap Aggregating (Bagging)

An ensemble learning technique that improves accuracy by training multiple models on different subsets of the data.

Types of Bagging

Standard Bagging - Uses random subsets of data.
Pasting - Uses different subsets without replacement.

Example

Used in random forests.

Brownian Motion

A random process used in time-series forecasting and financial modeling.

Types of Brownian Motion

Standard Brownian Motion - Random movement with no drift.
Geometric Brownian Motion - Includes drift for financial applications.

Example

Used in stock price prediction models.

Bayesian Optimization

An optimization technique that uses probabilistic models to find the best solution with fewer evaluations.

Types of Bayesian Optimization

Gaussian Process-Based - Uses Gaussian priors for optimization.
Tree-Structured Parzen Estimator (TPE) - Uses non-parametric models.

Example

Used in hyperparameter tuning for deep learning models.

Backpropagation

A supervised learning algorithm used to train neural networks by minimizing error through gradient descent.

Types of Backpropagation

Standard Backpropagation - Uses full dataset.
Stochastic Backpropagation - Updates weights after each sample.

Example

Used in deep learning for image recognition.

Bayesian Regression

A regression technique that incorporates probability distributions to model uncertainty in predictions.

Types of Bayesian Regression

Bayesian Linear Regression - Assumes normally distributed errors.
Bayesian Ridge Regression - Includes regularization to reduce overfitting.

Example

Used in time-series forecasting.

Machine Learning (ML)

ML is a subset of AI that enables machines to learn patterns from data and make predictions or decisions without explicit programming.

Types of ML

Supervised Learning
Unsupervised Learning
Reinforcement Learning

Example

Spam detection in emails using classification models.

Deep Learning (DL)

DL is a subset of ML that uses artificial neural networks to process complex data and perform high-level computations.

Example

Image recognition in self-driving cars.

Generative AI (Gen AI)

Gen AI refers to AI models that generate new content, including text, images, and code, using trained knowledge bases.

Example

AI models like ChatGPT and Stable Diffusion that generate text and images.