Monte Carlo methods

Tag

seed

published
12 Apr 2024
modified
28 Oct 2024
duration
1 min read (94 words)
source
llms.txt

tree search.

a search algorithm based on random sampling of the search space.

Selection: Start at root $R$ and select successive child nodes until a leaf $L$ is reached.
- The root is the current game state, and a leaf is any node with a potential child from which no simulation has yet been run.
Expansion: Unless $L$ ends the game decisively for either player, create one (or more) child nodes and choose node $C$ from among them.
Simulation: Complete one random playout from node $C$.
Backpropagation: Use the result of the playout to update information in the nodes on the path from $C$ to $R$.
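
The four phases can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not a reference implementation: the game (a Nim variant where players alternately take 1 to 3 stones and whoever takes the last stone wins), the `Node` class, the `rollout` and `mcts` functions, and the UCB1 selection rule with exploration constant `c=1.4` are all choices made here for the example; the note itself does not specify a selection rule or a game.

```python
import math
import random


class Node:
    """One game state in the search tree (hypothetical helper for this sketch)."""

    def __init__(self, stones, parent=None, move=None):
        self.stones = stones      # stones left; the player to move takes 1-3
        self.parent = parent
        self.move = move          # move that led from the parent to this node
        self.children = []
        self.untried = [m for m in (1, 2, 3) if m <= stones]
        self.visits = 0
        self.wins = 0.0           # wins for the player who moved INTO this node

    def ucb1(self, c=1.4):
        # Assumed selection rule (UCB1): average reward plus exploration bonus.
        exploit = self.wins / self.visits
        explore = c * math.sqrt(math.log(self.parent.visits) / self.visits)
        return exploit + explore


def rollout(stones, rng):
    """Random playout; True if the player to move at `stones` takes the last stone."""
    moves = 0
    while stones > 0:
        stones -= rng.randint(1, min(3, stones))
        moves += 1
    # An odd number of moves means the first mover made the final (winning) move.
    return moves % 2 == 1


def mcts(root_stones, iterations=2000, seed=0):
    rng = random.Random(seed)
    root = Node(root_stones)
    for _ in range(iterations):
        node = root
        # Selection: descend from R while the node is fully expanded and non-terminal.
        while not node.untried and node.children:
            node = max(node.children, key=Node.ucb1)
        # Expansion: unless L is terminal, create one child C.
        if node.untried:
            move = node.untried.pop(rng.randrange(len(node.untried)))
            child = Node(node.stones - move, parent=node, move=move)
            node.children.append(child)
            node = child
        # Simulation: one random playout from C.
        to_move_wins = rollout(node.stones, rng)
        # Backpropagation: update every node on the path from C back to R,
        # flipping the result at each level because the players alternate.
        result = 0.0 if to_move_wins else 1.0
        while node is not None:
            node.visits += 1
            node.wins += result
            result = 1.0 - result
            node = node.parent
    # Recommend the most-visited move at the root.
    return max(root.children, key=lambda n: n.visits).move


if __name__ == "__main__":
    print(mcts(5))
```

In this game the winning strategy is to leave the opponent a multiple of 4 stones, so from 5 stones the search should settle on taking 1. Picking the most-visited child at the end, rather than the one with the highest win rate, is a common design choice because visit counts are less noisy than averages over few playouts.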

simulations
