Welcome 👋

Hi, this is Azril. I’m documenting my learning notes here, and sometimes I also write longer technical blog posts. My interest lies in AI safety, and I’m currently focusing on mechanistic interpretability.

Reflections on the Intro to ML Safety Course

As ML systems expand in size and capabilities, it’s crucial to prioritize safety research. Like any powerful technology, the responsible development and deployment of ML systems require a thorough understanding of potential risks and a dedication to mitigating them. In this blog post, I’ll share what I’ve learned from the Introduction to ML Safety course offered by the Center for AI Safety. There are four main research areas to mitigate existential risks (X-Risks) from strong AI....

Multi-Armed Bandit Problem and Its Solutions

In probability theory and decision-making under uncertainty, the multi-armed bandit problem presents a challenge where a limited set of resources must be wisely allocated among competing choices to maximize the expected gain. This is a classic reinforcement learning problem that perfectly embodies the exploration vs exploitation dilemma. Imagine we are facing a row of slot machines (also called one-armed bandits). We must make a series of decisions: which arms to play, how many times to play each arm, the order in which to play them, and whether to stick with the current arm or switch to another one....
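To make the exploration vs exploitation trade-off concrete, here is a minimal sketch of one common strategy, epsilon-greedy, on slot machines that pay out 1 with a fixed probability. The function name `run_epsilon_greedy`, the payout probabilities, and the default parameters are all assumptions chosen for illustration; they are not taken from the post itself.

```python
import random


def run_epsilon_greedy(true_probs, steps=5000, epsilon=0.1, seed=0):
    """Play a Bernoulli bandit with an epsilon-greedy strategy.

    true_probs: hypothetical payout probability of each arm (unknown to the player).
    Returns (pull counts per arm, estimated payout per arm, total reward).
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    counts = [0] * n_arms        # how many times each arm was played
    estimates = [0.0] * n_arms   # running average reward of each arm
    total_reward = 0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random arm to improve our estimates.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: play the arm that currently looks best.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = 1 if rng.random() < true_probs[arm] else 0
        counts[arm] += 1
        # Incremental update of the sample mean for the chosen arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return counts, estimates, total_reward


if __name__ == "__main__":
    counts, estimates, total = run_epsilon_greedy([0.2, 0.5, 0.8])
    print("pulls per arm:", counts)
    print("estimated payouts:", [round(e, 3) for e in estimates])
    print("total reward:", total)
```

A small `epsilon` means the player mostly sticks with the arm that looks best so far, while still occasionally switching to check whether another arm is better.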

Key Concepts In (Deep) Reinforcement Learning

Reinforcement Learning (RL) revolves around the interactions between an agent and its environment. The environment represents the world where the agent lives and takes actions. At each step, the agent observes some information about the environment, makes a decision, and affects the environment through its action. The agent also receives rewards from the environment, which indicate how well it is doing. The agent’s ultimate goal is to maximize the total reward it receives, called the return....
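The observe, act, receive-reward loop described above can be sketched in a few lines. This is only an illustration under stated assumptions: `CorridorEnv` is a hypothetical toy environment invented here, its reward values are arbitrary, and the return is computed as a plain undiscounted sum of rewards.

```python
class CorridorEnv:
    """Hypothetical toy environment: walk right along a corridor to reach the goal."""

    def __init__(self, length=5):
        self.length = length
        self.position = 0

    def reset(self):
        """Start a new episode and return the first observation."""
        self.position = 0
        return self.position

    def step(self, action):
        """Apply an action (0 = left, 1 = right); return (observation, reward, done)."""
        move = 1 if action == 1 else -1
        self.position = max(0, self.position + move)
        done = self.position >= self.length
        # Assumed reward scheme: small penalty per step, bonus at the goal.
        reward = 1.0 if done else -0.1
        return self.position, reward, done


def run_episode(env, policy, max_steps=100):
    """Run one episode and return the (undiscounted) return: the sum of rewards."""
    observation = env.reset()
    rewards = []
    for _ in range(max_steps):
        action = policy(observation)                    # agent decides
        observation, reward, done = env.step(action)    # environment responds
        rewards.append(reward)
        if done:
            break
    return sum(rewards)


if __name__ == "__main__":
    always_right = lambda observation: 1
    print("return:", run_episode(CorridorEnv(), always_right))
```

Here the policy is just a function from observation to action; a learning agent would adjust that function over many episodes to increase the return.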