Friday, December 29, 2017

Monte Carlo Techniques: Modeling Stock Price Action




A random walk is a stochastic process that describes a path as a succession of random steps.  To understand the basics of random walks, we will consider the case where we start off at position $y = 0$ and take a step in the vertical direction if we flip a heads on a coin toss, and one step down if we throw a tails.  The animated image in Fig 1 shows this process for four different trials.  Any given path will, of course, be random, but repeating this process numerous times allows us to calculate statistics on the collection of paths and draw some conclusions.  For example with a large number of simulations, we could estimate the probability of ending up at a given point after a specified number of steps.  This is a useful technique when it is inconvenient or impossible to work out such details by hand, and in the age of fast computers is being used more and more.

Figure 1:  A random walk along the integers.  Four different walks are shown with the position at each step being shown on the Y-axis.


Stock returns-- the percent change (we'll use day-to-day changes)-- are assumed to be normally distributed.  The assumption is there an upward drift that depends on the risk-free interest rates at the time, plus some sort of volatility term.  This volatility term is the piece that is assumed to be normally distributed.  As an aside, we can assume any type of distribution we'd like.  That's an advantage of Monte Carlo techniques--  we can do these calculations quickly with a variety of assumptions just by tweaking a bit of code.  But for this article, we'll assume a normal distribution which is not far from what is actually observed looking at historical stock data.  We will implement a random walk to simulate stock behavior using Python 2.7 along with the Numpy and Scipy libraries.

Expressed as an equation, we have,
\begin{equation} \frac{S_{i+1} - S_i}{S_i} = r\Delta t + \sqrt{\Delta t}\sigma \epsilon_i. \label{daily_returns} \end{equation}

Here, $S_i$ is the stock price on the $i$th day, $r$ is the annualized risk-free ratem that has been scaled down to whatever time step we're using, $\Delta t$ is the time step (in our case it will be one day), $\sigma$ will be the volatility that has also been scaled down to the proper time step,  $\epsilon$ is a random number sampled from the standard normal distribution (mean = 0 and standard deviation = 1). Numpy provides a built-in function that does this sampling for us, np.random.normal

While we could rearrange Eq. \ref{daily_returns} as,
\begin{equation} S_{i+1} = S_i \left(r\Delta t + \sqrt{\Delta t}\sigma \epsilon_i\right) + S_i, \label{next_day} \end{equation}
and loop over the desired number of days, we will eventually want to reuse this code for more complicated problems where we might need to do millions of simulations.  Thus, speed is of the essence.  Therefore, we'll write a vectorized implementation instead of one with loops.  At first glance, the math invoked in doing so seems a bit abstract, but it just amounts to solving a bunch of algebraic equations. 

We will introduce a bit of notation just to keep things neater and set the right-hand side of Eq. \ref{daily_returns} to
$$r\Delta t + \sqrt{\Delta t}\sigma \epsilon_i = \Lambda_i.$$

If we do a bit of algebra and write out the first few iterations of Eq.  \ref{next_day} , we see
that
\begin{equation}\begin{array}{lcl}S_0                     & = & S_0 \\S_1 - S_0 \Lambda_1 - S_0 & = &  0\\\S_2 - S_1 \Lambda_2 - S_1 & = & 0 \\S_3 - S_2 \Lambda_3 - S_2 & = & 0 \\& \vdots & \\S_i - _{i-} \Lambda_i - S_{i-1} & = & 0 .\\\end{array}\label{unrolled}\end{equation}
We can rewrite Eq. \ref{unrolled} in matrix form as,
\begin{equation} \left( \begin{matrix} 1 & 0 & 0 & 0 & \cdots & 0 \\ \Lambda_1 + 1 & -1 & 0 & 0 & \cdots & 0 \\ 0 & \Lambda_2 + 1 & -1 & 0 & \cdots & 0 \\ 0 & 0 & \Lambda_3 + 1 & -1 & \cdots & 0 \\ \vdots & \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & 0 & \Lambda_5 + 1 & -1 \end{matrix} \right) \left( \begin{array}{c} S_0 \\ S_1 \\ S_2 \\ S_3 \\ \vdots \\ S_i \\ \end{array} \right) = \left( \begin{array}{c} S_0 \\ 0 \\ 0 \\ 0 \\ \vdots \\ 0 \\ \end{array} \right). \label{simple_matrix} \end{equation}

So now we've reduced our entire problem to solving one equation.  The solution to Eq. \ref{simple_matrix} amounts to finding the inverse of a matrix.  Notice also that almost all of the entries in this matrix are zeros.  Only the entries along the main diagonal and the one immediately below are non-zero.  This is known as a sparse matrix.

Sparse matrices have several advantages.  First, since most of the entries are zero, we can store only the non-zero values in memory.  In our case, we have finite values only on the main diagonal and diagonal immediately below.  For our specific problem then, we have $2N_{\mbox{days}} + 1$ numbers to story in memory as opposed to $(N_{\mbox{days}}+1)^2$ if every entry have to be accounted for.  The second advantage is we can use sparse matrix algorithms to calculate the inverse.  This speeds up the computation immensely.

Many numerical methods libraries have have these sparse matrix algorithms included in the package and Python is no expectation with the scipy.sparse and scipy.sparse.linalg packages containing what we need.

A couple of notes and assumptions before we begin to code this:

  1.   We'll ignore the effect of weekends.  We won't try to account for the weekend by bumping up volatility for the simulated Mondays or try to account for it in some other way.
  2.  We will take the number of days per year as 252, the number of trading days rather than 365, the number of calendar days.  It is a bit ambiguous as which number to use and in practice should probably be determined by testing each against actual market data.  Since we are ignoring weekends, however, I thought it would be clearer to use the number of trading days.
  3.   We will state without justification that to scale the volatility from yearly to daily we use the formula, $\sigma_{\mbox{day}} = \sigma_{\mbox{year}} / \sqrt{252}$.
  4.  We will assume the volatility $\sigma$ remains constant over our time period.  In reality, this is not true.
  5.  For the most part, determining the values of the parameters that go into the model is not a problem, but deciding what to use for volatility is not immediately obvious.  Since we are assuming returns are normally distributed, we could look at historical price data and calculate the standard deviation of returns and use that value.  This is referred to as historical volatility.  In practice however,  it is best to use the implied volatility derived from option prices.  This is the market's estimate of what volatility will be in the time period until the expiration of the option contract.  This number is given by your trading platform or can be calculated directly from the option prices.  I have an article on how to do this calculation here.

Let's begin to translate this into Python code.  We will need the numpy and scipy libraries mentioned above as well as some basic math functions.  We will also load in the matplotlib library to plot the results.

import numpy as np
from math import sqrt
import matplotlib.pyplot as plt
import scipy.sparse
import scipy.sparse.linalg

We'll set the random seed to a constant for debugging purposes.  This makes the random number generator use the same numbers each time the program is run, which will allow one to discern whether odd numerical values are caused by fluctuating random numbers or a bug in the code.  We will also set the parameters of our simulation:  the initial stock price, the risk-free rate, time-step , and volatility.

np.random.seed(0)               #  Set a constant random seed for debugging
                                #  purposes

N_days = 30                     #  Number of days to simulate

S0 = 100.0                      #  Initial stock price
r = 0.01 / 252.0                #  Risk-free rate (daily percent)
dt = 1.0                        #  Time step (days)
sigma = 0.15 / sqrt(252.0)      #  Volatility (daily percent)

Next, we create a vector of normally distributed numbers and calculate the daily percent change of the stock.

#  Create a vector of normally distributed entries with each entry
#  corresponding to  one day
epsilon = np.random.normal( size = (N_days) )

ds_s = r * dt + sigma * sqrt(dt) * epsilon  #  Vector of daily percent
                                            #  change in the stock price

Next, we set up our sparse matrix.  The list $d$ will contain our two vectors which will form the diagonals, and the list $k$ will designate which of these vector goes along which diagonal.  Then we create the matrix $M$ with the sparse.diags command.  The option format = 'csc' tells the code to use compressed sparse column format to represent the matrix internally and is not really important for the purpose of this article.

#  We need to build the diagonals of our sparse matrix
#  All of the main diagonal is equal to -1 expect the first entry which is
#  equal to one
ones = -np.ones( (N_days + 1) ); ones[0] = 1.;

#  Define our two diagonals,  the lower and main
d = [ds_s + 1, ones]

#  the K vector tells Python which of the vectors defined in 'd' go in
#  which diagonal.  Zero corresponds to the main, and -1 to the diag
#  immediately below.
K = [-1, 0]

#  Define the sparse matrix
M = scipy.sparse.diags(d, K, format = 'csc')

Lastly, we'll define the vector $p$ which corresponds to the right-hand side of Eq. \ref{simple_matrix}.  Then we will solve for our vector of stock prices, $s$.

#  Define a column vector off all zeros expect for the first entry which is
#  our initial stock price.
p = np.zeros( (N_days + 1, 1) )
p[0] = S0

#  Solve the system  M * s = p for the vector s
s = scipy.sparse.linalg.spsolve(M, p)

Now that we have the solution, let's visualize the results.

#  Plot the results
t = np.arange(0, N_days + 1)
plt.plot(t, s, 'k')
plt.xlabel('Time (days)')
plt.ylabel('Stock Price ($)')
plt.grid(True)
plt.show()

The code above should produces the plot shown in Fig. 1.
Figure 1:  The results of a single simulation using the parameters noted above.
In and of itself, the above isn't very useful.  It returns only one random walk simulated run of a stock.  To get something useful we would need to do a large number of such simulations.  Again, we could loop over the above code to do so but we will do this in a fully vectorized way by "stacking" up our vectors in Eq. \ref{simple_matrix}.   An explicit example might be clearer.  Let's adopt the notation where $S_{i,j}$ is the price on the $i$th day in the $j$th simulation.  The if we have two simulations for five days, the column vectors in \ref{simple_matrix} would be,
\begin{equation} \left( \begin{array}{c} s_0 \\ s_{1,1} \\ s_{2,1} \\ s_{3,1} \\ s_{4,1} \\ s_{5,1} \\ s_0 \\ s_{1,2} \\ s_{2,2} \\ s_{3,2} \\ s_{4,2} \\ s_{5,2} \\ \end{array} \right), \mbox{ and} \left( \begin{array}{c} s_0 \\ 0 \\ 0 \\ 0 \\ 0 \\ 0 \\ s_0 \\ 0 \\ 0 \\ 0 \\ 0 \\ 0 \\ \end{array} \right).\end{equation}
The matrix is constructed similarly, stacking a matrix for each day along the diagonal.  In the example above, this would be,
$$\left( \begin{array}{rrrrrrrrrrrr} 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ \Lambda_{1,1} + 1 & -1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & \Lambda_{2,1} + 1 & -1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & \Lambda_{3,1} + 1 & -1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & \Lambda_{4,1} + 1 & -1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & \Lambda_{5,1} + 1 & -1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & \Lambda_{1,2} + 1 & -1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & \Lambda_{2,2} + 1 & -1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & \Lambda_{3,2} + 1 & -1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & \Lambda_{4,2} + 1 & -1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & \Lambda_{5,2} + 1 & -1 \end{array} \right).$$

Since we are going to want to use this code in other models, we will encapsulate it all in a function.   The code below constructs the vectors and matrix for multiple simulations according to the user's input.

def stock_monte_carlo(init_price, N_days, N_sims, r, sigma):

    #  Scale interest rates and volatility.  Define time-step.
    r = r / 252.
    dt = 1.0
    sigma = sigma / sqrt(252.0)

    #  Calculate vector of normally distributed numbers and use it to
    #  calculate the daily percent change.
    epsilon = np.random.normal( size = (N_sims * N_days + N_sims - 1) )
    ds_s = r * dt + sigma * sqrt(dt) * epsilon

    #  Step up matrix diagonals
    ones = -np.ones( (N_sims * N_days + N_sims) )
    ones[0:-1:N_days+1] = 1.

    ds_s[N_days:N_days * N_sims + N_sims:N_days+1] = -1
    d = [ds_s + 1, ones]
    K = [-1, 0]

    #  Solve the system of equations
    M = scipy.sparse.diags(d, K, format = 'csc')
    p = np.zeros( (N_sims * N_days + N_sims, 1) )
    p[0:-1:N_days+1] = init_price
    s = scipy.sparse.linalg.spsolve(M, p)

    #  Reshape the column vector so the function returns a matrix where
    #  each row is a single simulation with each day corresponding the the
    #  columns
    return np.reshape(s, (N_sims, N_days+1))

Let's make use of this and generate a plot of all the simulated runs.


num_sims = 1000
N = 30

r = 0.01
sigma = 0.15
S0 = 100.0

s = stock_monte_carlo(S0, N, num_sims,  r, sigma)

t = np.arange(0, N + 1)
for i in range(num_sims):
    plt.plot(t, s[i,:], 'k', alpha = 0.05)

plt.xlabel('Time (days)')
plt.ylabel('Stock Price ($)')
plt.grid(True)
plt.show()

The above code should produce the following plot.  Since we're plotting 1000 different runs, we  make each plot transparent with an alpha of 0.05 to avoid the whole thing being a giant mess.  The results are what we'd intuitively expect --  the most frequent outcome is the stock drifting slightly higher or lower than the initial price.  The outlier moves, either substantially up or down, are rather rare.
Figure 2:  A plot of 1000 different stock simulations.
We can use a histogram to visualize the price on any given day:


plt.hist( s[:,-1], bins = 50, edgecolor = 'k', linewidth = 0.5 )
plt.xlabel('Stock Price ($)')
plt.ylabel('Count' )
plt.show()
Figure 4:  Histogram showing price distribution on day 30 for 1000 simulations.
The histogram confirms what we'd expect by looking at Fig. 3, the bulk of the prices end up a bit above or below our initial stock price of $100.   If we calculate the mean and median price on day 30, we find they are 100.20 and 100.04, respectively.

We've constructed a simple Monte Carlo simulation for the price action of a stock and written a function that outputs the values for all runs.  The next article in this series will show how to use the above code to calculate the probability of making 50% (or any specified percentage) of maximum profit on a short options position.

Articles in this series

Part I:  Monte Carlo Techniques: Estimating the Value of Pi with Random Numbers
Part III:  Monte Carlo Techniques: Calculating the Probability of Making 50% of Max Profit on Short Option Positions
Part IV:  Monte Carlo Techniques:  Estimating Correlations between Option Positions and  and Their Underlying Stock

Sunday, December 17, 2017

MATLAB GUI Programming: Dynamically Sizing Text Uicontrols

A number of years ago, I was putting together a battery simulation tool written in MATLAB.  Cycling parameters could vary depending on the mode (charge, discharge, rest, or loop) and which of several parameters were to be held constant such as current, power, voltage, or some function of those.  Because of the different possibilities, text labels that described which value should go in a given entry box would change.

Maybe a simple example will show what I mean.  Here is a quick script to produce a window with a popup menu allowing the user to select one of three questions.  The code then writes the question as a text uicontrol and places an edit uicontrol next to it.  One question is long containing many characcters, the other is of medium length, and the last is short.


function static_labels
clear all; close all; clc

f = figure();

%  List of question that will appear to the left of an edit uicontrol.
q = {'How much wood could a woodchuck chuck if a woodchuck could chuck wood? ', ...
     'What is the airspeed velocity of an unladen swallow? ', ...
     'What is the meaning of the universe? '};

%  Create the popup uicontrol and make its entries out list of questions
%  The menu's callback function changes the text in the text uicontrol
%  defined below
pulldown_menu = uicontrol('Parent', f, ...
 'Style', 'popupmenu', ...
 'String', {'Question 1', 'Question 2', 'Question 3'}, ...
 'Callback', @change_question, ...
 'Position', [10, 400, 100, 20]);

%  Create a string variable str which contains the question corresponding
%  to which value is currently set in the popup menu
switch pulldown_menu.Value
 case 1
  str = q{1};
 case 2
  str = q{2};
 case 3
  str = q{3};
end

%  Create a text label whose string is set to str.
label = uicontrol('Parent', f, ...
 'Style', 'text', ...
 'String', str, ...
 'HorizontalAlignment', 'left', ...
 'Position', [10, 300, 400, 17]);

%  Create an entry widget positioned so that it is just to the right of the
%  longest question.  Its position is static not changing if the question
%  changes.
entry = uicontrol('Parent', f, ...
 'Style', 'edit', ...
 'Position', [420, 300, 50, 20]);

%%%%%%%%%%%%%%%%%%%%%%%%%%%
%%%  Callback function  %%%
%%%%%%%%%%%%%%%%%%%%%%%%%%%
function change_question(source, events)
 %  Get the selected field of the pulldown menu
 N = source.Value;

 %  Assign text depending on which value was chosen
 switch N
  case 1
   str = q{1};
  case 2
   str = q{2};
  case 3
   str = q{3};
 end

 %  Adjust the string of the text uicontrol
 label.String = str;

end

end

Figure 1 shows the pulldown menu.  Question 1, the longest, is selected and an edit uicontrol is just to the right of the text.

Figure 1:  Pulldown menu selects question displayed below.

We can see what happens when the question is changed.  This is shown below in Figs 2a and 2b.  The edit uicontrol stays in the same place while the length of the text changes.  Depending on the project and complexity of the interface being built, this may lead to a less than aesthetic-looking design.


Figure 2a:  Entry edit box next to a long block of text


Figure 2b:  Entry widget is in the same place, though
the length of text in the question is shorter


In my real-world case, there were perhaps 15 different possible configurations, and since this was developed over a period of several months, I handled them manually.  But I always found this unsatisfying.

Unfortunately, MATLAB itself doesn't have a way to estimate the size of a uicontrol based on the text contained.  We need first to get the handle to the Java object using the utility findjobj.   Then, the workaround for the sizing issue is to use the getPreferredSize method.  This method essentially gives hints to the layout manager as to the best size to make the widget

So I added a function,  calculate_widget_positions, that gets the preferred size of the text uicontrol, and returns this number along with the position of the edit control calculated from the text size plus some amount of padding.  This function is shown below.


function [width_label, x_entry] = calculate_widget_positions()

        %  Get the text uicontrol suggested width
        width_label = jlabel.getPreferredSize().getWidth();

        %  Add in some padding.  The edit control starts this amount after
        %  the text control ends
        padding = 15;

        %  Horizontal starting position of the edit uicontrol
        x_entry = label.Position(1) + width_label + padding;
end

The callback function is then modified to adjust the position of the edit uicontrol after setting the text.


function change_question(source, events)
 %  Get the selected field of the pulldown menu
 N = source.Value;

 %  Assign text depending on which value was chosen
 switch N
  case 1
   str = q{1};
  case 2
   str = q{2};
  case 3
   str = q{3};
 end

 %  Adjust the string of the text uicontrol
 label.String = str;

 drawnow;
 [width_label, x_entry] = calculate_widget_positions()
 p = label.Position;
 p(3) = width_label;
 label.Position = p;

 p = entry.Position;
 p(1) = x_entry;
 entry.Position = p;

end

Note also the use of the drawnow function after changing the string of the text control.  This needs to be here so that MATLAB is forced to draw the widget. We can then get its new size for positioning the entry box.   Perhaps if I were a little more clever in coding this, we could avoid this step, but this is more a proof-of-concept thing than something I am about to drop into production code.  Now, for example when the third question is selected, the resulting window looks like Fig. 3 below.

Figure 3.  The position of the edit entry box is determined from the size of the preceding text.

So though MATLAB doesn't naively have a way to address this issue, we can work around it by fiddling under the hood with the Java methods that the uicontrols themselves use.

For the sake of completeness, here is the full code including the dynamic positioning.


function dynamic_labels
clear all; close all; clc

f = figure();

%  List of question that will appear to the left of an edit uicontrol.
q = {'How much wood could a woodchuck chuck if a woodchuck could chuck wood? ', ...
     'What is the airspeed velocity of an unladen swallow? ', ...
     'What is the meaning of the universe? '};

%  Create the popup uicontrol and make its entries out list of questions
%  The menu's callback function changes the text in the text uicontrol
%  defined below
pulldown_menu = uicontrol('Parent', f, ...
 'Style', 'popupmenu', ...
 'String', {'Question 1', 'Question 2', 'Question 3'}, ...
 'Callback', @change_question, ...
 'Position', [10, 400, 100, 20]);

%  Create a string variable str which contains the question corresponding
%  to which value is currently set in the popup menu
switch pulldown_menu.Value
 case 1
  str = q{1};
 case 2
  str = q{2};
 case 3
  str = q{3};
end

%  Create a text label whose string is set to str.
label = uicontrol('Parent', f, ...
 'Style', 'text', ...
 'String', str, ...
 'HorizontalAlignment', 'left', ...
 'Visible', 'on', ...
 'Position', [10, 300, 100, 17]);

%  Get the handle to the Java object
jlabel = findjobj(label);

%  Create an entry widget positioned so that it is just to the right of the
entry = uicontrol('Parent', f, ...
 'Style', 'edit', ...
 'Visible', 'on', ...
 'Position', [420, 300, 50, 20]);

%  Set the initial position of the edit uicontrol
[width_label, x_entry] = calculate_widget_positions();
p = label.Position;
p(3) = width_label;
label.Position = p;

p = entry.Position;
p(1) = x_entry;
entry.Position = p;

%%%%%%%%%%%%%%%%%%%%%%%%%%%
%%%  Callback function  %%%
%%%%%%%%%%%%%%%%%%%%%%%%%%%
function change_question(source, events)
 %  Get the selected field of the pulldown menu
 N = source.Value;

 %  Assign text depending on which value was chosen
 switch N
  case 1
   str = q{1};
  case 2
   str = q{2};
  case 3
   str = q{3};
 end

 %  Adjust the string of the text uicontrol
 label.String = str;

 drawnow;
 [width_label, x_entry] = calculate_widget_positions()
 p = label.Position;
 p(3) = width_label;
 label.Position = p;

 p = entry.Position;
 p(1) = x_entry;
 entry.Position = p;

end

function [width_label, x_entry] = calculate_widget_positions()
 %  Get the text uicntol suggested width
 width_label = jlabel.getPreferredSize().getWidth();

 %  Add in some padding.  The edit control starts this amount after
 %  the text control ends
 padding = 15;

 %  Horizontal starting position of the edit uicontrol
 x_entry = label.Position(1) + width_label + padding;
end

end

Monday, December 11, 2017

Monte Carlo Techniques: Estimating the Value of Pi with Random Numbers

I want to write a series of articles on using Monte Carlo methods in computational finance.  Monte Carlo algorithms are numerical techniques that use repeated sampling of random numbers to perform the desired calculation.  Such techniques are becoming increasingly frequent even down to the level of the individual trader with some brokerages using Monte Carlo simulations to estimate the probability of reaching a given percentage of maximum profit on short-premium options trades.

While that is not as complicated as one might think, I wanted to start out with a very simple article first.  In this article, we will write a simple Python script to estimate the value of $\pi$. In a follow-up post, we will introduce the concept of a random walk and use it to model a stock's price action. In the last article of this series, we will show how to predict the probability of reaching a given percentage of profit on an option trade.

So how can we use random numbers to estimate a constant like $\pi$?  We start with a unit circle circumscribed by a square.  Since the circle has a radius of one, the side of the square must have a length of two.
Figure 1:  A unit circle centered at the origin circumscribed by a square.
For the sake of coding convinces, we'll only look at the upper right quadrant as shown below in Fig 2.  This is because we'll use the built in rand function in numpy, which selects random values between zero and one. If we restrict ourselves in this way, we won't have to rescale this range to between -1 and 1.  Rescaling is no big deal, but if we can simply avoid doing so, we will take that opportunity.
Figure 2:  We will only consider the upper left quadrant.
The square shaded above in light gray now has sides $L = 1$ and an area of $L^2 = 1$.  The quadrant in the shaded region has an area of one-quarter the original, in other words, $\pi R^2 / 4$.  If we throw a large number of darts that hit randomly in that upper right quadrant,  we would expect the ratio of the number that hit within the circle to the total number of darts thrown to approach the ratio of the circle's area to that of the square.  In other words,
$$\frac{N_{inside}}{N_{total}} \approx \frac{A_{circle}}{A_{square}} = \frac{ \frac{1}{4} \pi R^2}{1}.$$

Fig. 3 shows an animation of this process.  For the sake of clarity, we only include 25 points in this animation whereas the actual calculation we will use significantly more.
Figure 3:  25 XY-pairs drawn at random.  For large numbers of pairs, the ratio of those within the circle to the total number of points should converge to the ration of the circle's area to that of the square.
Finding if a given point is within the circle is straightforward.  Since the circle has a radius of 1, the point is inside if
$$\sqrt{ x^2 + y^2} \leq 1^2$$.
The code below implements this calculation using the Python's Numpy library.  It should print "pi =  3.140936" to the screen.


# Python 2.7 script to estimate the value of pi using Monte Carlo techniques
import numpy as np

#  Use a constant random seed for debugging purposes
np.random.seed(0)

#  Number of random points to choose
N = 1000000

#  Create a 2-dimensional numpy array.  Each row will correspond to one
#  point with the entry in the first column being the X value and the
#  second column providing the Y values.
p = np.random.rand(N, 2)

#  Use the built-in norm function to calculate the distance from the origin
n = np.linalg.norm(p, axis = 1, keepdims = True)

#  Sum up the number of entries in the circle.  That is count the number of
#  points who have a norm less than or equal to one
number_in_circle =  np.sum(n <= 1)

#  Print out the results
print 'pi = ', 4 * float(number_in_circle) / float(N)

While this example borders on the trivial, it serves as a good starting point for more sophisticated calculations.  As noted in the introduction, I've used Monte Carlo methods in financial modeling (which we will discuss in later posts), as well as in complex physical systems such as modeling the loss of active material from cycling batteries.

Some comments on vectorization

The code above uses a "vectorized" implementation.  If you are familiar with the concept, there is no need to read further.  If not, I'd like to take a moment to explain the concept because we will use it heavily in future articles.

Numpy is a package for numerical calculations in Python.  It provides a significant speed increase over writing code in straight Python because it is written in C rather than Python itself.  It also provides the ability to "vectorize" calculations meaning operation can be performed on whole vectors or matrices without having to explicitly loop through the elements.  This can further enhance speed considerably.
Figure 4:  Snapshot of a Jupyter notebook.  We load in the numpy and math libraries (the math functionality will be used later), create two numpy arrays, add them together, and finally print out the results.
As an example, in Fig 4, we create two arrays p1, and p2, and compute their elementwise sum.  There is no need to loop over each index and add them in a piecemeal manner.  Numpy also provides a number of functions that can perform vectorized operation on arrays.  Notice in the code to estimate $\pi$,  we use the functions np.linalg.norm and np.sum to calculate the distance of the point from the origin and count the number of points with the circle, respectively.

Figure 5:  Compute the L2 norm of a vector $p$, and compare the result to that of calculating the distance between $p$ and the point (0, 0) using the Pythagorean theorem.
The function np.linalg.norm calculates the length of a vector (L2 norm).  In the code snippet shown in Fig. 5 we show that calculating the L2 norm via this  functionality is the same as explicitly calculating the length between the origin and a point $p$.  Lastly in Fig 6. we show the equivalence of looping over an array and calculating the distance of each point in that array to simply using the built in norm function.

Figure 6:  Combining the notions from Figs 4 & 5, we compare the vectorized implementation of the norm of an array of points to that of explicitly looping over the array and calculating the L2 norm via the Pythagorean theorem.

The vectorization used here is straightforward and pretty intuitive, but it is important to have a handle on the concept.  In subsequent articles where we discuss ransom walks in the context of modeling stock behavior, we will lean heavily on this as well as SciPy's ability to deal with sparse matrices  rapidly.  This is a bit more abstract, and thus it is a good idea to make sure one understands the simple case before jumping into the deep end.


Articles in this series

Part II:  Monte Carlo Techniques: Modeling Stock Price Action
Part III:  Monte Carlo Techniques:  Calculating the Probability of Making 50% of Max Profit on Short Option Positions
Part IV:  Monte Carlo Techniques:  Estimating Correlations between Option Positions and  and Their Underlying Stock