Haskell's `IO` & Monads

In this lecture we will accept the fact that in order to do anything useful with a programming language we need side effects. A program that does not read input or produce output (IO) is not very useful, because we would not have a means to access the results it produced. Unfortunately, producing output is an impure operation. For example, the putStrLn function puts something in stdout. Calling putStrLn again adds a second line to stdout, hence, calling the function does not always produce the same output. We are mutating something outside of our function!

haskell

𝝺> putStrLn "Hello World"   | stdout: Hello World

𝝺> putStrLn "Hello World"   | stdout: Hello World
                                      Hello World

Believe it or not, this was quite a problem for functional programming languages! Luckily, the Haskell community came up with a solution: Haskell works with the whole world as its state! So, theoretically, putStrLn becomes a function that accepts a string, the current world, and outputs a mutated world where the string has appeared on your screen:

haskell

putStrLn :: String -> World -> World

This is now a completely pure function! Very similarly, we could define a pure function that reads an input from the world:

haskell

getLine :: World -> (String, World)

With this approach we could write pure programs that read from an input and write to an output^[1]:

haskell

helloworld :: World -> World
helloworld w1 = w4 where
  w2         = putStrLn "What is your name?" w1
  (name, w3) = getLine w2
  w4         = putStrLn ("Hello " ++ name) w3

Of course it would be awfully annoying to have to pass around the state of the world in every function that should perform IO. Hence, Haskellers came up with a special type that separates impure, IO parts of programs from our normal, functional code. This type is called: IO. Essentially, IO is encapsulating the state of the whole world as a function from one World state to another World state (including an additional type we can use for, e.g. reading a string from the world).

haskell

type IO a = World -> (a, World)

In the rest of this lecture we will learn how to abstract away the World from our IO programs, such that we can rewrite the helloworld function like this:

haskell

helloworld :: IO ()
helloworld = do
  putStrLn "What is your name?"
  name <- getLine
  putStrLn ("Hello " ++ name)

Where the World completely disappeared and we can write code that looks very much like procedural programming.

Note

The actual implementation of IO of course does not really operate with "the whole world". IO is merely the type that signals: Mutation happening here! Watch out! But for the sake of your mental model its nice to work with a World type in this lecture.

Monads

The fundamental concept you will learn about today is called a monad. Monads were initially introduced to formalize and simplify working with the mutating nature of IO in Haskell. However, monads extend far beyond IO and mutation. They are a powerful way to compose computations over values a that are wrapped in some context m. A context refers to anything that provides additional information about the value we are working with. Common Haskell types that are instances of monad are: Maybe, list [], IO, State, and many more.

`IO` actions

Haskell's IO is a functor which satisfies further properties (collected under the name monad). IO is a type constructor that produces values of type IO a. You can think of it as:

haskell

type IO a = World -> (a, World)

IO a is called an action. When we run an IO action, it produces a value of type a. For example, the function getLine with the definition of IO above just becomes:

haskell

getLine :: IO String

so getLine can be regarded as an action that (once we run it) produces a value of IO String (i.e. a modified world from which we read a String). The putStrLn function from before has to be slightly modified to work with IO. It does not return anything except a modified world, so we will represent the missing a as ():

haskell

putStrLn :: String -> World -> ((),World)

Written in terms of IO it becomes:

haskell

putStrLn :: String -> IO ()

i.e. a function that accepts a string and outputs an action that only contains a mutated world, nothing else.

With the definitions above we have completely hidden the World and we could try to rewrite helloworld:

haskell

helloworld :: IO ()
helloworld = 
  let ac_name = getLine          -- IO String
  in putStrLn ("Hello " ++ ac_name) -- This fails! We cannot ++ with an action!

The above code won't compile, because the function (++) :: String -> String -> String does not work for the case we have here: String -> IO String -> String. We need a way to manipulate the values that are hidden inside our IO actions.

Taking a step back, what we really need is a way to sequence the getLine :: IO String action with the putStrLn :: String -> IO () action:

haskell

??? :: IO String -> (String -> IO ()) -> IO ()

(Un)fortunately, we ran into the very same problem already last time when trying to sequence failing computations and the solution was a function andThen. The more general concept that this function encapsulates is called a Monad.

Monads

In the previous lecture we pulled out the boilerplate that was needed to chain failing computations into a function andThen

haskell

andThen :: Maybe a -> (a -> Maybe b) -> Maybe b

which accepted a value Maybe a and inserted it into a function a -> Maybe b.

The problem of chaining IO actions is almost exactly the same! We want to sequence a value coming from getLine which is an IO String action and stick it into a String -> IO () function. The general typeclass that Haskell defines for this is called a Monad:

haskell

class Applicative m => Monad m where
  (>>=) :: m a -> (a -> m b) -> m b
  (>>) :: m a -> m b -> m b
  return :: a -> m a

Referring to the type variable m as a computational context we can unpack the definition above:

The bind operator (>>=), which sequences values and functions with a context m.
(>>) does the same as >>=, it just neglects the value a. In fact it can be implemented in terms of >>=: x >> f = x >>= \_ -> f.
return constructs a computational context. (Note, that this is just a normal function, not a statement like in procedural languages. You will see in a few paragraphs why this function is called return.)

Additionally, we can see that a Monad has a type constraint of Applicative. We will discuss Applicatives in the next lecture; for now you can just think of this type constraint to be Functor such that, every Monad is also a Functor.

Monads are functors.

Every monad is a functor as we can express fmap in terms of >>=:

haskell

fmap :: (Monad m) => (a -> b) -> m a -> m b
fmap f x = x >>= return . f

-- return :: b -> m b
-- return . f :: a -> m b

`IO` Monad

Haskell already implements the monad instance of IO (and many other types) for us, so with its help we can rewrite our helloworld function, but first, the two ingredients we need:

(>>) :: IO a -> IO b -> IO b: composes two IO actions (the first action is performed only for its side effect), for example:

haskell

main :: IO ()
main = 
  putStrLn "hello" -- :: IO ()
  >> putStrLn "world"  -- ignore result of IO () and chain next action

x >>= f :: IO a -> (a -> IO b) -> IO b is the action that performs first x, and then passes its result to f which returns a second action to be performed:

haskell

-- getLine :: IO String
-- putStrLn :: String -> IO ()
main = getLine >>= putStrLn

The helloworld function first asks for a name, then reads from input, and finally prints "Hello NAME". These steps can be encapsulated in three IO actions, which have to be chained. We can start with reading from input and printing, because we almost have the function written above. We just want to do something slightly more complex than putStrLn:

haskell

main = getLine >>= \name -> putStrLn ("Hello " ++ name)

Above, we call getLine, which produces an IO String. We want to use its value, so we can define a function that accepts a string name and processes it before passing the result to putStrLn.

As the last step we want to ask "What is your name?". This is also an IO action that as to happen before we read/print to stdout. We don't care about the output of this action, we just want to print, so we can use >>:

haskell

helloworld :: IO ()
helloworld = 
  putStrLn "What is your name?" >>
  getLine >>=
  \name -> putStrLn ("Hello " ++ name)

Separation of IO side effects

The IO monad is the only way to work with IO side effects in Haskell. It is not possible to access the values that are stored in an IO action, like we would be able to with other monads, like Maybe:

haskell

getMaybe :: Maybe Int -> Int
getMaybe (Just x) = x
getMaybe _ = 0

There is no accessible data constructor for IO, so we cannot do pattern matching on values of type IO a.

We can manipulate IO actions only via bind >>=.

More safe computations

Recalling our two safe functions from the last lecture

haskell

safeHead :: [a] -> Maybe a
safeHead [] = Nothing
safeHead xs = Just (head xs)

safeTail :: [a] -> Maybe [a]
safeTail [] = Nothing
safeTail (_:xs) = Just xs

We can now define safeSecond with the more general >>= that works for any monad:

haskell

safeSecond :: [a] -> Maybe a
safeSecond xs = safeTail xs >>= safeHead

>>= implementation for Maybe:

haskell

instance Monad Maybe where
  return  = Just
  Nothing >>= _ = Nothing
  Just x  >>= k = k x

Let's assume we want to sum the first two elements of a list. We can do this in a safe way by sequencing safeHead, safeSecond and then summing:

haskell

sumFirstTwo :: Num a => [a] -> Maybe a
sumFirstTwo xs =
  safeHead xs >>=
  \first -> safeSecond xs >>=
  \second -> 
    return (first + second)

The return function

Sometimes, in order to combine results of previous actions it is useful to just wrap a value in a monadic context. This is what the function return is for:

haskell

getSquare :: IO Int
getSquare = putStrLn "Enter number:"
            >> getLine
            >>= \line -> let n = read line
                         in return (n*n)

Above, we read a line, parse it to an Int (via read), and then make sure that the thing we return from our lambda function is actually an IO action by using return.

`do`-notation

The kind of nesting of >>= and lambda functions above can become very tedious and confusing. To simplify things, and make them look very much like procedural programming, we can use do-notation.

do-notation is a syntax block (like e.g. where or let) that lets you sequence actions more easily:

Actions on a separate line get executed
value <- x runs action x and binds the result to v

With do we can rewrite the above function to

haskell

sumFirstTwo :: Num a => [a] -> Maybe a
sumFirstTwo xs = do
  first <- safeHead xs
  second <- safeSecond xs
  return (first + second)

Now, we can also finally understand the second version of our initial helloworld function. Compare the version with >>= to the one with do-notation:

haskell

helloworld :: IO ()
helloworld = 
  putStrLn "What is your name?" >>
  getLine >>=
  \name -> putStrLn ("Hello " ++ name)

helloworld :: IO ()
helloworld = do
  putStrLn "What is your name?"
  name <- getLine
  putStrLn ("Hello " ++ name)

List monad

Most types that provide a context in Haskell are instance of Monad. We've already seen it for Maybe, another prominent example is the list monad:

haskell

instance Monad [] where
  return :: a -> [a]
  return x = [x]

  (>>=) :: [a] -> (a -> [b]) -> [b]
  xs >>= f = concat (map f xs)

With the list monad we can do things like

haskell

𝝺> [1,2,3] >>= \x -> [x, 10*x, 100*x]
[1,10,100, 2,20,200, 3,30,300]

That must mean we can use do-notation for the list monad as well!

haskell

squares :: [Int]
squares = do
  x <- [1,2,3]
  return x*x

With the above we can also see where the syntax for list comprehensions comes from. They are essentially syntactic sugar for do-notation:

haskell

squares :: [Int]
squares = [x*x | x <- [1,2,3]]

Higher-order monadic functions

We are in Haskell, so of course it is possible to use higher-order functions with monads. This is another level of abstraction and you should first get comfortable with monads themselves, but just so you have seen two examples:

You can sequence a list of monadic actions:

haskell

sequence :: Monad m => [m a] -> m [a]
sequence_ :: Monad m => [m a] -> m ()

ioActions :: [IO ()]
ioActions = [print "Hello!", putStrLn "just kidding", getLine >>= putStrLn]

𝝺> sequence_ ioActions

Or use the monadic versions of map:

haskell

mapM :: Monad m => (a -> m b) -> [a] -> m [b]
mapM_ :: Monad m => (a -> m b) -> [a] -> m ()

𝝺> mapM putStrLn ["a", "b", "c"]
a
b
c
[(),(),()]

Stolen from here, which is a highly recommended video! ↩︎

Haskell's IO & Monads ​

IO actions ​

Monads ​

IO Monad ​

More safe computations ​

do-notation ​

List monad ​

Higher-order monadic functions ​

Haskell's `IO` & Monads

`IO` actions

Monads

`IO` Monad

More safe computations

`do`-notation

List monad

Higher-order monadic functions