GHC supports several pragmas, or instructions to the compiler placed in the source code. Pragmas don't normally affect the meaning of the program, but they might affect the efficiency of the generated code.
Pragmas all take the form
{-#
where word
... #-}word
indicates the type of
pragma, and is followed optionally by information specific to that
type of pragma. Case is ignored in
word
. The various values for
word
that GHC understands are described
in the following sections; any pragma encountered with an
unrecognised word
is
ignored. The layout rule applies in pragmas, so the closing #-}
should start in a column to the right of the opening {-#
.
Certain pragmas are file-header pragmas:
A file-header
pragma must precede the module
keyword in the file.
There can be as many file-header pragmas as you please, and they can be preceded or followed by comments.
File-header pragmas are read once only, before pre-processing the file (e.g. with cpp).
The file-header pragmas are: {-# LANGUAGE #-}
,
{-# OPTIONS_GHC #-}
, and
{-# INCLUDE #-}
.
The LANGUAGE
pragma allows language extensions to be enabled
in a portable way.
It is the intention that all Haskell compilers support the
LANGUAGE
pragma with the same syntax, although not
all extensions are supported by all compilers, of
course. The LANGUAGE
pragma should be used instead
of OPTIONS_GHC
, if possible.
For example, to enable the FFI and preprocessing with CPP:
{-# LANGUAGE ForeignFunctionInterface, CPP #-}
LANGUAGE
is a file-header pragma (see Section 7.13, “Pragmas”).
Every language extension can also be turned into a command-line flag
by prefixing it with "-X
"; for example -XForeignFunctionInterface
.
(Similarly, all "-X
" flags can be written as LANGUAGE
pragmas.
A list of all supported language extensions can be obtained by invoking
ghc --supported-extensions
(see Section 4.5, “Modes of operation”).
Any extension from the Extension
type defined in
Language.Haskell.Extension
may be used. GHC will report an error if any of the requested extensions are not supported.
The OPTIONS_GHC
pragma is used to specify
additional options that are given to the compiler when compiling
this source file. See Section 4.2.2, “Command line options in source files” for
details.
Previous versions of GHC accepted OPTIONS
rather
than OPTIONS_GHC
, but that is now deprecated.
OPTIONS_GHC
is a file-header pragma (see Section 7.13, “Pragmas”).
The INCLUDE
used to be necessary for
specifying header files to be included when using the FFI and
compiling via C. It is no longer required for GHC, but is
accepted (and ignored) for compatibility with other
compilers.
The WARNING pragma allows you to attach an arbitrary warning to a particular function, class, or type. A DEPRECATED pragma lets you specify that a particular function, class, or type is deprecated. There are two ways of using these pragmas.
You can work on an entire module thus:
module Wibble {-# DEPRECATED "Use Wobble instead" #-} where ...
Or:
module Wibble {-# WARNING "This is an unstable interface." #-} where ...
When you compile any module that import
Wibble
, GHC will print the specified
message.
You can attach a warning to a function, class, type, or data constructor, with the following top-level declarations:
{-# DEPRECATED f, C, T "Don't use these" #-} {-# WARNING unsafePerformIO "This is unsafe; I hope you know what you're doing" #-}
When you compile any module that imports and uses any of the specified entities, GHC will print the specified message.
You can only attach to entities declared at top level in the module
being compiled, and you can only use unqualified names in the list of
entities. A capitalised name, such as T
refers to either the type constructor T
or the data constructor T
, or both if
both are in scope. If both are in scope, there is currently no way to
specify one without the other (c.f. fixities
Section 7.4.3, “Infix type constructors, classes, and type variables”).
Warnings and deprecations are not reported for (a) uses within the defining module, and (b) uses in an export list. The latter reduces spurious complaints within a library in which one module gathers together and re-exports the exports of several others.
You can suppress the warnings with the flag
-fno-warn-warnings-deprecations
.
These pragmas control the inlining of function definitions.
GHC (with -O
, as always) tries to
inline (or “unfold”) functions/values that are
“small enough,” thus avoiding the call overhead
and possibly exposing other more-wonderful optimisations.
Normally, if GHC decides a function is “too
expensive” to inline, it will not do so, nor will it
export that unfolding for other modules to use.
The sledgehammer you can bring to bear is the
INLINE
pragma, used thusly:
key_function :: Int -> String -> (Bool, Double) {-# INLINE key_function #-}
The major effect of an INLINE
pragma
is to declare a function's “cost” to be very low.
The normal unfolding machinery will then be very keen to
inline it. However, an INLINE
pragma for a
function "f
" has a number of other effects:
While GHC is keen to inline the function, it does not do so blindly. For example, if you write
map key_function xs
there really isn't any point in inlining key_function
to get
map (\x -> body
) xs
In general, GHC only inlines the function if there is some reason (no matter how slight) to suppose that it is useful to do so.
Moreover, GHC will only inline the function if it is fully applied, where "fully applied" means applied to as many arguments as appear (syntactically) on the LHS of the function definition. For example:
comp1 :: (b -> c) -> (a -> b) -> a -> c {-# INLINE comp1 #-} comp1 f g = \x -> f (g x) comp2 :: (b -> c) -> (a -> b) -> a -> c {-# INLINE comp2 #-} comp2 f g x = f (g x)
The two functions comp1
and comp2
have the
same semantics, but comp1
will be inlined when applied
to two arguments, while comp2
requires
three. This might make a big difference if you say
map (not `comp1` not) xs
which will optimise better than the corresponding use of `comp2`.
It is useful for GHC to optimise the definition of an
INLINE function f
just like any other non-INLINE function,
in case the non-inlined version of f
is
ultimately called. But we don't want to inline
the optimised version
of f
;
a major reason for INLINE pragmas is to expose functions
in f
's RHS that have
rewrite rules, and it's no good if those functions have been optimised
away.
So GHC guarantees to inline precisely the code that you wrote, no more and no less. It does this by capturing a copy of the definition of the function to use for inlining (we call this the "inline-RHS"), which it leaves untouched, while optimising the ordinarily RHS as usual. For externally-visible functions the inline-RHS (not the optimised RHS) is recorded in the interface file.
An INLINE function is not worker/wrappered by strictness analysis. It's going to be inlined wholesale instead.
GHC ensures that inlining cannot go on forever: every mutually-recursive group is cut by one or more loop breakers that is never inlined (see Secrets of the GHC inliner, JFP 12(4) July 2002). GHC tries not to select a function with an INLINE pragma as a loop breaker, but when there is no choice even an INLINE function can be selected, in which case the INLINE pragma is ignored. For example, for a self-recursive function, the loop breaker can only be the function itself, so an INLINE pragma is always ignored.
Syntactically, an INLINE
pragma for a
function can be put anywhere its type signature could be
put.
INLINE
pragmas are a particularly
good idea for the
then
/return
(or
bind
/unit
) functions in
a monad. For example, in GHC's own
UniqueSupply
monad code, we have:
{-# INLINE thenUs #-} {-# INLINE returnUs #-}
See also the NOINLINE
(Section 7.13.5.3, “NOINLINE pragma”)
and INLINABLE
(Section 7.13.5.2, “INLINABLE pragma”)
pragmas.
Note: the HBC compiler doesn't like INLINE
pragmas,
so if you want your code to be HBC-compatible you'll have to surround
the pragma with C pre-processor directives
#ifdef __GLASGOW_HASKELL__
...#endif
.
An {-# INLINABLE f #-}
pragma on a
function f
has the following behaviour:
While INLINE
says "please inline me", the INLINABLE
says "feel free to inline me; use your
discretion". In other words the choice is left to GHC, which uses the same
rules as for pragma-free functions. Unlike INLINE
, that decision is made at
the call site, and
will therefore be affected by the inlining threshold, optimisation level etc.
Like INLINE
, the INLINABLE
pragma retains a
copy of the original RHS for
inlining purposes, and persists it in the interface file, regardless of
the size of the RHS.
One way to use INLINABLE
is in conjunction with
the special function inline
(Section 7.15, “Special built-in functions”).
The call inline f
tries very hard to inline f
.
To make sure that f
can be inlined,
it is a good idea to mark the definition
of f
as INLINABLE
,
so that GHC guarantees to expose an unfolding regardless of how big it is.
Moreover, by annotating f
as INLINABLE
,
you ensure that f
's original RHS is inlined, rather than
whatever random optimised version of f
GHC's optimiser
has produced.
The INLINABLE
pragma also works with SPECIALISE
:
if you mark function f
as INLINABLE
, then
you can subsequently SPECIALISE
in another module
(see Section 7.13.8, “SPECIALIZE pragma”).
Unlike INLINE
, it is OK to use
an INLINABLE
pragma on a recursive function.
The principal reason do to so to allow later use of SPECIALISE
The NOINLINE
pragma does exactly what
you'd expect: it stops the named function from being inlined
by the compiler. You shouldn't ever need to do this, unless
you're very cautious about code size.
NOTINLINE
is a synonym for
NOINLINE
(NOINLINE
is
specified by Haskell 98 as the standard way to disable
inlining, so it should be used if you want your code to be
portable).
An INLINE or NOINLINE pragma may have a CONLIKE modifier, which affects matching in RULEs (only). See Section 7.14.3, “How rules interact with INLINE/NOINLINE and CONLIKE pragmas”.
Sometimes you want to control exactly when in GHC's
pipeline the INLINE pragma is switched on. Inlining happens
only during runs of the simplifier. Each
run of the simplifier has a different phase
number; the phase number decreases towards zero.
If you use -dverbose-core2core
you'll see the
sequence of phase numbers for successive runs of the
simplifier. In an INLINE pragma you can optionally specify a
phase number, thus:
"INLINE[k] f
" means: do not inline
f
until phase k
, but from phase
k
onwards be very keen to inline it.
"INLINE[~k] f
" means: be very keen to inline
f
until phase k
, but from phase
k
onwards do not inline it.
"NOINLINE[k] f
" means: do not inline
f
until phase k
, but from phase
k
onwards be willing to inline it (as if
there was no pragma).
"NOINLINE[~k] f
" means: be willing to inline
f
until phase k
, but from phase
k
onwards do not inline it.
The same information is summarised here:
-- Before phase 2 Phase 2 and later {-# INLINE [2] f #-} -- No Yes {-# INLINE [~2] f #-} -- Yes No {-# NOINLINE [2] f #-} -- No Maybe {-# NOINLINE [~2] f #-} -- Maybe No {-# INLINE f #-} -- Yes Yes {-# NOINLINE f #-} -- No No
By "Maybe" we mean that the usual heuristic inlining rules apply (if the function body is small, or it is applied to interesting-looking arguments etc). Another way to understand the semantics is this:
For both INLINE and NOINLINE, the phase number says when inlining is allowed at all.
The INLINE pragma has the additional effect of making the function body look small, so that when inlining is allowed it is very likely to happen.
The same phase-numbering control is available for RULES (Section 7.14, “Rewrite rules ”).
This pragma is similar to C's #line
pragma, and is mainly for use in automatically generated Haskell
code. It lets you specify the line number and filename of the
original code; for example
{-# LINE 42 "Foo.vhs" #-}
if you'd generated the current file from something called
Foo.vhs
and this line corresponds to line
42 in the original. GHC will adjust its error messages to refer
to the line/file named in the LINE
pragma.
The RULES pragma lets you specify rewrite rules. It is described in Section 7.14, “Rewrite rules ”.
(UK spelling also accepted.) For key overloaded functions, you can create extra versions (NB: more code space) specialised to particular types. Thus, if you have an overloaded function:
hammeredLookup :: Ord key => [(key, value)] -> key -> value
If it is heavily used on lists with
Widget
keys, you could specialise it as
follows:
{-# SPECIALIZE hammeredLookup :: [(Widget, value)] -> Widget -> value #-}
A SPECIALIZE
pragma for a function can
be put anywhere its type signature could be put.
A SPECIALIZE
has the effect of generating
(a) a specialised version of the function and (b) a rewrite rule
(see Section 7.14, “Rewrite rules
”) that rewrites a call to the
un-specialised function into a call to the specialised one.
The type in a SPECIALIZE pragma can be any type that is less
polymorphic than the type of the original function. In concrete terms,
if the original function is f
then the pragma
{-# SPECIALIZE f :: <type> #-}
is valid if and only if the definition
f_spec :: <type> f_spec = f
is valid. Here are some examples (where we only give the type signature for the original function, not its code):
f :: Eq a => a -> b -> b {-# SPECIALISE f :: Int -> b -> b #-} g :: (Eq a, Ix b) => a -> b -> b {-# SPECIALISE g :: (Eq a) => a -> Int -> Int #-} h :: Eq a => a -> a -> a {-# SPECIALISE h :: (Eq a) => [a] -> [a] -> [a] #-}
The last of these examples will generate a RULE with a somewhat-complex left-hand side (try it yourself), so it might not fire very well. If you use this kind of specialisation, let us know how well it works.
A SPECIALIZE
pragma can optionally be followed with a
INLINE
or NOINLINE
pragma, optionally
followed by a phase, as described in Section 7.13.5, “INLINE and NOINLINE pragmas”.
The INLINE
pragma affects the specialised version of the
function (only), and applies even if the function is recursive. The motivating
example is this:
-- A GADT for arrays with type-indexed representation data Arr e where ArrInt :: !Int -> ByteArray# -> Arr Int ArrPair :: !Int -> Arr e1 -> Arr e2 -> Arr (e1, e2) (!:) :: Arr e -> Int -> e {-# SPECIALISE INLINE (!:) :: Arr Int -> Int -> Int #-} {-# SPECIALISE INLINE (!:) :: Arr (a, b) -> Int -> (a, b) #-} (ArrInt _ ba) !: (I# i) = I# (indexIntArray# ba i) (ArrPair _ a1 a2) !: i = (a1 !: i, a2 !: i)
Here, (!:)
is a recursive function that indexes arrays
of type Arr e
. Consider a call to (!:)
at type (Int,Int)
. The second specialisation will fire, and
the specialised function will be inlined. It has two calls to
(!:)
,
both at type Int
. Both these calls fire the first
specialisation, whose body is also inlined. The result is a type-based
unrolling of the indexing function.
Warning: you can make GHC diverge by using SPECIALISE INLINE
on an ordinarily-recursive function.
Generally, you can only give a SPECIALIZE
pragma
for a function defined in the same module.
However if a function f
is given an INLINABLE
pragma at its definition site, then it can subsequently be specialised by
importing modules (see Section 7.13.5.2, “INLINABLE pragma”).
For example
module Map( lookup, blah blah ) where lookup :: Ord key => [(key,a)] -> key -> Maybe a lookup = ... {-# INLINABLE lookup #-} module Client where import Map( lookup ) data T = T1 | T2 deriving( Eq, Ord ) {-# SPECIALISE lookup :: [(T,a)] -> T -> Maybe a
Here, lookup
is declared INLINABLE
, but
it cannot be specialised for type T
at its definition site,
because that type does not exist yet. Instead a client module can define T
and then specialise lookup
at that type.
Moreover, every module that imports Client
(or imports a module
that imports Client
, transitively) will "see", and make use of,
the specialised version of lookup
. You don't need to put
a SPECIALIZE
pragma in every module.
Moreover you often don't even need the SPECIALIZE
pragma in the
first place. When compiling a module M,
GHC's optimiser (with -O) automatically considers each top-level
overloaded function declared in M, and specialises it
for the different types at which it is called in M. The optimiser
also considers each imported
INLINABLE
overloaded function, and specialises it
for the different types at which it is called in M.
So in our example, it would be enough for lookup
to
be called at type T
:
module Client where import Map( lookup ) data T = T1 | T2 deriving( Eq, Ord ) findT1 :: [(T,a)] -> Maybe a findT1 m = lookup m T1 -- A call of lookup at type T
However, sometimes there are no such calls, in which case the pragma can be useful.
Note: In earlier versions of GHC, it was possible to provide your own specialised function for a given type:
{-# SPECIALIZE hammeredLookup :: [(Int, value)] -> Int -> value = intLookup #-}
This feature has been removed, as it is now subsumed by the
RULES
pragma (see Section 7.14.5, “Specialisation
”).
Same idea, except for instance declarations. For example:
instance (Eq a) => Eq (Foo a) where { {-# SPECIALIZE instance Eq (Foo [(Int, Bar)]) #-} ... usual stuff ... }
The pragma must occur inside the where
part
of the instance declaration.
Compatible with HBC, by the way, except perhaps in the placement of the pragma.
The UNPACK
indicates to the compiler
that it should unpack the contents of a constructor field into
the constructor itself, removing a level of indirection. For
example:
data T = T {-# UNPACK #-} !Float {-# UNPACK #-} !Float
will create a constructor T
containing
two unboxed floats. This may not always be an optimisation: if
the T
constructor is scrutinised and the
floats passed to a non-strict function for example, they will
have to be reboxed (this is done automatically by the
compiler).
Unpacking constructor fields should only be used in
conjunction with -O
[11], in order to expose
unfoldings to the compiler so the reboxing can be removed as
often as possible. For example:
f :: T -> Float f (T f1 f2) = f1 + f2
The compiler will avoid reboxing f1
and f2
by inlining +
on floats, but only when -O
is on.
Any single-constructor data is eligible for unpacking; for example
data T = T {-# UNPACK #-} !(Int,Int)
will store the two Int
s directly in the
T
constructor, by flattening the pair.
Multi-level unpacking is also supported:
data T = T {-# UNPACK #-} !S data S = S {-# UNPACK #-} !Int {-# UNPACK #-} !Int
will store two unboxed Int#
s
directly in the T
constructor. The
unpacker can see through newtypes, too.
See also the -funbox-strict-fields
flag,
which essentially has the effect of adding
{-# UNPACK #-}
to every strict
constructor field.
The {-# SOURCE #-}
pragma is used only in import
declarations,
to break a module loop. It is described in detail in Section 4.7.9, “How to compile mutually recursive modules”.