2.1. Version 9.2.1¶
The significant changes to the various parts of the compiler are listed in the following sections.
The LLVM backend
of this release is to be used with LLVM
10 or 11.
2.1.1. Language¶
ImpredicativeTypes
: Finally, polymorphic types have become first class! GHC 9.2 includes a full implementation of the Quick Look approach to type inference for impredicative types, as described in in the paper A quick look at impredicativity (Serrano et al, ICFP 2020). More information here: Impredicative polymorphism. This replaces the old (undefined, flaky) behaviour of theImpredicativeTypes
extension.- The first stage of the Pointer Rep Proposal has been implemented. All
boxed types, both lifted and unlifted, now have representation kinds of
the shape
BoxedRep r
. Code that referencesLiftedRep
andUnliftedRep
will need to be updated.
UnliftedDatatypes
: The Unlifted Datatypes Proposal has been implemented. That means GHC Haskell now offers a way to define algebraic data types with strict semantics like in OCaml or Idris! The distinction to ordinary lifted data types is made in the kind system: Unlifted data types live in kindTYPE (BoxedRep Unlifted)
.UnliftedDatatypes
allows giving data declarations such result kinds, such as in the following example with the help ofStandaloneKindSignatures
:type IntSet :: UnliftedType -- type UnliftedType = TYPE (BoxedRep Unlifted) data IntSet = Branch IntSet !Int IntSet | Leaf
See
UnliftedDatatypes
for what other declarations are possible. Slight caveat: Most functions inbase
(including$
) are not levity-polymorphic (yet) and hence won’t work with unlifted data types.
Kind inference for data/newtype instance declarations is slightly more restrictive than before. In particular, GHC now requires that the kind of a data family instance be fully determined by the header of the instance, without looking at the definition of the constructor.
This means that data families that dispatched on an invisible parameter might now require this parameter to be made explicit, as in the following example:
data family DF :: forall (r :: RuntimeRep). TYPE r newtype instance DF @IntRep = MkDF2 Int# newtype instance DF @FloatRep = MkDF1 Float#
See the user manual Kind inference for data/newtype instance declarations.
GHC is stricter about checking for out-of-scope type variables on the right-hand sides of associated type family instances that are not bound on the left-hand side. As a result, some programs that were accidentally accepted in previous versions of GHC will now be rejected, such as this example:
class Funct f where type Codomain f instance Funct ('KProxy :: KProxy o) where type Codomain 'KProxy = NatTr (Proxy :: o -> Type)
Where:
data Proxy (a :: k) = Proxy data KProxy (t :: Type) = KProxy data NatTr (c :: o -> Type)
GHC will now reject the
o
on the right-hand side of theCodomain
instance as being out of scope, as it does not meet the requirements for being explicitly bound (as it is not mentioned on the left-hand side) nor implicitly bound (as it is not mentioned in an outermost kind signature, as required by Scoping of class parameters). This program can be repaired in a backwards-compatible way by mentioningo
on the left-hand side:instance Funct ('KProxy :: KProxy o) where type Codomain ('KProxy @o) = NatTr (Proxy :: o -> Type) -- Alternatively, -- type Codomain ('KProxy :: KProxy o) = NatTr (Proxy :: o -> Type)
Previously,
-XUndecidableInstances
accidentally implied-XFlexibleContexts
. This is now fixed, but it means that some programs will newly require-XFlexibleContexts
.The
GHC2021
language is supported now. It builds on top of Haskell2010, adding several stable and conservative extensions, and removing deprecated ones. It is now also the “default” language set that is active when no other language set, such asHaskell98
orHaskell2010
, is explicitly loaded (e.g via Cabal’sdefault-language
).Because
GHC2021
includesGeneralizedNewtypeDeriving
, which is not safe for Safe Haskell, users of Safe Haskell are advised to useHaskell2010
explicitly.The default mode of GHC until 9.0 included
NondecreasingIndentation
, butGHC2021
does not. This may break code implicitly using this extension.The Record Dot Syntax Proposal has been implemented:
- A new extension
OverloadedRecordDot
provides record.
syntax e.g.x.foo
- A new extension
OverloadedRecordUpdate
provides record.
syntax in record updates e.g.x{foo.bar = 1}
. The design of this extension may well change in the future.
- A new extension
Various records-related extensions have been improved:
- A new extension
NoFieldSelectors
hides record field selector functions, so it is possible to define top-level bindings with the same names. - The
DisambiguateRecordFields
extension now works for updates. An updateexpr { field = value }
will be accepted if there is a single field calledfield
in scope, regardless of whether there are non-fields in scope with the same name. - The
DuplicateRecordFields
extension now applies to fields in record pattern synonyms. In particular, it is possible for a single module to define multiple pattern synonyms using the same field names.
- A new extension
Because of simplifications to the way that GHC typechecks operator sections, operators with nested
forall
s or contexts in their type signatures might not typecheck when used in a section. For instance, theg
function below, which was accepted in previous GHC releases, will no longer typecheck:f :: a -> forall b. b -> a f x _ = x g :: a -> a g = (`f` "hello")
g
can be made to typecheck once more by eta expanding it to\x -> x `f` "hello"
. For more information, see simple-subsumption.LinearTypes
can now infer multiplicity forcase
expressions. Previously, the scrutinee of acase
(the bit betweencase
andof
) was always assumed to have a multiplicity ofMany
. Now, GHC will allow the scrutinee to have a multiplicity ofOne
, using its best-effort inference algorithm.Support for matching on GADT constructors in arrow notation has been removed, as the current implementation of
Arrows
doesn’t handle GADT evidence correctly.One possible workaround, for the time being, is to perform GADT matches inside let bindings:
data G a where MkG :: Show a => a -> G a foo :: G a -> String foo = proc x -> do let res = case x of { MkG a -> show a } returnA -< res
2.1.2. Compiler¶
GHC now has an ARMv8 native code generator, significantly improving compilation performance for ARM targets and eliminating a dependency on LLVM.
Performance of the compiler in
--make
mode with-j[⟨n⟩]
is significantly improved by improvements to the parallel garbage collector noted below.Benchmarks show a 20% decrease in wall clock time, and a 40% decrease in cpu time, when compiling Cabal with
-j4
on linux. Improvements are more dramatic with higher parallelism, and we no longer see significant degradation in wall clock time as parallelism increases above 4.New
-Wredundant-bang-patterns
flag that enables checks for “dead” bangs. For instance, given this program:f :: Bool -> Bool f True = False f !x = x
GHC would report that the bang on
x
is redundant and can be removed since the argument was already forced in the first equation. For more details see-Wredundant-bang-patterns
.New
-Wimplicit-lift
flag which warns when a Template Haskell quote implicitly useslift
.New
-finline-generics
and-finline-generics-aggressively
flags for improving performance of generics-based algorithms.For more details see
-finline-generics
and-finline-generics-aggressively
.GHC now supports a flag,
-fprof-callers=⟨name⟩
, for requesting that the compiler automatically insert cost-centres on all call-sites of the named function.The heap profiler can now be controlled from within a Haskell program using functions in
GHC.Profiling
. Profiling can be started and stopped or a heap census requested at a specific point in the program. There is a new RTS flag--no-automatic-heap-samples
which can be used to stop heap profiling starting when a program starts.A new debugging facility,
-finfo-table-map
, which embeds a mapping from the address of an info table to information about that info table, including an approximate source position.-fdistinct-constructor-tables
is also useful with this flag to give each usage of a data constructor its own unique info table so they can be distinguished in gdb and heap profiles.
2.1.3. GHCi¶
- GHCi’s
:kind!
command now expands through type synonyms in addition to type families. See:kind
. - GHCi’s
:edit
command now looks for an editor in theVISUAL
environment variable beforeEDITOR
, following UNIX convention. (#19030) - GHC now follows by default the XDG Base Directory Specification. If
$HOME/.ghc
is found it will fallback to the old paths to give you time to migrate. This fallback will be removed in three releases. - New debugger command
:ignore
to set anignore count
for a specified breakpoint. The nextignore count
times the program hits this breakpoint, the breakpoint is ignored, and the program doesn’t stop. - New optional parameter added to the command
:continue
to set theignore count
for the current breakpoint.
2.1.4. Runtime system¶
The parallel garbage collector is now significantly more performant. Heavily contended spinlocks have been replaced with mutexes and condition variables. For most programs compiled with the threaded runtime, and run with more than four capabilities, we expect minor GC pauses and GC cpu time both to be reduced.
For very short running programs (in the order of 10s of milliseconds), we have seen some performance regressions. We recommend programs affected by this to either compile with the single threaded runtime, or otherwise to disable the parallel garbage collector with
-qg ⟨gen⟩
.We don’t expect any other performance regressions, however only limited benchmarking has been done. We have only benchmarked GHC and nofib and only on linux.
Users are advised to reconsider the rts flags that programs are run with. If you have been mitigating poor parallel GC performance by: using large nurseries (
-A
), disabling load balancing (-qb ⟨gen⟩
), or limiting parallel GC to older generations (-qg ⟨gen⟩
); then you may find these mitigations are no longer necessary.The heap profiler now has proper treatment of pinned
ByteArray#
s. Such heap objects will now be correctly attributed to their appropriate cost centre instead of merely being lumped into thePINNED
category. Moreover, we now correctly account for the size of the array, meaning that space lost to fragmentation is no longer counted as live data.The
-xt
RTS flag has been removed. Now STACK and TSO closures are always included in heap profiles. Tooling can choose to filter out these closure types if necessary.A new heap profiling mode,
-hi
, profile by info table allows for fine-grain banding by the info table address of a closure. The profiling mode is intended to be used with-finfo-table-map
and can best be consumed witheventlog2html
. This profiling mode does not require a profiling build.The RTS will now gradually return unused memory back to the OS rather than retaining a large amount (up to 4 * live) indefinitely. The rate at which memory is returned is controlled by the
-Fd ⟨factor⟩
. Memory return is triggered by consecutive idle collections.The default nursery size,
-A
, has been increased from 1mb to 4mb.
2.1.5. Template Haskell¶
There are two new functions
putDoc
andgetDoc
, which allow Haddock documentation to be attached and read from module headers, declarations, function arguments, class instances and family instances. These functions are quite low level, so thewithDecDoc
function provides a more ergonomic interface for this. SimilarlyfunD_doc
,dataD_doc
and friends provide an easy way to document functions and constructors alongside their arguments simultaneously.$(withDecsDoc "This does good things" [d| foo x = 42 |])
2.1.6. ghc-prim
library¶
Void#
is now a type synonym for the unboxed tuple(# #)
. Code usingVoid#
now has to enableUnboxedTuples
.
2.1.7. Eventlog¶
- Two new events,
BLOCKS_SIZE
tells you about the total size of all allocated blocks andMEM_RETURN
gives statistics about why the OS is returning and retaining megablocks.
2.1.8. ghc
library¶
There is a significant refactoring in the solver; any type-checker plugins will have to be updated, as GHC no longer uses flattening skolems or flattening metavariables.
Type checker plugins which work with the natural numbers now should use
naturalTy
kind instead oftypeNatKind
, which has been removed.The
con_args
field ofConDeclGADT
has been renamed tocon_g_args
. This is because the type ofcon_g_args
is now different from the type of thecon_args
field inConDeclH98
:data ConDecl pass = ConDeclGADT { ... , con_g_args :: HsConDeclGADTDetails pass -- ^ Arguments; never infix , ... } | ConDeclH98 { ... , con_args :: HsConDeclH98Details pass -- ^ Arguments; can be infix , ... }
Where:
-- Introduced in GHC 9.2; was called `HsConDeclDetails` in previous versions of GHC type HsConDeclH98Details pass = HsConDetails (HsScaled pass (LBangType pass)) (XRec pass [LConDeclField pass]) -- Introduced in GHC 9.2 data HsConDeclGADTDetails pass = PrefixConGADT [HsScaled pass (LBangType pass)] | RecConGADT (XRec pass [LConDeclField pass])
Unlike Haskell98-style constructors, GADT constructors cannot be declared using infix syntax, which is why
HsConDeclGADTDetails
lacks anInfixConGADT
constructor.As a result of all this, the
con_args
field is now partial, so usingcon_args
as a top-level field selector is discouraged.
2.1.9. base
library¶
The lifted fixed-width integer and word types (e.g.
Data.Int.Int8
,Data.Word.Word32
) are now represented by their associated fixed-width unlifted types. For instance, while in previous GHC versionsInt8
was defined as:data Int8 = I8# Int#
As of GHC 9.2 it is rather defined as,
data Int8 = I8# Int8#
Character set metadata bumped to Unicode 13.0.0.
It’s possible now to promote the
Natural
type:data Coordinate = Mk2D Natural Natural type MyCoordinate = Mk2D 1 10
The separate kind
Nat
is removed and now it is just a type synonym forNatural
. As a consequence, one must enableTypeSynonymInstances
in order to define instances forNat
.The
Numeric
module receivesshowBin
andreadBin
to show and read integer numbers in binary.Char
gets type-level support by analogy with strings and natural numbers. We extend theGHC.TypeLits
module with these built-in type-families:type family CmpChar (a :: Char) (b :: Char) :: Ordering type family ConsSymbol (a :: Char) (b :: Symbol) :: Symbol type family UnconsSymbol (a :: Symbol) :: Maybe (Char, Symbol) type family CharToNat (c :: Char) :: Natural type family NatToChar (n :: Natural) :: Char
and with the type class
KnownChar
(and such additional functions ascharVal
andcharVal'
):class KnownChar (n :: Char) charVal :: forall n proxy. KnownChar n => proxy n -> Char charVal' :: forall n. KnownChar n => Proxy# n -> Char
A new kind-polymorphic
Compare
type family was added inData.Type.Ord
and has type instances forNat
,Symbol
, andChar
. Furthermore, the(<=?)
type (and(<=)
) fromGHC.TypeNats
is now governed by this type family (as well as new comparison type operators that are exported byData.Type.Ord
). This has two important repercussions. First, GHC can no longer deduce that all natural numbers are greater than or equal to zero. For instance,test1 :: Proxy (0 <=? x) -> Proxy True test1 = id
which previously type checked will now result in a type error. Second, when these comparison type operators are used very generically, a kind may need to be provided. For example,
test2 :: Proxy (x <=? x) -> Proxy True test2 = id
will now generate a type error because GHC does not know the kind of
x
. To fix this, one must provide an explicit kind, perhaps by changing the type to:test2 :: forall (x :: Nat). Proxy (x <=? x) -> Proxy True
On POSIX,
System.IO.openFile
can no longer leak a file descriptor if it is interrupted by an asynchronous exception (#19114, #19115).There’s a new binding
GHC.Exts.considerAccessible
. It’s equivalent toTrue
and allows the programmer to turn off pattern-match redundancy warnings for particular clauses, like the third one hereg :: Bool -> Int g x = case (x, x) of (True, True) -> 1 (False, False) -> 2 (True, False) | considerAccessible -> 3 -- No warning!
2.1.9.1. Included libraries¶
The package database provided with this distribution also contains a number of packages other than GHC itself. See the changelogs provided with these packages for further change information.
Package | Version | Reason for inclusion |
---|---|---|
ghc | 9.2.4 | The compiler itself |
Cabal | 3.6.3.0 | Dependency of ghc-pkg utility |
Win32 | 2.12.0.1 | Dependency of ghc library |
array | 0.5.4.0 | Dependency of ghc library |
base | 4.16.3.0 | Core library |
binary | 0.8.9.0 | Dependency of ghc library |
bytestring | 0.11.3.1 | Dependency of ghc library |
containers | 0.6.5.1 | Dependency of ghc library |
deepseq | 1.4.6.1 | Dependency of ghc library |
directory | 1.3.6.2 | Dependency of ghc library |
exceptions | 0.10.4 | Dependency of ghc and haskeline library |
filepath | 1.4.2.2 | Dependency of ghc library |
ghc-boot-th | 9.2.4 | Internal compiler library |
ghc-boot | 9.2.4 | Internal compiler library |
ghc-compact | 0.1.0.0 | Core library |
ghc-heap | 9.2.4 | GHC heap-walking library |
ghc-prim | 0.8.0 | Core library |
ghci | 9.2.4 | The REPL interface |
haskeline | 0.8.2 | Dependency of ghci executable |
hpc | 0.6.1.0 | Dependency of hpc executable |
integer-gmp | 1.1 | Core library |
libiserv | 9.2.4 | Internal compiler library |
mtl | 2.2.2 | Dependency of Cabal library |
parsec | 3.1.15.0 | Dependency of Cabal library |
pretty | 1.1.3.6 | Dependency of ghc library |
process | 1.6.13.2 | Dependency of ghc library |
stm | 2.5.0.2 | Dependency of haskeline library |
template-haskell | 2.18.0.0 | Core library |
terminfo | 0.4.1.5 | Dependency of haskeline library |
text | 1.2.5.0 | Dependency of Cabal library |
time | 1.11.1.1 | Dependency of ghc library |
transformers | 0.5.6.2 | Dependency of ghc library |
unix | 2.7.2.2 | Dependency of ghc library |
xhtml | 3000.2.2.1 | Dependency of haddock executable |