5. Introduction to Types and Generic Programming#
5.1. Overview#
In Julia, arrays and tuples are the most important data types for working with numerical data.
In this lecture we give more details on
declaring types
abstract types
motivation for generic programming
multiple dispatch
building user-defined types
using LinearAlgebra, Statistics
5.2. Finding and Interpreting Types#
5.2.1. Finding The Type#
As we have seen in the previous lectures, in Julia all values have a type, which can be queried using the typeof
function
@show typeof(1)
@show typeof(1.0);
typeof(1) = Int64
typeof(1.0) = Float64
The hard-coded values 1
and 1.0
are called literals in a programming
language, and the compiler deduces their types (Int64
and Float64
respectively in the example above).
You can also query the type of a value
x = 1
typeof(x)
Int64
The name x
binds to the value 1
, created as a literal.
5.2.2. Parametric Types#
(See parametric types documentation).
The next two types use curly bracket notation to express the fact that they are parametric
@show typeof(1.0 + 1im)
@show typeof(ones(2, 2));
typeof(1.0 + 1im) = ComplexF64
typeof(ones(2, 2)) = Matrix{Float64}
We will learn more details about generic programming later, but the key is to interpret the curly brackets as swappable parameters for a given type.
For example, Array{Float64, 2} can be read as

Array is a parametric type representing a dense array, where the first parameter is the type stored and the second is the number of dimensions.
Float64 is a concrete type declaring that the data stored will be a particular size of floating point.
2 is the number of dimensions of that array.
A concrete type is one where values can be created by the compiler (equivalently, one which can be the result of typeof(x)
for some object x
).
Values of a parametric type cannot be concretely constructed unless all of the parameters are given (themselves with concrete types).
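For example, you can check concreteness directly with the built-in isconcretetype function

@show isconcretetype(Array{Float64, 2})
@show isconcretetype(Array{Float64}); # false, since the dimension parameter is missing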
In the case of Complex{Float64}

Complex is the parametric complex number type (incomplete until its parameter is given).
Float64 is a concrete type declaring the type that the real and imaginary parts of the value store.
Other types to consider are the Tuple and NamedTuple
x = (1, 2.0, "test")
@show typeof(x)
typeof(x) = Tuple{Int64, Float64, String}
Tuple{Int64, Float64, String}
In this case, Tuple
is the parametric type, and the three parameters are a list of the types of each value.
For a named tuple
x = (a = 1, b = 2.0, c = "test")
@show typeof(x)
typeof(x) = @NamedTuple{a::Int64, b::Float64, c::String}
@NamedTuple{a::Int64, b::Float64, c::String}
The parametric NamedTuple
type contains two parameters: first a list of names for each field of the tuple, and second the underlying Tuple
type to store the values.
Anytime a value is prefixed by a colon, as in the :a
above, the type is Symbol
– a special kind of string used by the compiler.
typeof(:a)
Symbol
Remark: Note that, by convention, type names use CamelCase – Array
, AbstractArray
, etc.
5.2.3. Variables, Types, and Values#
Since variables and functions are lower case by convention, this can be used to easily identify types when reading code and output.
After assigning a variable name to a value, we can query the type of the value via the name.
x = 42
@show typeof(x);
typeof(x) = Int64
Thus, x
is just a symbol bound to a value of type Int64
.
We can rebind the symbol x
to any other value, of the same type or otherwise.
x = 42.0
42.0
Now x
“points to” another value, of type Float64
typeof(x)
Float64
However, beyond a few notable exceptions (e.g. nothing
used for error handling),
changing types is usually a symptom of poorly organized code, and makes
type inference more difficult for the compiler.
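To sketch why this matters, here is a hypothetical badsum function where a variable is rebound to a value of a different type mid-loop, forcing the compiler to track multiple possibilities

function badsum(x)
    total = 0 # starts as an `Int64`
    for v in x
        total += v # rebinds `total` to a `Float64` once a float is added
    end
    return total
end
badsum([1.5, 2.5]) # returns 4.0, but `total` changed type inside the loop

Initializing with zero(eltype(x)) instead would avoid the issue, a pattern we return to below.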
5.3. The Type Hierarchy#
Let’s discuss how types are organized.
5.3.1. Abstract vs Concrete Types#
(See abstract types documentation)
Up to this point, most of the types we have worked with (e.g., Float64, Int64
) are examples of concrete types.
Concrete types are types that we can instantiate – i.e., pair with data in memory.
We will now examine abstract types that cannot be instantiated (e.g., Real
, AbstractFloat
).
For example, while you will never have a Real
number directly in memory, the abstract types
help us organize and work with related concrete types.
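You can query this distinction directly with the built-in isabstracttype and isconcretetype functions

@show isabstracttype(Real)
@show isconcretetype(Float64);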
5.3.2. Subtypes and Supertypes#
How exactly do abstract types organize or relate different concrete types?
In the Julia language specification, the types form a hierarchy.
You can check if a type is a subtype of another with the <:
operator.
@show Float64 <: Real
@show Int64 <: Real
@show Complex{Float64} <: Real
@show Array <: Real;
Float64 <: Real = true
Int64 <: Real = true
Complex{Float64} <: Real = false
Array <: Real = false
In the above, both Float64
and Int64
are subtypes of Real
, whereas the Complex
numbers are not.
They are, however, all subtypes of Number
@show Real <: Number
@show Float64 <: Number
@show Int64 <: Number
@show Complex{Float64} <: Number;
Real <: Number = true
Float64 <: Number = true
Int64 <: Number = true
Complex{Float64} <: Number = true
Number
in turn is a subtype of Any
, which is a parent of all types.
Number <: Any
true
In particular, the type tree is organized with Any
at the top and the concrete types at the bottom.
We never actually see instances of abstract types (i.e., typeof(x)
never returns an abstract type).
The point of abstract types is to categorize the concrete types, as well as other abstract types that sit below them in the hierarchy.
There are some further functions to help you explore the type hierarchy, such as show_supertypes
which walks up the tree of types to Any
for a given type.
using Base: show_supertypes # import the function from the `Base` module
show_supertypes(Int64)
Int64 <: Signed <: Integer <: Real <: Number <: Any
And the subtypes function, which gives a list of the available subtypes for any packages or code currently loaded
@show subtypes(Real)
@show subtypes(AbstractFloat);
subtypes(Real) = Any[AbstractFloat, AbstractIrrational, Integer, Rational]
subtypes(AbstractFloat) = Any[BigFloat, Float16, Float32, Float64]
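Going the other direction, supertype returns the immediate parent of a given type

@show supertype(Float64)
@show supertype(AbstractFloat);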
5.4. Deducing and Declaring Types#
We will discuss this in detail in generic programming, but much of Julia’s performance gains and generality of notation comes from its type system.
For example
x1 = [1, 2, 3]
x2 = [1.0, 2.0, 3.0]
@show typeof(x1)
@show typeof(x2)
typeof(x1) = Vector{Int64}
typeof(x2) = Vector{Float64}
Vector{Float64} (alias for Array{Float64, 1})
These return Vector{Int64} and Vector{Float64} respectively (aliases for Array{Int64, 1} and Array{Float64, 1}), which the compiler is able to infer from the right-hand side of the expressions.
Given the information on the type, the compiler can work through the sequence of expressions to infer other types.
f(y) = 2y # define some function
x = [1, 2, 3]
z = f(x) # call with an integer array - compiler deduces type
3-element Vector{Int64}:
2
4
6
5.4.1. Good Practices for Functions and Variable Types#
In order to keep many of the benefits of Julia, you will sometimes want to ensure the compiler can always deduce a single type from any function or expression.
An example of bad practice is to use an array to hold unrelated types
x = [1.0, "test", 1] # typically poor style
3-element Vector{Any}:
1.0
"test"
1
The type of this array is Array{Any,1}
, where Any
means the compiler has determined that any valid Julia type can be added to the array.
While occasionally useful, this is to be avoided whenever possible in performance sensitive code.
The other place this can come up is in the declaration of functions.
As an example, consider a function which returns different types depending on the arguments.
function f(x)
if x > 0
return 1.0
else
return 0 # probably meant `0.0`
end
end
@show f(1)
@show f(-1);
f(1) = 1.0
f(-1) = 0
The issue here is relatively subtle: 1.0
is a floating point, while 0
is an integer.
Consequently, given the type of x
, the compiler cannot in general determine what type the function will return.
Whether the compiler can deduce a single return type from the argument types, called type stability, is at the heart of most Julia performance considerations.
Luckily, trying to ensure that functions return the same types is also generally consistent with simple, clear code.
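For example, a type-stable rewrite of the function above (a sketch, with the hypothetical name f_stable) returns a Float64 on both branches

function f_stable(x)
    if x > 0
        return 1.0
    else
        return 0.0 # now the same type as the other branch
    end
end
@show f_stable(1)
@show f_stable(-1);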
5.4.2. Manually Declaring Function and Variable Types#
(See type declarations documentation)
You will notice that in the lecture notes we have never directly declared any types.
This is intentional both for exposition and as a best practice for using packages (as opposed to writing new packages, where declaring these types is very important).
It is also in contrast to some of the sample code you will see in other Julia sources, which you will need to be able to read.
To give an example of the declaration of types, the following are equivalent
function f(x, A)
b = [5.0, 6.0]
return A * x .+ b
end
val = f([0.1, 2.0], [1.0 2.0; 3.0 4.0])
2-element Vector{Float64}:
9.1
14.3
function f2(x::Vector{Float64}, A::Matrix{Float64})::Vector{Float64}
# argument and return types
b::Vector{Float64} = [5.0, 6.0]
return A * x .+ b
end
val = f2([0.1; 2.0], [1.0 2.0; 3.0 4.0])
2-element Vector{Float64}:
9.1
14.3
While declaring the types may be verbose, would it ever generate faster code?
The answer is almost never.
Furthermore, it can lead to confusion and inefficiencies since many things that behave like vectors and matrices are not Matrix{Float64}
and Vector{Float64}
.
Here, the calls to the untyped f work, while the commented-out calls to f2 would fail
@show f([0.1; 2.0], [1 2; 3 4])
@show f([0.1; 2.0], Diagonal([1.0, 2.0]))
# f2([0.1; 2.0], [1 2; 3 4]) # not a `Float64`
# f2([0.1; 2.0], Diagonal([1.0, 2.0])) # not a `Matrix{Float64}`
f([0.1; 2.0], [1 2; 3 4]) = [9.1, 14.3]
f([0.1; 2.0], Diagonal([1.0, 2.0])) = [5.1, 10.0]
2-element Vector{Float64}:
5.1
10.0
5.5. Creating New Types#
(See type declarations documentation)
Up until now, we have used NamedTuple
to collect sets of parameters for our models and examples.
These are useful for maintaining values for model parameters, but you will eventually need to be able to use code that creates its own types.
5.5.1. Syntax for Creating Concrete Types#
(See composite types documentation)
While other sorts of types exist, we almost always use the struct keyword, which is for the creation of composite data types.
“Composite” refers to the fact that the data types in question can be used as a collection of named fields.
The
struct
terminology is used in a number of programming languages to refer to composite data types.
Let’s start with a trivial example where the struct we build has fields named a, b, c, which are not typed
struct FooNotTyped # immutable by default, use `mutable struct` otherwise
a # BAD! not typed
b
c
end
And another where the types of the fields are chosen
struct Foo
a::Float64
b::Int64
c::Vector{Float64}
end
In either case, the compiler generates a function to create new values of the data type, called a “constructor”.
It has the same name as the data type but uses function call notation
foo_nt = FooNotTyped(2.0, 3, [1.0, 2.0, 3.0]) # new `FooNotTyped`
foo = Foo(2.0, 3, [1.0, 2.0, 3.0]) # creates a new `Foo`
@show typeof(foo)
@show foo.a # get the value for a field
@show foo.b
@show foo.c;
# foo.a = 2.0 # fails since it is immutable
typeof(foo) = Foo
foo.a = 2.0
foo.b = 3
foo.c = [1.0, 2.0, 3.0]
You will notice two differences above for the creation of a struct compared to our use of NamedTuple.

Types are declared for the fields, rather than inferred by the compiler.
The construction of a new instance has no named parameters, so arguments can accidentally be passed in the wrong order (see the sketch below).
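For example, passing the fields in the wrong order is caught by the typed Foo but silently accepted by FooNotTyped (a quick sketch)

# Foo([1.0, 2.0, 3.0], 3, 2.0) # fails, cannot convert a `Vector` to a `Float64`
foo_nt_bad = FooNotTyped([1.0, 2.0, 3.0], 3, 2.0) # accepted, with scrambled fields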
5.5.2. Issues with Type Declarations#
Was it necessary to manually declare the types a::Float64
in the above struct?
The answer, in practice, is usually yes.
Without a declaration of the type, the compiler is unable to generate efficient code, and the use of a struct
declared without types could drop performance by orders of magnitude.
Moreover, it is very easy to use the wrong type, or unnecessarily constrain the types.
The first example, which is usually just as low-performance as no declaration of types at all, is to accidentally declare it with an abstract type
struct Foo2
a::Float64
b::Integer # BAD! Not a concrete type
c::Vector{Real} # BAD! Not a concrete type
end
The second issue is that by choosing a type (as in the Foo
above), you may
be unnecessarily constraining what is allowed
f(x) = x.a + x.b + sum(x.c) # use the type
a = 2.0
b = 3
c = [1.0, 2.0, 3.0]
foo = Foo(a, b, c)
@show f(foo) # call with the foo, no problem
# some other types for the values
a = 2 # not a floating point but `f()` would work
b = 3
c = [1.0, 2.0, 3.0]' # transpose is not a `Vector` but `f()` would work
# foo = Foo(a, b, c) # fails to compile
# works with `NotTyped` version, but low performance
foo_nt = FooNotTyped(a, b, c)
@show f(foo_nt);
f(foo) = 11.0
f(foo_nt) = 11.0
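To see the performance gap rather than just the flexibility, one could benchmark the same function on both versions (a sketch assuming BenchmarkTools.jl is installed)

using BenchmarkTools
@btime f($foo) # typed fields, fast
@btime f($foo_nt) # untyped fields, much slower due to dynamic lookups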
5.5.3. Declaring Parametric Types (Advanced)#
(See type parametric types documentation)
Motivated by the above, we can create a type which can adapt to holding fields of different types.
struct Foo3{T1, T2, T3}
a::T1 # could be any type
b::T2
c::T3
end
# works fine
a = 2
b = 3
c = [1.0, 2.0, 3.0]' # transpose is not a `Vector` but `f()` would work
foo = Foo3(a, b, c)
@show typeof(foo)
f(foo)
typeof(foo) = Foo3{Int64, Int64, Adjoint{Float64, Vector{Float64}}}
11.0
Of course, this is probably too flexible, and the f
function might not work on an arbitrary set of a, b, c
.
You could constrain the types based on the abstract parent type using the <:
operator
struct Foo4{T1 <: Real, T2 <: Real, T3 <: AbstractVecOrMat{<:Real}}
a::T1
b::T2
c::T3 # should check dimensions as well
end
foo = Foo4(a, b, c) # no problem, and high performance
@show typeof(foo)
f(foo)
typeof(foo) = Foo4{Int64, Int64, Adjoint{Float64, Vector{Float64}}}
11.0
This ensures that

a and b are subtypes of Real, so the + in the definition of f works
c is a one-dimensional abstract array of Real values
The code works, and is equivalent in performance to a NamedTuple
, but is more verbose and error prone.
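As a quick check of that equivalence, the same generic f works unchanged on a NamedTuple with matching field names (a hypothetical foo_nt2)

foo_nt2 = (a = 2, b = 3, c = [1.0, 2.0, 3.0]')
@show f(foo_nt2); # same 11.0 as with `Foo4`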
5.5.4. Keyword Argument Constructors (Advanced)#
There is no way to avoid learning parametric types if you want to achieve high-performance code.
However, the other issue where constructor arguments are error-prone can be
remedied with the @kwdef
macro from Base
.
using Base: @kwdef
@kwdef struct Foo5
a::Float64 = 2.0 # adds default value
b::Int64
c::Vector{Float64}
end
foo = Foo5(a = 0.1, b = 2, c = [1.0, 2.0, 3.0])
foo2 = Foo5(c = [1.0, 2.0, 3.0], b = 2) # rearrange order, uses default values
@show foo
@show foo2
function f(x)
(; a, b, c) = x # unpacks any struct or named tuple
return a + b + sum(c)
end
f(foo)
foo = Foo5(0.1, 2, [1.0, 2.0, 3.0])
foo2 = Foo5(2.0, 2, [1.0, 2.0, 3.0])
8.1
5.5.5. Tips and Tricks for Writing Generic Functions#
As discussed in the previous sections, there is a major advantage to never declaring a type unless it is absolutely necessary.
The main place where it is necessary is designing code around multiple dispatch.
If you are careful to write code that doesn’t unnecessarily assume types, you will both achieve higher performance and allow seamless use of a number of powerful libraries such as auto-differentiation, static arrays, GPUs, interval arithmetic and root finding, arbitrary precision numbers, and many more packages – including ones that have not even been written yet.
A few simple programming patterns ensure that this is possible
Do not declare types when declaring variables or functions unless necessary.
# BAD
x = [5.0, 6.0, 2.1]
function g(x::Array{Float64, 1}) # not generic!
y = zeros(length(x)) # not generic, hidden float!
z = Diagonal(ones(length(x))) # not generic, hidden float!
q = ones(length(x))
y .= z * x + q
return y
end
g(x)
# GOOD
function g2(x) # or `x::AbstractVector`
y = similar(x)
z = I
q = ones(eltype(x), length(x)) # or `fill(one(x), length(x))`
y .= z * x + q
return y
end
g2(x)
3-element Vector{Float64}:
6.0
7.0
3.1
Preallocate related vectors with similar where possible, and use eltype or typeof. This is important when using multiple dispatch, given the different input types the function can be called with
function g(x)
y = similar(x)
for i in eachindex(x)
y[i] = x[i]^2 # could broadcast
end
return y
end
g([BigInt(1), BigInt(2)])
2-element Vector{BigInt}:
1
4
Use typeof or eltype to declare a type
@show typeof([1.0, 2.0, 3.0])
@show eltype([1.0, 2.0, 3.0]);
typeof([1.0, 2.0, 3.0]) = Vector{Float64}
eltype([1.0, 2.0, 3.0]) = Float64
Beware of hidden floating points
@show typeof(ones(3))
@show typeof(ones(Int64, 3))
@show typeof(zeros(3))
@show typeof(zeros(Int64, 3));
typeof(ones(3)) = Vector{Float64}
typeof(ones(Int64, 3)) = Vector{Int64}
typeof(zeros(3)) = Vector{Float64}
typeof(zeros(Int64, 3)) = Vector{Int64}
Use one and zero to write generic code
@show typeof(1)
@show typeof(1.0)
@show typeof(BigFloat(1.0))
@show typeof(one(BigFloat)) # gets multiplicative identity, passing in type
@show typeof(zero(BigFloat))
x = BigFloat(2)
@show typeof(one(x)) # can call with a variable for convenience
@show typeof(zero(x));
typeof(1) = Int64
typeof(1.0) = Float64
typeof(BigFloat(1.0)) = BigFloat
typeof(one(BigFloat)) = BigFloat
typeof(zero(BigFloat)) = BigFloat
typeof(one(x)) = BigFloat
typeof(zero(x)) = BigFloat
This last example is subtle, because of something called type promotion
Assume reasonable type promotion exists for numeric types
# ACCEPTABLE
function g(x::AbstractFloat)
return x + 1.0 # assumes `1.0` can be converted to something compatible with `typeof(x)`
end
x = BigFloat(1.0)
@show typeof(g(x)); # this has "promoted" the `1.0` to a `BigFloat`
typeof(g(x)) = BigFloat
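The machinery behind this behavior can be queried directly with promote and promote_type

@show promote(1, 1.0) # converts both values to the common type `Float64`
@show promote_type(Int64, BigFloat);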
But sometimes assuming promotion is not enough
# BAD
function g2(x::AbstractFloat)
if x > 0.0 # can't efficiently call with `x::Integer`
return x + 1.0 # OK - assumes you can promote `Float64` to `AbstractFloat`
else
return 0 # BAD! Returns a `Int64`
end
end
x = BigFloat(1.0)
x2 = BigFloat(-1.0)
@show typeof(g2(x))
@show typeof(g2(x2)) # type unstable
# GOOD
function g3(x)
if x > zero(x) # any type with an additive identity
return x + one(x) # more general but less important of a change
else
return zero(x)
end
end
@show typeof(g3(x))
@show typeof(g3(x2)); # type stable
typeof(g2(x)) = BigFloat
typeof(g2(x2)) = Int64
typeof(g3(x)) = BigFloat
typeof(g3(x2)) = BigFloat
These patterns are relatively straightforward, but generic programming can be thought of as a Leontief production function: if any of the functions you write or call are not precise enough, then it may break the chain.
This is all the more reason to exploit carefully designed packages rather than “do-it-yourself”.
5.5.6. A Digression on Style and Naming#
The previous section helps to establish some of the reasoning behind the style choices in these lectures: “be aware of types, but avoid declaring them”.
The purpose of this is threefold:
Provide easy to read code with minimal “syntactic noise” and a clear correspondence to the math.
Ensure that code is sufficiently generic to exploit other packages and types.
Avoid common mistakes and unnecessary performance degradations.
This is just one of many decisions and patterns to ensure that your code is consistent and clear.
The best resource is to carefully read other people's code, and a few of the style guides that established Julia packages follow are also worth reviewing.
Now why would we emphasize naming and style as a crucial part of the lectures?
Because it is an essential tool for creating research that is reproducible and correct.
Some helpful ways to think about this are
Clearly written code is easier to review for errors: The first-order concern of any code is that it correctly implements the whiteboard math.
Code is read many more times than it is written: Saving a few keystrokes in typing a variable name is never worth it, nor is a divergence from the mathematical notation where a single symbol for a variable name would map better to the model.
Write code to be read in the future, not today: If you are not sure anyone else will read the code, then write it for an ignorant future version of yourself who may have forgotten everything, and is likely to misuse the code.
Maintain the correspondence between the whiteboard math and the code: For example, if you change notation in your model, then immediately update all variables in the code to reflect it.
While variable naming takes discipline, code formatting for a specific style guide can be automated using JuliaFormatter.jl which is built into the Julia VS Code extension.
5.5.6.1. Commenting Code#
One common mistake people make when trying to apply these goals is to add in a large number of comments.
Over the years, developers have found that excess comments in code (and especially big comment headers used before every function declaration) can make code harder to read.
The issue is one of syntactic noise: if most of the comments are redundant given clear variable and function names, then the comments make it more difficult to mentally parse and read the code.
If you examine Julia code in packages and the core language, you will see a great amount of care taken in function and variable names, and comments are only added where helpful.
For creating packages that you intend others to use, instead of a comment header, you should use docstrings.
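As a minimal sketch, a docstring is a string literal placed immediately before the definition (here with a hypothetical addone function)

"""
    addone(x)

Return `x` plus the multiplicative identity of its type.
"""
addone(x) = x + one(x)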
5.6. Introduction to Multiple Dispatch#
One of the defining features of Julia is multiple dispatch, whereby the same function name can do different things depending on the underlying types.
Without realizing it, in nearly every function call within packages or the standard library you have used this feature.
To see this in action, consider the absolute value function abs
@show abs(-1) # `Int64`
@show abs(-1.0) # `Float64`
@show abs(0.0 - 1.0im); # `Complex{Float64}`
abs(-1) = 1
abs(-1.0) = 1.0
abs(0.0 - 1.0im) = 1.0
In all of these cases, the abs
function has specialized code depending on the type passed in.
To do this, a function specifies different methods which operate on a particular set of types.
Unlike most cases we have seen before, this requires a type annotation.
To rewrite the abs
function
function ourabs(x::Real)
if x > zero(x) # note, not 0!
return x
else
return -x
end
end
function ourabs(x::Complex)
sqrt(real(x)^2 + imag(x)^2)
end
@show ourabs(-1) # `Int64`
@show ourabs(-1.0) # `Float64`
@show ourabs(1.0 - 2.0im); # `Complex{Float64}`
ourabs(-1) = 1
ourabs(-1.0) = 1.0
ourabs(1.0 - 2.0im) = 2.23606797749979
Note that in the above, x works for any subtype of Real, including Int64, Float64, and ones you may not have realized exist
x = -2 // 3 # `Rational` number, -2/3
@show typeof(x)
@show ourabs(x);
typeof(x) = Rational{Int64}
ourabs(x) = 2//3
You will also note that we used an abstract type, Real
, and an incomplete
parametric type, Complex
, when defining the above functions.
Unlike the creation of struct
fields, there is no penalty in using abstract
types when you define function parameters, as they are used purely to determine which version of a function to use.
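You can list the methods that dispatch chooses between with the methods function

methods(ourabs) # lists the `Real` and `Complex` methods defined above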
5.6.1. Multiple Dispatch in Algorithms (Advanced)#
If you want an algorithm to have specialized versions when given different input types, you need to declare the types for the function inputs.
As an example where this could come up, assume that we have some grid x
of values, the results of a function f
applied at those values, and want to calculate an approximate derivative using forward differences.
In that case, given \(x_n, x_{n+1}, f(x_n)\) and \(f(x_{n+1})\), the forward-difference approximation of the derivative is

\[
f'(x_n) \approx \frac{f(x_{n+1}) - f(x_n)}{x_{n+1} - x_n}
\]
To implement this calculation for a vector of inputs, we notice that there is a specialized implementation if the grid is uniform.
The uniform grid can be implemented using an AbstractRange
, which we can analyze with
typeof
, supertype
and show_supertypes
.
x = range(0.0, 1.0, length = 20)
x_2 = 1:1:20 # if integers
@show typeof(x)
@show typeof(x_2)
@show supertype(typeof(x))
typeof(x) = StepRangeLen{Float64, Base.TwicePrecision{Float64}, Base.TwicePrecision{Float64}, Int64}
typeof(x_2) = StepRange{Int64, Int64}
supertype(typeof(x)) = AbstractRange{Float64}
AbstractRange{Float64}
To see the entire tree about a particular type, use show_supertypes
.
show_supertypes(typeof(x)) # or typeof(x) |> show_supertypes
StepRangeLen{Float64, Base.TwicePrecision{Float64}, Base.TwicePrecision{Float64}, Int64} <: AbstractRange{Float64} <: AbstractVector{Float64} <: Any
show_supertypes(typeof(x_2))
StepRange{Int64, Int64} <: OrdinalRange{Int64, Int64} <: AbstractRange{Int64} <: AbstractVector{Int64} <: Any
The types of the range objects can be very complicated, but are both subtypes of AbstractRange
.
@show typeof(x) <: AbstractRange
@show typeof(x_2) <: AbstractRange;
typeof(x) <: AbstractRange = true
typeof(x_2) <: AbstractRange = true
While you may not know the exact concrete type, any AbstractRange
has an informal set of operations that are available.
@show minimum(x)
@show maximum(x)
@show length(x)
@show step(x);
minimum(x) = 0.0
maximum(x) = 1.0
length(x) = 20
step(x) = 0.05263157894736842
Similarly, there are a number of operations available for any AbstractVector
, such as length
.
f(x) = x^2
f_x = f.(x) # calculating at the range values
@show typeof(f_x)
@show supertype(typeof(f_x))
@show supertype(supertype(typeof(f_x))) # walk up tree again!
@show length(f_x); # and many more
typeof(f_x) = Vector{Float64}
supertype(typeof(f_x)) = DenseVector{Float64}
supertype(supertype(typeof(f_x))) = AbstractVector{Float64}
length(f_x) = 20
show_supertypes(typeof(f_x))
Vector{Float64} <: DenseVector{Float64} <: AbstractVector{Float64} <: Any
There are also many functions that can use any AbstractArray
, such as diff
.
?diff
search: diff symdiff setdiff symdiff! setdiff! Cptrdiff_t
diff(A::AbstractVector) # finite difference operator of matrix or vector A
# if A is a matrix, specify the dimension over which to operate with the dims keyword argument
diff(A::AbstractMatrix; dims::Integer)
Hence, we can call this function for anything of type AbstractVector
.
Finally, we can make a high performance specialization for any AbstractVector
and AbstractRange
.
slopes(f_x::AbstractVector, x::AbstractRange) = diff(f_x) / step(x)
slopes (generic function with 1 method)
We can use auto-differentiation to compare the results.
using Plots, ForwardDiff
# operator to get the derivative of this function using AD
D(f) = x -> ForwardDiff.derivative(f, x)
# compare slopes with AD for sin(x)
q(x) = sin(x)
x = 0.0:0.1:4.0
q_x = q.(x)
q_slopes_x = slopes(q_x, x)
D_q_x = D(q).(x) # broadcasts AD across vector
plot(x[1:(end - 1)], D_q_x[1:(end - 1)], label = "q' with AD")
plot!(x[1:(end - 1)], q_slopes_x, label = "q slopes")
Consider a variation where we pass a function instead of an AbstractArray
slopes(f::Function, x::AbstractRange) = diff(f.(x)) / step(x) # broadcast function
@show typeof(q) <: Function
@show typeof(x) <: AbstractRange
q_slopes_x = slopes(q, x) # use slopes(f::Function, x)
@show q_slopes_x[1];
typeof(q) <: Function = true
typeof(x) <: AbstractRange = true
q_slopes_x[1] = 0.9983341664682815
Finally, if x is an AbstractArray and not an AbstractRange, we can no longer rely on a uniform step.
For this, we add in a version calculating slopes with forward first-differences
# broadcasts over the diff
slopes(f::Function, x::AbstractArray) = diff(f.(x)) ./ diff(x)
x_array = Array(x) # convert range to array
@show typeof(x_array) <: AbstractArray
q_slopes_x = slopes(q, x_array)
@show q_slopes_x[1];
typeof(x_array) <: AbstractArray = true
q_slopes_x[1] = 0.9983341664682815
In the final example, we see that it is able to use specialized implementations over both the f
and the x
arguments.
This is the “multiple” in multiple dispatch.
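If you ever want to confirm which method a particular call dispatches to, the @which macro from InteractiveUtils (loaded by default in the REPL) reports it

using InteractiveUtils: @which
@which slopes(q, x) # the (f::Function, x::AbstractRange) method
@which slopes(q, x_array) # the (f::Function, x::AbstractArray) method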
5.7. Exercises#
5.7.1. Exercise 1#
Explore the package StaticArrays.jl.
Describe two abstract types and the hierarchy of three different concrete types.
Benchmark the calculation of some simple linear algebra with a static array compared to the following for a dense array, for N = 3 and N = 15.
using BenchmarkTools
N = 3
A = rand(N, N)
x = rand(N)
@btime $A * $x # the $ in front of variable names is sometimes important
@btime inv($A)
61.446 ns (1 allocation: 80 bytes)
462.207 ns (4 allocations: 1.86 KiB)
3×3 Matrix{Float64}:
1.55745 -0.545213 0.179581
-1.70698 0.464573 1.52309
0.247411 1.33384 -1.97723
5.7.2. Exercise 2#
A key step in the calculation of the Kalman Filter is the calculation of the Kalman gain, as can be seen in the following example using dense matrices from the Kalman lecture.
Using what you learned from Exercise 1, benchmark this using StaticArrays.jl
Sigma = [0.4 0.3;
0.3 0.45]
G = I
R = 0.5 * Sigma
gain(Sigma, G, R) = Sigma * G' * inv(G * Sigma * G' + R)
@btime gain($Sigma, $G, $R)
613.898 ns (9 allocations: 1.77 KiB)
2×2 Matrix{Float64}:
0.666667 1.11022e-16
1.11022e-16 0.666667
How many times faster are static arrays in this example?
5.7.3. Exercise 3#
The Polynomials.jl package provides simple univariate polynomials.
using Polynomials
p = Polynomial([2, -5, 2], :x) # :x just gives a symbol for display
@show p
p′ = derivative(p) # gives the derivative of p, another polynomial
@show p(0.1), p′(0.1) # call like a function
@show roots(p); # find roots such that p(x) = 0
p = Polynomial(2 - 5*x + 2*x^2)
(p(0.1), p′(0.1)) = (1.52, -4.6)
roots(p) = [0.5, 2.0]
Plot both p(x)
and p′(x)
for \(x \in [-2, 2]\).
5.7.4. Exercise 4#
Use your solution to Exercise 8(a/b) in Introductory Examples to
create a specialized version of Newton’s method for Polynomials
using the derivative
function.
The signature of the function should be newtonsmethod(p::Polynomial, x_0; tolerance = 1E-7, maxiter = 100)
,
where p::Polynomial
ensures that this version of the function will be used anytime a polynomial is passed (i.e. dispatch).
Compare the results of this function to the built-in roots(p)
function.
5.7.5. Exercise 5 (Advanced)#
The trapezoidal rule approximates an integral with

\[
\int_{\underline{x}}^{\bar{x}} f(x)\, dx \approx \sum_{n=1}^{N} \frac{f(x_{n-1}) + f(x_n)}{2} \Delta x_n
\]

where \(x_0 = {\underline{x}},\, x_N = \bar{x}\), and \(\Delta x_n \equiv x_n - x_{n-1}\).
Given an x and a function f, implement a few variations of the trapezoidal rule using multiple dispatch

trapezoidal(f, x) for any x::AbstractArray and f::AbstractArray where length(x) == length(f)
trapezoidal(f, x) for any x::AbstractRange and f::AbstractArray where length(x) == length(f). Exploit the fact that AbstractRange has constant step sizes to specialize the algorithm.
trapezoidal(f, x̲, x̄, N) where f::Function and the other arguments are Real. For this, build a uniform grid with N points on [x̲, x̄], call the f function at those grid points, and use the existing trapezoidal(f, x) implementation.
With these:

Test each variation of the function with \(f(x) = x^2\) with \(\underline{x}=0,\, \bar{x} = 1\).
From the analytical solution of the function, plot the error of trapezoidal(f, x̲, x̄, N) relative to the analytical solution for a grid of different N values.
Consider trying different functions for \(f(x)\) and compare the solutions for various N.
When trying different functions, instead of integrating by hand consider using a high-accuracy library for numerical integration such as QuadGK.jl
using QuadGK
f(x) = x^2
value, accuracy = quadgk(f, 0.0, 1.0)
(0.3333333333333333, 0.0)
5.7.6. Exercise 6 (Advanced)#
Take a variation of your code in Exercise 5.
Use auto-differentiation to calculate the following derivative for the example functions

\[
\frac{d}{d\bar{x}} \int_{\underline{x}}^{\bar{x}} f(x)\, dx
\]
Hint: See the following code for the general pattern, and be careful to follow the rules for generic programming.
using ForwardDiff
function f(a, b; N = 50)
r = range(a, b, length = N)
return mean(r)
end
Df(x) = ForwardDiff.derivative(y -> f(0.0, y), x)
@show f(0.0, 3.0)
@show f(0.0, 3.1)
Df(3.0)
f(0.0, 3.0) = 1.5
f(0.0, 3.1) = 1.55
0.5