PyDIP User Manual
Contents
Currently, most functionality in the PyDIP module is directly mirrored from the DIPlib library. That is, function names and signatures are mostly identical to those in DIPlib. Please see the documentation for DIPlib to learn how to use these functions. Type T here to bring up a search dialog box where you can find functions by name.
To install the package from PyPI, use
pip install diplib
To read images through the Bio-Formats library, you will need to download it separately:
python -m diplib download_bioformats
Note that Bio-Formats also requires a working Java installation.
This user manual discusses the differences with the DIPlib library.
Type correspondences
Most classes defined in DIPlib and used as input arguments to functions have a Python binding, with the following exceptions:
-
dip::DataType
: pass a string such as'UINT8'
or'SFLOAT'
. -
dip::Sample
: pass a scalar (a regular Python number). -
dip::Pixel
: pass a list of scalars. -
dip::Range
: pass a slice (slice(0, 3, 1)
). Note that the second argument, the end value, is interpreted differently by DIPlib: it is included in the range. You can also pass a scalar here. -
dip::UnsignedArray
,dip::FloatArray
, or similar: pass a Python list:[5, 5]
. A scalar is accepted as a one-element list. -
dip::StringArray
: pass a list of strings (['foo', 'bar']
). -
dip::StringSet
: pass a set of strings ({'foo', 'bar'}
).
By using named arguments, it is quite simple to set only needed arguments, and leave all others with their default values. All arguments that have a default value in C++ have the same default value in Python.
Displaying images
The class dip.Image
has a method Show()
. There is an identical function
dip.Show()
. They display an image to the current matplotlib window, if
matplotlib is installed:
import diplib as dip img = dip.ImageReadTIFF('cameraman') img.Show()
By default, the image intensities are mapped to the full display range (i.e. the minimum image intensity is black and the maximum is white). This can be changed for example as follows:
img.Show('unit') # maps [0,1] to the display range img.Show('8bit') # maps [0,255] to the display range img.Show('orientation') # maps [0,pi] to the display range img.Show('base') # keeps 0 to the middle grey level, and uses a divergent color map img.Show('log') # uses logarithmic mapping
Type help(dip.Show)
in Python to learn about many more options.
If DIPviewer is installed, its functionality will be in the dip.viewer
namespace. Use img.ShowSlice()
for convenience. Depending on the backend used, it
will be necessary to do dip.viewer.Spin()
to interact with the created
window. Spin()
interrupts the interactive session until all DIPviewer
windows have been closed. Even when Spin()
is not needed to interact
with the windows, it should be run before closing the Python session to
avoid a series of error messages. Alternatively, periodically call
dip.viewer.Draw()
.
dip.Image.ShowSlice()
and dip.viewer.Show()
have additional parameters
that can be used to set viewing options. They also return an object that can be used for further interaction:
wdw = img.ShowSlice('Window title', mapping='unit', lut='sequential') dip.viewer.Spin()
Type help(dip.viewer.Show)
for details.
Indexing into images
Indexing into a dip.Image
object works as it does for other array types in
Python:
img[0] img[0:10] img[0:-1:2, 0:-1:2]
Note that dimensions are ordered in reverse from how NumPy stores them (the first dimension is horizontal, or x).
It is possible to assign to a subset of the image pixels using indexing:
img[0] = 0 img[0:10] = img[20:30] img[0:-1:2, 0:-1:2] = 255
Unlike in DIPlib, the square brackets index into spatial dimensions. To index into tensor dimensions, use round brackets (parentheses):
img(0) img(0, 2) img(slice(0, 3))
The output of any of these indexing operations shares data with the original image, so writing to that output also changes the original image:
img2 = img(0) # this copy shares data with img img2.Fill(100) # same as img(0).Fill(100) img(1).Copy(img(0)) img(2)[:,:] = img(0) # note that img(2) = img(0) is not allowed by Python img2 = img(0).Copy() # this copy does not share data with img img2.Fill(100) # does not affect img
Irregular indexing using a mask image is also supported. This indexing returns a copy of the data, but an assignment form is also available:
img2 = img[mask] # this copy does not share data with img img2.Fill(0) # does not affect img img[mask] = 0 # sets all pixels in mask to 0
Testing image validity
You can use either IsForged()
or IsEmpty()
to test if an image is forged.
IsEmpty()
is the opposite of IsForged()
, and returns True
if this image is not forged.
An image in a Boolean context, such as if image
, has the value of IsForged()
. This means
that the following two pieces of code are identical:
if image.IsForged(): print("The image has data") if image: print("The image has data")
Functions that expect an image interpret None
as an empty (non-forged) image.
Operators applied to images
Most operators have been overloaded to do what they do in C++:
- Arithmetic operators:
+
,-
,/
,%
, and the unary+
and-
. - Bit-wise logical operators:
&
,|
,^
. - Comparison operators:
<
,<=
,==
,>=
,>
and!=
.
Operators that work differently in Python and C++:
- Indexing operators: see Indexing into images.
- Multiplication operators: Python has both
*
and@
.@
behaves like*
in C++.*
is always the element-wise multiplication. - Exponentiation operator: Python has
**
, this doesn’t exist in C++.
For operators that have an in-place version (e.g. +
has a +=
), we have always overloaded the in-place
version as well.
Not overloaded are the integer division //
, del
, and the shift operators <<
and >>
. Logical operators
and
, or
and not
cannot be overloaded.
len()
and str()
have also been overloaded. len(image)
returns the number of pixels (i.e. it is the same
as image.NumberOfPixels()
if the image is forged, or 0 otherwise). str(image)
returns a string containing
what you’d see if you do print(image)
.
Note that operator chaining, where x < y < z
is interpreted as x < y and y < z
, does not work as expected
with DIPlib images (in the same way it doesn’t work with NumPy arrays). This expression, where x
or y
is an image, will evaluate to y < z
. This is because and
here first evaluates its left-hand-side argument
as a boolean expression. If x
or y
is an image, this will always evaluate as true, as described
in Testing image validity. and
will then evaluate to its right-hand-side argument, y < z
.
Mixing NumPy arrays and DIPlib images
A NumPy array can be passed instead of an image to any DIPlib function. In fact, any Python object that uses the buffer interface implicitly casts to an image. The reverse is also true: NumPy treats DIPlib images as an array, you can call any NumPy function on an image. However, some code that accepts a NumPy array calls methods of the array, which would not be defined for a DIPlib image. For example,
array = np.zeros((10, 11)) dip.Gauss(array) # OK img = dip.Image((11, 10)) np.amax(img) # OK img.max() # error! np.array method not defined for dip.Image img.shape # error! np.array property not defined for dip.Image
One can “cast” from a NumPy array to a DIPlib image and back, without copying the data:
x = np.asarray(img) y = dip.Image(array)
The image and the array point to the same memory in these two cases: modifying values in the one cause the other to see the modified values as well
The mapping from images to arrays causes the indexes to be reversed: The first array index corresponds
to the last image index, and vice versa. If an image is indexed as img[x,y,z]
, the corresponding
array is indexed as array[z,y,x]
. 2D NumPy arrays are typically interpreted with the first dimension
being vertical (y) and the second horizontal (x). This is how they are printed to the console, and how
pyplot.imshow
displays them as images. Preserving the indexing order between DIPlib and NumPy would
therefore cause 2D images to be shown transposed by other Python tools.
Thus, the following indexing operations are identical:
array = np.zeros((10, 11, 12)) img = dip.Image(array) array[1, 2, 3] == img[3, 2, 1]
Furthermore, by reversing the
indexing, we map an image with normal strides to an array in NumPy’s standard C-ordering.
The following evaluates to True
:
dip.Image( np.zeros((10,11,5,7)) ).HasNormalStrides()
When using a NumPy array as an image in a DIPlib function, it is implicitly cast to a dip.Image
object as above, and passed to the DIPlib function. This means that, whether the input is a NumPy
array or a DIPlib image, other function parameters that identify dimensions are always interpreted
in the same way. For example, the filter sizes are ordered (x, y, z), not (z, y, x) as they would be
ordered in scikit-image or other Python imaging libraries.
By calling dip.ReverseDimensions()
(which one should do only directly after loading the diplib
module to avoid confusing results), PyDIP is configured to reverse dimensions of all DIPlib images.
This means that the NumPy indexing order will be preserved, images will be indexed as img[z,y,x]
.
This has several surprising results, for example the direction of all angles is reversed, with
positive angles being counter-clockwise instead of clockwise. This option is intended to make it
easier to mix DIPlib functions into code that also uses e.g. scikit-image.
When casting a tensor image to a NumPy array, the tensor dimension will become the last array dimension.
When casting a NumPy array to a DIPlib image, there is no information about which dimension, if
any, is the tensor dimension. By default the following heuristic is used: if the array has more than
two dimensions, and if the smaller of the last or the first array dimension has no more than 4 elements,
then that dimension will be the tensor dimension. The tensor will have a column vector shape (this is the
default tensor shape in DIPlib). The threshold of 4 was picked because it will handle correctly all color
images. This threshold can be adjusted using dip.SetTensorConversionThreshold()
. If set to 0, all arrays
will be converted to a scalar image.
DIPjavaio, or how to use Bio-Formats
When using an installation of DIPlib that has DIPjavaio (the installation from PyPI does), and Bio-Formats has been downloaded according to the instructions at the top of this page, then one can load images from a file in any of the 160 or so formats currently supported by Bio-Formats.
The function dip.ImageRead()
will, in this case, use Bio-Formats if it doesn’t recognize the file type
(currently this is for any file that is not ICS, TIFF, PNG, JPEG or NPY). Adding format="bioformats"
as an
argument will cause Bio-Formats to be used even for these known file types.
dip.ImageRead()
is a simple interface, it just reads the first image seen. DIPlib has specialized functions
for each file type, which allow for specifying how an image is to be read. For the Bio-Formats “format”,
this function is dip.javaio.ImageReadJavaIO()
. It has a parameter interface
, which defaults to our
Bio-Formats interface. In principle other parameters are possible here, but no other interfaces currently exist.
The other parameter is imageNumber
, which specifies which image from a multi-image file format to read.
This is what Bio-Formats refers to as the “series”.
Note that Bio-Formats doesn’t always have the same interpretation of a file as DIPlib.
For example, a multi-page TIFF file is always seen as multiple images by DIPlib.
dip.ImageReadTIFF("file", imageNumbers=slice(0, 5))
will read the first 5 images (pages) and stack them
along the 3rd dimension (assuming they’re of the same size). In contrast, Bio-Formats will read all the
pages as a single 3D image. Only if the images have different sizes will it see them as individual images.
To use the dip.javaio.ImageReadJavaIO()
function, dip.javaio
must first be imported. This doesn’t happen
automatically when importing the DIPlib package because it takes a bit of time and it is not always needed.
dip.ImageRead()
will import dip.javaio
when first called. Otherwise, one can explicitly import it via
import diplib as dip import diplib.javaio
Note that we cannot import dip.javaio
, as dip
is an alias and the import
statement does not resolve it.
But after importing we can refer to dip.javaio
.
As an alternative, one can, for example,
import diplib.javaio as dj
and then use dj.ImageReadJavaIO()
.