gnuastro-0.16/doc/gnuastro.info-3

This is gnuastro.info, produced by makeinfo version 6.8 from
gnuastro.texi.

This book documents version 0.16 of the GNU Astronomy Utilities
(Gnuastro).  Gnuastro provides various programs and libraries for
astronomical data manipulation and analysis.

   Copyright © 2015-2021, Free Software Foundation, Inc.

     Permission is granted to copy, distribute and/or modify this
     document under the terms of the GNU Free Documentation License,
     Version 1.3 or any later version published by the Free Software
     Foundation; with no Invariant Sections, no Front-Cover Texts, and
     no Back-Cover Texts.  A copy of the license is included in the
     section entitled “GNU Free Documentation License”.
INFO-DIR-SECTION Astronomy
START-INFO-DIR-ENTRY
* Gnuastro: (gnuastro).       GNU Astronomy Utilities.
* libgnuastro: (gnuastro)Gnuastro library. Full Gnuastro library doc.

* help-gnuastro: (gnuastro)help-gnuastro mailing list. Getting help.

* bug-gnuastro: (gnuastro)Report a bug. How to report bugs

* Arithmetic: (gnuastro)Arithmetic. Arithmetic operations on pixels.
* astarithmetic: (gnuastro)Invoking astarithmetic. Options to Arithmetic.

* BuildProgram: (gnuastro)BuildProgram. Compile and run programs using Gnuastro’s library.
* astbuildprog: (gnuastro)Invoking astbuildprog. Options to BuildProgram.

* ConvertType: (gnuastro)ConvertType. Convert different file types.
* astconvertt: (gnuastro)Invoking astconvertt. Options to ConvertType.

* Convolve: (gnuastro)Convolve. Convolve an input file with kernel.
* astconvolve: (gnuastro)Invoking astconvolve. Options to Convolve.

* CosmicCalculator: (gnuastro)CosmicCalculator. For cosmological params.
* astcosmiccal: (gnuastro)Invoking astcosmiccal. Options to CosmicCalculator.

* Crop: (gnuastro)Crop. Crop region(s) from image(s).
* astcrop: (gnuastro)Invoking astcrop. Options to Crop.

* Fits: (gnuastro)Fits. View and manipulate FITS extensions and keywords.
* astfits: (gnuastro)Invoking astfits. Options to Fits.

* MakeCatalog: (gnuastro)MakeCatalog. Make a catalog from labeled image.
* astmkcatalog: (gnuastro)Invoking astmkcatalog. Options to MakeCatalog.

* MakeNoise: (gnuastro)MakeNoise. Make (add) noise to an image.
* astmknoise: (gnuastro)Invoking astmknoise. Options to MakeNoise.

* MakeProfiles: (gnuastro)MakeProfiles. Make mock profiles.
* astmkprof: (gnuastro)Invoking astmkprof. Options to MakeProfiles.

* Match: (gnuastro)Match. Match two separate catalogs.
* astmatch: (gnuastro)Invoking astmatch. Options to Match.

* NoiseChisel: (gnuastro)NoiseChisel. Detect signal in noise.
* astnoisechisel: (gnuastro)Invoking astnoisechisel. Options to NoiseChisel.

* Segment: (gnuastro)Segment. Segment detections based on signal structure.
* astsegment: (gnuastro)Invoking astsegment. Options to Segment.

* Query: (gnuastro)Query. Access remote databases for downloading data.
* astquery: (gnuastro)Invoking astquery. Options to Query.

* Statistics: (gnuastro)Statistics. Get image Statistics.
* aststatistics: (gnuastro)Invoking aststatistics. Options to Statistics.

* Table: (gnuastro)Table. Read and write FITS binary or ASCII tables.
* asttable: (gnuastro)Invoking asttable. Options to Table.

* Warp: (gnuastro)Warp. Warp a dataset to a new grid.
* astwarp: (gnuastro)Invoking astwarp. Options to Warp.

* astscript: (gnuastro)Installed scripts. Gnuastro’s installed scripts.
* astscript-sort-by-night: (gnuastro)Invoking astscript-sort-by-night. Options to this script
* astscript-radial-profile: (gnuastro)Invoking astscript-radial-profile. Options to this script
* astscript-ds9-region: (gnuastro)Invoking astscript-ds9-region. Options to this script

END-INFO-DIR-ENTRY


File: gnuastro.info,  Node: ConvertType,  Next: Table,  Prev: Fits,  Up: Data containers

5.2 ConvertType
===============

The FITS format used in astronomy was defined mainly for archiving,
transmission, and processing.  In other situations, the data might be
useful in other formats.  For example, when you are writing a paper or
report, or if you are making slides for a talk, you can’t use a FITS
image.  Other image formats should be used.  In other cases you might
want your pixel values in a table format as plain text for input to
other programs that don’t recognize FITS. ConvertType is created for
such situations.  The various types will increase with future updates
and based on need.

   The conversion is not only one way (from FITS to other formats), but
two ways (except the EPS and PDF formats(1)).  So you can also convert a
JPEG image or text file into a FITS image.  Basically, other than
EPS/PDF, you can use any of the recognized formats as different color
channel inputs to get any of the recognized outputs.  So before
explaining the options and arguments (in *note Invoking astconvertt::),
we’ll start with a short description of the recognized files types in
*note Recognized file formats::, followed a short introduction to
digital color in *note Color::.

* Menu:

* Recognized file formats::     Recognized file formats
* Color::                       Some explanations on color.
* Aligning images with small WCS offsets::  When the WCS slightly differs.
* Annotations for figure in paper::  Adding coordinates or physical scale.
* Invoking astconvertt::        Options and arguments to ConvertType.

   ---------- Footnotes ----------

   (1) Because EPS and PDF are vector, not raster/pixelated formats


File: gnuastro.info,  Node: Recognized file formats,  Next: Color,  Prev: ConvertType,  Up: ConvertType

5.2.1 Recognized file formats
-----------------------------

The various standards and the file name extensions recognized by
ConvertType are listed below.  Currently Gnuastro uses the file name’s
suffix to identify the format.

FITS or IMH
     Astronomical data are commonly stored in the FITS format (or the
     older data IRAF ‘.imh’ format), a list of file name suffixes which
     indicate that the file is in this format is given in *note
     Arguments::.

     Each image extension of a FITS file only has one value per
     pixel/element.  Therefore, when used as input, each input FITS
     image contributes as one color channel.  If you want multiple
     extensions in one FITS file for different color channels, you have
     to repeat the file name multiple times and use the ‘--hdu’,
     ‘--hdu2’, ‘--hdu3’ or ‘--hdu4’ options to specify the different
     extensions.

JPEG
     The JPEG standard was created by the Joint photographic experts
     group.  It is currently one of the most commonly used image
     formats.  Its major advantage is the compression algorithm that is
     defined by the standard.  Like the FITS standard, this is a raster
     graphics format, which means that it is pixelated.

     A JPEG file can have 1 (for gray-scale), 3 (for RGB) and 4 (for
     CMYK) color channels.  If you only want to convert one JPEG image
     into other formats, there is no problem, however, if you want to
     use it in combination with other input files, make sure that the
     final number of color channels does not exceed four.  If it does,
     then ConvertType will abort and notify you.

     The file name endings that are recognized as a JPEG file for input
     are: ‘.jpg’, ‘.JPG’, ‘.jpeg’, ‘.JPEG’, ‘.jpe’, ‘.jif’, ‘.jfif’ and
     ‘.jfi’.

TIFF
     TIFF (or Tagged Image File Format) was originally designed as a
     common format for scanners in the early 90s and since then it has
     grown to become very general.  In many aspects, the TIFF standard
     is similar to the FITS image standard: it can allow data of many
     types (see *note Numeric data types::), and also allows multiple
     images to be stored in a single file (each image in the file is
     called a ‘directory’ in the TIFF standard).  However, unlike FITS,
     it can only store images, it has no constructs for tables.  Another
     (inconvenient) difference with the FITS standard is that keyword
     names are stored as numbers, not human-readable text.

     However, outside of astronomy, because of its support of different
     numeric data types, many fields use TIFF images for accurate (for
     example 16-bit integer or floating point for example) imaging data.

     Currently ConvertType can only read TIFF images, if you are
     interested in writing TIFF images, please get in touch with us.

EPS
     The Encapsulated PostScript (EPS) format is essentially a one page
     PostScript file which has a specified size.  PostScript also
     includes non-image data, for example lines and texts.  It is a
     fully functional programming language to describe a document.
     Therefore in ConvertType, EPS is only an output format and cannot
     be used as input.  Contrary to the FITS or JPEG formats, PostScript
     is not a raster format, but is categorized as vector graphics.

     The Portable Document Format (PDF) is currently the most common
     format for documents.  Some believe that PDF has replaced
     PostScript and that PostScript is now obsolete.  This view is
     wrong, a PostScript file is an actual plain text file that can be
     edited like any program source with any text editor.  To be able to
     display its programmed content or print, it needs to pass through a
     processor or compiler.  A PDF file can be thought of as the
     processed output of the compiler on an input PostScript file.
     PostScript, EPS and PDF were created and are registered by Adobe
     Systems.

     With these features in mind, you can see that when you are
     compiling a document with TeX or LaTeX, using an EPS file is much
     more low level than a JPEG and thus you have much greater control
     and therefore quality.  Since it also includes vector graphic lines
     we also use such lines to make a thin border around the image to
     make its appearance in the document much better.  No matter the
     resolution of the display or printer, these lines will always be
     clear and not pixelated.  In the future, addition of text might be
     included (for example labels or object IDs) on the EPS output.
     However, this can be done better with tools within TeX or LaTeX
     such as PGF/Tikz(1).

     If the final input image (possibly after all operations on the flux
     explained below) is a binary image or only has two colors of black
     and white (in segmentation maps for example), then PostScript has
     another great advantage compared to other formats.  It allows for 1
     bit pixels (pixels with a value of 0 or 1), this can decrease the
     output file size by 8 times.  So if a gray-scale image is binary,
     ConvertType will exploit this property in the EPS and PDF (see
     below) outputs.

     The standard formats for an EPS file are ‘.eps’, ‘.EPS’, ‘.epsf’
     and ‘.epsi’.  The EPS outputs of ConvertType have the ‘.eps’
     suffix.

PDF
     As explained above, a PDF document is a static document description
     format, viewing its result is therefore much faster and more
     efficient than PostScript.  To create a PDF output, ConvertType
     will make a PostScript page description and convert that to PDF
     using GPL Ghostscript.  The suffixes recognized for a PDF file are:
     ‘.pdf’, ‘.PDF’.  If GPL Ghostscript cannot be run on the PostScript
     file, it will remain and a warning will be printed.

‘blank’
     This is not actually a file type!  But can be used to fill one
     color channel with a blank value.  If this argument is given for
     any color channel, that channel will not be used in the output.

Plain text
     Plain text files have the advantage that they can be viewed with
     any text editor or on the command-line.  Most programs also support
     input as plain text files.  As input, each plain text file is
     considered to contain one color channel.

     In ConvertType, the recognized extensions for plain text files are
     ‘.txt’ and ‘.dat’.  As described in *note Invoking astconvertt::,
     if you just give these extensions, (and not a full filename) as
     output, then automatic output will be preformed to determine the
     final output name (see *note Automatic output::).  Besides these,
     when the format of a file cannot be recognized from its name,
     ConvertType will fall back to plain text mode.  So you can use any
     name (even without an extension) for a plain text input or output.
     Just note that when the suffix is not recognized, automatic output
     will not be preformed.

     The basic input/output on plain text images is very similar to how
     tables are read/written as described in *note Gnuastro text table
     format::.  Simply put, the restrictions are very loose, and there
     is a convention to define a name, units, data type (see *note
     Numeric data types::), and comments for the data in a commented
     line.  The only difference is that as a table, a text file can
     contain many datasets (columns), but as a 2D image, it can only
     contain one dataset.  As a result, only one information comment
     line is necessary for a 2D image, and instead of the starting ‘‘#
     Column N’’ (‘N’ is the column number), the information line for a
     2D image must start with ‘‘# Image 1’’.  When ConvertType is asked
     to output to plain text file, this information comment line is
     written before the image pixel values.

     When converting an image to plain text, consider the fact that if
     the image is large, the number of columns in each line will become
     very large, possibly making it very hard to open in some text
     editors.

Standard output (command-line)
     This is very similar to the plain text output, but instead of
     creating a file to keep the printed values, they are printed on the
     command line.  This can be very useful when you want to redirect
     the results directly to another program in one command with no
     intermediate file.  The only difference is that only the pixel
     values are printed (with no information comment line).  To print to
     the standard output, set the output name to ‘‘stdout’’.

   ---------- Footnotes ----------

   (1) <http://sourceforge.net/projects/pgf/>


File: gnuastro.info,  Node: Color,  Next: Aligning images with small WCS offsets,  Prev: Recognized file formats,  Up: ConvertType

5.2.2 Color
-----------

Color is defined by mixing various measurements/filters.  In digital
monitors or common digital cameras, colors are displayed/stored by
mixing the three basic colors of red, green and blue (RGB) with various
proportions.  When printing on paper, standard printers use the cyan,
magenta, yellow and key (CMYK, key=black) color space.  In other words,
for each displayed/printed pixel of a color image, the dataset/image has
three or four values.

   To store/show the three values for each pixel, cameras and monitors
allocate a certain fraction of each pixel’s area to red, green and blue
filters.  These three filters are thus built into the hardware at the
pixel level.  However, because measurement accuracy is very important in
scientific instruments, and we want to do measurements (take images)
with various/custom filters (without having to order a new expensive
detector!), scientific detectors use the full area of the pixel to store
one value for it in a single/mono channel dataset.  To make measurements
in different filters, we just place a filter in the light path before
the detector.  Therefore, the FITS format that is used to store
astronomical datasets is inherently a mono-channel format (see *note
Recognized file formats:: or *note Fits::).

   When a subject has been imaged in multiple filters, you can feed each
different filter into the red, green and blue channels and obtain a
colored visualization.  In ConvertType, you can do this by giving each
separate single-channel dataset (for example in the FITS image format)
as an argument (in the proper order), then asking for the output in a
format that supports multi-channel datasets (for example JPEG or PDF,
see the examples in *note Invoking astconvertt::).

   As discussed above, color is not defined when a dataset/image
contains a single value for each pixel.  However, we interact with
scientific datasets through monitors or printers (which allow multiple
values per pixel and produce color with them).  As a result, there is a
lot of freedom in visualizing a single-channel dataset.  The most basic
is to use shades of black (because of its strong contrast with white).
This scheme is called grayscale.  To help in visualization, more complex
mappings can be defined.  For example, the values can be scaled to a
range of 0 to 360 and used as the “Hue” term of the Hue-Saturation-Value
(https://en.wikipedia.org/wiki/HSL_and_HSV) (HSV) color space (while
fixing the “Saturation” and “Value” terms).  In ConvertType, you can use
the ‘--colormap’ option to choose between different mappings of
mono-channel inputs, see *note Invoking astconvertt::.

   Since grayscale is a commonly used mapping of single-valued datasets,
we’ll continue with a closer look at how it is stored.  One way to
represent a gray-scale image in different color spaces is to use the
same proportions of the primary colors in each pixel.  This is the
common way most FITS image viewers work: for each pixel, they fill all
the channels with the single value.  While this is necessary for
displaying a dataset, there are downsides when storing/saving this type
of grayscale visualization (for example in a paper).

   • Three (for RGB) or four (for CMYK) values have to be stored for
     every pixel, this makes the output file very heavy (in terms of
     bytes).

   • If printing, the printing errors of each color channel can make the
     printed image slightly more blurred than it actually is.

   To solve both these problems when storing grayscale visualization,
the best way is to save a single-channel dataset into the black channel
of the CMYK color space.  The JPEG standard is the only common standard
that accepts CMYK color space.

   The JPEG and EPS standards set two sizes for the number of bits in
each channel: 8-bit and 12-bit.  The former is by far the most common
and is what is used in ConvertType.  Therefore, each channel should have
values between 0 to 2^8-1=255.  From this we see how each pixel in a
gray-scale image is one byte (8 bits) long, in an RGB image, it is 3
bytes long and in CMYK it is 4 bytes long.  But thanks to the JPEG
compression algorithms, when all the pixels of one channel have the same
value, that channel is compressed to one pixel.  Therefore a Grayscale
image and a CMYK image that has only the K-channel filled are
approximately the same file size.


File: gnuastro.info,  Node: Aligning images with small WCS offsets,  Next: Annotations for figure in paper,  Prev: Color,  Up: ConvertType

5.2.3 Aligning images with small WCS offsets
--------------------------------------------

In order to have nice color images, it is important that the images be
properly aligned.  This is usually the case in many scenarios, but it
some times happens that the images have a small WCS offset, even though
they have the same size.  In such cases you can use the script below to
align the images into approximately the same pixel grid (to within about
0.5 pixels which is sufficient in many color-image usage scenarios).

   The script below does the job using Gnuastro’s *note Warp:: and *note
Crop:: programs.  Simply copy the lines below into a plain-text file
with your favorite text editor and save it as ‘my-align.sh’.  Don’t
forget to set the variables of the first three lines to specify the file
names (without the ‘.fits’ suffix) and the HDUs of your inputs.  These
four lines are all you need to edit, leave the rest unchanged.  Also, if
you are copy/pasting the script from a PDF, be careful that the
single-quotes used in AWK may need to be corrected.

     #!/bin/sh

     # Set the input names (without the '.fits' suffix),
     # and their HDUs.
     r=RED_IMAGE_NO_SUFFIX;   rhdu=1
     g=GREEN_IMAGE_NO_SUFFIX; ghdu=1
     b=BLUE_IMAGE_NO_SUFFIX;  bhdu=1

     # To stop the script if there is a crash
     set -e

     # Align all the images to the celestial poles.
     astwarp $r.fits --align -h$rhdu -o $r-aligned.fits
     astwarp $g.fits --align -h$ghdu -o $g-aligned.fits
     astwarp $b.fits --align -h$bhdu -o $b-aligned.fits

     # Calculate the final WCS-based center and image-based width based on
     # the G-band (in RGB) image.
     centerwcs=$(astfits $g-aligned.fits --skycoverage --quiet \
                         | awk 'NR==1{printf "%g %g", $1,$2}')
     widthpix=$(astfits $g-aligned.fits -h1 --quiet \
                        --keyvalue=NAXIS1,NAXIS2 \
                    | awk '{printf "%d,%d", $1, $2}')

     # Crop all the images around the desired center and width.
     for f in $r $g $b; do
       centerpix=$(echo $centerwcs \
                        | asttable -c'arith $1 $2 wcs-to-img' \
                                   --wcsfile=$f-aligned.fits \
                        | awk '{printf "%g,%g", $1, $2}')
       astcrop $f-aligned.fits --mode=img --width=$widthpix \
               --center=$centerpix -o$f-use.fits
       rm $f-aligned.fits
     done

   Once you have have saved the file and come back to your command-line
you can run the script like this:

     $ chmod +x my-align.sh
     $ ./my-align.sh

Of course, feel free to hack it and modify it to fit your datasets, like
the rest of Gnuastro, this script is released under GNU GPLv.3 and
above, see *note Your rights::.


File: gnuastro.info,  Node: Annotations for figure in paper,  Next: Invoking astconvertt,  Prev: Aligning images with small WCS offsets,  Up: ConvertType

5.2.4 Annotations for figure in paper
-------------------------------------

To make a nice figure from your FITS images, it is important to show
more than merely the raw image (converted to a printer friendly format
like PDF or JPEG). Annotations (or visual metadata) over the raw image
greatly help the readers clearly see your argument and put the
image/result in a larger context.  Examples include:
   • Coordinates (Right Ascension and Declination) on the edges of the
     image, so viewers of your paper or presentation slides can get a
     physical feeling of the field’s sky coverage.
   • Thick line that has a fixed tangential size (for example in kilo
     parsecs) at the redshift/distance of interest.
   • Contours over the image to show radio/X-ray emission, over an
     optical image for example.
   • Text or arrows or etc, over certain parts of the image.

   Because of the modular philosophy of Gnuastro, ConvertType is only
focused on converting your FITS images to printer friendly formats like
JPEG or PDF. But to present your results in a slide or paper, you will
often need to annotate the raw JPEG or PDF with some of the features
above.  The good news is that there are many powerful plotting programs
that you can use to add such annotations.  As a result, there is no
point in making a new one, specific to Gnuastro.  In this section, we’ll
demonstrate this using the very powerful PGFPlots(1) package of LaTeX.

*Single script for easy running:* In this section we are reviewing the
reason and details of every step which is good for educational purposes.
But when you know the steps already, these separate code blocks can be
annoying.  Therefore the full script (except for the data download step)
is available in *note Full script of annotations on figure::.

   PGFPlots uses the same LaTeX graphic engine that typesets your
paper/slide.  Therefore when you build your plots and figures using
PGFPlots (and its underlying package PGF/TiKZ(2)) your plots will blend
beautifully within your text: same fonts, same colors, same line
properties and etc.  Since most papers (and presentation slides(3)) are
made with LaTeX, PGFPlots is therefore the best tool for those who use
LaTeX to create documents.  PGFPlots also doesn’t need any extra
dependencies beyond a basic/minimal TeX-live installation, so it is much
more reliable than tools like Matplotlib in Python that have hundreds of
fast-evolving dependencies(4).

   To demonstrate this, we’ll create a surface brightness image of a
galaxy in the F160W filter of the ABYSS survey(5).  In the code-block
below, let’s make a “build” directory to keep intermediate files and
avoid populating the source.  Afterwards, we’ll download the full image
and crop out a 20 arcmin wide image around the galaxy with the commands
below.  You can run these commands in an empty directory.

     $ mkdir build
     $ wget http://cdsarc.u-strasbg.fr/ftp/J/A+A/621/A133/fits/ah_f160w.fits
     $ astcrop ah_f160w.fits --center=53.1616278,-27.7802446 --mode=wcs \
               --width=20/3600 --output=build/crop.fits

   To better show the low surface brightness (LSB) outskirts, we’ll warp
the image, then convert the pixel units to surface brightness with the
commands below.  It is very important that the warping is done _before_
the conversion to surface brightness (in units of mag/arcsec$^2$),
because the definition of surface brightness is non-linear.  For more,
see the Surface brightness topic of *note Brightness flux magnitude::.

     $ zeropoint=25.94
     $ astwarp build/crop.fits --centeroncorner --scale=1/3 \
               --output=build/scaled.fits
     $ pixarea=$(astfits build/scaled.fits --pixelscale --quiet \
                         | awk '{print $1*3600*$2*3600}')
     $ astarithmetic build/scaled.fits abs $zeropoint counts-to-mag \
                     $pixarea log10 2.5 x + --output=build/sb.fits

   We are now ready to convert the surface brightness image into a PDF.
To better show the LSB features, we’ll also limit the color range with
the ‘--fluxlow’ and ‘--fluxhigh’ options: all pixels with a surface
brightness brighter than 22 mag/arcsec$^2$ will be shown as black, and
all pixels with a surface brightness fainter than 30 mag/arcsec$^2$ will
be white.  These thresholds are being defined as variables, because we
will also need them below (to pass into PGFPlots).  We will also set
‘--borderwidth=0’, because the coordinate system we will add over the
image will effectively be a border for the image (separating it from the
background).

     $ sblow=22
     $ sbhigh=30
     $ astconvertt build/sb.fits --colormap=gray --borderwidth=0 \
                   --fluxhigh=$sbhigh --fluxlow=$sblow --output=build/sb.pdf

   Please open ‘sb.pdf’ and have a look.  Also, please open ‘sb.fits’ in
DS9 (or any other FITS viewer) and play with the color range.  Can the
surface brightness limits be changed to better show the LSB structure?
If so, you are free to change the limits above.

   We now have the printable PDF representation of the image, but as
discussed above, its not enough for a paper.  We’ll add 1) a thick line
showing the size of 20 kpc (kilo parsecs) at the redshift of the central
galaxy, 2) coordinates and 3) a color bar, showing the surface
brightness level of each grayscale level.

   To get the first job done, we first need to know the redshift of the
central galaxy.  To do this, we can use Gnuastro’s Query program to look
into all the objects in NED within this image (only asking for the RA,
Dec and redshift columns).  We will then use the Match program to find
the NED entry that corresponds to our galaxy.

     $ astquery ned --dataset=objdir --overlapwith=build/sb.fits \
                --column=ra,dec,z --output=ned.fits
     $ astmatch ned.fits -h1 --coord=53.1616278,-27.7802446 \
                --ccol1=RA,Dec --aperture=1/3600
     $ redshift=$(asttable ned_matched.fits -cz)
     $ echo $redshift

   Now that we know the redshift of the central object, we can define
the coordinates of the thick line that will show the length of 20 kpc at
that redshift.  It will be a horizontal line (fixed Declination) across
a range of RA. The start of this thick line will be located at the top
edge of the image (at the 95-percent of the width and height of the
image).  With the commands below we’ll find the three necessary
parameters (one declination and two RAs).  Just note that in
astronomical images, RA increases to the left/east, which is the reason
we are using the minimum and ‘+’ to find the RA starting point.

     $ scalelineinkpc=20
     $ coverage=$(astfits build/sb.fits --skycoverage --quiet | awk 'NR==2')
     $ scalelinedec=$(echo      $coverage | awk '{print $4-($4-$3)*0.05}')
     $ scalelinerastart=$(echo  $coverage | awk '{print $1+($2-$1)*0.05}')
     $ scalelineraend=$(astcosmiccal --redshift=$redshift --arcsectandist \
                           | awk '{start='$scalelinerastart'; \
                                  width='$scalelineinkpc'/$1/3600; \
                                  print start+width}')

   To draw coordinates over the image, we need to feed these values into
PGFPlots.  But manually entering numbers into the PGFPlots source will
be very frustrating and prone to many errors!  Fortunately there is an
easy way to do this: LaTeX macros.  New macros are defined by this LaTeX
command:
     \newcommand{\macroname}{value}
Anywhere that LaTeX confronts ‘\macroname’, it will replace ‘value’ when
building the output.  We will have one file called ‘macros.tex’ in the
build directory and define macros based on those values.  We will use
the shell’s ‘printf’ command to write these macro definition lines into
the macro file.  We just have to use double backslashes in the ‘printf’
command, because backslash is a meaningful character for ‘printf’, but
we want to keep one of them.  Also, we put a ‘\n’ at the end of each
line, otherwise, all the commands will go into a single line of the
macro file.  We will also place the random ‘‘ma’’ string at the start of
all our LaTeX macros to help identify the macros for this plot.

     $ macros=build/macros.tex
     $ printf '\\newcommand{\\maScaleDec}'"{$scalelinedec}\n" > $macros
     $ printf '\\newcommand{\\maScaleRAa}'"{$scalelinerastart}\n" >> $macros
     $ printf '\\newcommand{\\maScaleRAb}'"{$scalelineraend}\n" >> $macros
     $ printf '\\newcommand{\\maScaleKpc}'"{$scalelineinkpc}\n" >> $macros
     $ printf '\\newcommand{\\maCenterZ}'"{$redshift}\n" >> $macros

   Please open the macros file after these commands and have a look to
see if they do conform to the expected format above.  Another set of
macros we will need to feed into PGFPlots is the coordinates of the
image corners.  Fortunately the ‘coverage’ variable found above is also
useful here.  We just need to extract each item before feeding it into
the macros.  To do this, we’ll use AWK and keep each value with the
temporary shell variable ‘‘v’’.

     $ v=$(echo $coverage | awk '{print $1}')
     $ printf '\\newcommand{\\maCropRAMin}'"{$v}\n" >> $macros
     $ v=$(echo $coverage | awk '{print $2}')
     $ printf '\\newcommand{\\maCropRAMax}'"{$v}\n" >> $macros
     $ v=$(echo $coverage | awk '{print $3}')
     $ printf '\\newcommand{\\maCropDecMin}'"{$v}\n" >> $macros
     $ v=$(echo $coverage | awk '{print $4}')
     $ printf '\\newcommand{\\maCropDecMax}'"{$v}\n" >> $macros

   Finally, we also need to pass some other numbers to PGFPlots: 1) the
major tick distance (in the coordinate axes that will be printed on the
edge of the image).  We’ll assume 7 ticks for this image.  2) The
minimum and maximum surface brightness values that we gave to
ConvertType when making the PDF; PGFPlots will define its color-bar
based on these two values.

     $ v=$(echo $coverage | awk '{print ($2-$1)/7}')
     $ printf '\\newcommand{\\maTickDist}'"{$v}\n" >> $macros
     $ printf '\\newcommand{\\maSBlow}'"{$sblow}\n" >> $macros
     $ printf '\\newcommand{\\maSBhigh}'"{$sbhigh}\n" >> $macros

   All the necessary numbers are now ready.  Please copy the contents
below into a file called ‘my-figure.tex’.  This is the PGFPlots source
for this particular plot.  Besides the coordinates and scale-line, we
will also add some text over the image and an orange arrow pointing to
the central object with its redshift printed over it.  The parameters
are generally human-readable, so you should be able to get a good
feeling of every line.  There are also comments which will show up as a
different color when you copy this into a plain-text editor.

\begin{tikzpicture}

  %% Define the coordinates and colorbar
  \begin{axis}[
      at={(0,0)},
      axis on top,
      x dir=reverse,
      scale only axis,
      width=\linewidth,
      height=\linewidth,
      minor tick num=10,
      xmin=\maCropRAMin,
      xmax=\maCropRAMax,
      ymin=\maCropDecMin,
      ymax=\maCropDecMax,
      enlargelimits=false,
      every tick/.style={black},
      xtick distance=\maTickDist,
      ytick distance=\maTickDist,
      yticklabel style={rotate=90},
      ylabel={Declination (degrees)},
      xlabel={Right Ascension (degrees)},
      ticklabel style={font=\small,
        /pgf/number format/.cd, precision=4,/tikz/.cd},
      x label style={at={(axis description cs:0.5,0.02)},
        anchor=north,font=\small},
      y label style={at={(axis description cs:0.07,0.5)},
        anchor=south,font=\small},
      colorbar,
      colormap name=gray,
      point meta min=\maSBlow,
      point meta max=\maSBhigh,
      colorbar style={
        at={(1.01,1)},
        ylabel={Surface brightness (mag/arcsec$^2$)},
        yticklabel style={
          /pgf/number format/.cd, precision=1, /tikz/.cd},
        y label style={at={(axis description cs:5.3,0.5)},
          anchor=south,font=\small},
      },
    ]

    %% Put the image in the proper positions of the plot.
    \addplot graphics[ xmin=\maCropRAMin,  xmax=\maCropRAMax,
                       ymin=\maCropDecMin, ymax=\maCropDecMax]
             {sb.pdf};

    %% Draw the scale factor.
    \addplot[black, line width=5, name=scaleline] coordinates
            {(\maScaleRAa,\maScaleDec) (\maScaleRAb,\maScaleDec)}
            node [anchor=north west] {\large $\maScaleKpc$ kpc};
  \end{axis}

  %% Add some text anywhere over the plot. The text is added two
  %% times: the first time with a white background (that with a
  %% certain opacity), the second time just the text with opacity.
  \node[anchor=south west, fill=white, opacity=0.5]
       at (0.01\linewidth,0.01\linewidth)
       {(a) Text can be added here};
  \node[anchor=south west]
       at (0.01\linewidth,0.01\linewidth)
       {(a) Text can be added here};

  %% Add an arrow to highlight certain structures.
  \draw [->, red!70!yellow, line width=5]
  (0.35\linewidth,0.35\linewidth)
  -- node [anchor=south, rotate=45]{$z=\maCenterZ$}
  (0.45\linewidth,0.45\linewidth);
\end{tikzpicture}

   Finally, we need another simple LaTeX source for the main PDF
“report” that will host this figure.  This can actually be your paper or
slides for example.  Here, we’ll suffice to the minimal working example.

\documentclass{article}

%% Import the TiKZ package and activate its "external" feature.
\usepackage{tikz}
\usetikzlibrary{external}
\tikzexternalize

%% PGFPlots (which uses TiKZ).
\usepackage{pgfplots}
\pgfplotsset{axis line style={thick}}
\pgfplotsset{
  /pgfplots/colormap={gray}{rgb255=(0,0,0) rgb255=(255,255,255)}
}

%% Import the macros.
\input{macros.tex}

%% Start document.
\begin{document}
You can write anything here.

%% Add the figure and its caption.
\begin{figure}
  \input{my-figure.tex}
  \caption{A demo image.}
\end{figure}

%% Finish the document.
\end{document}

   You are now ready to create the PDF. But LaTeX creates many temporary
files, so to avoid populating our top-level directory, we’ll copy the
two ‘.tex’ files into the build directory, go there and run LaTeX.
Before running it though, we’ll first delete all the files that have the
name pattern ‘*-figure0*’, these are “external” files created by
TiKZ+PGFPlots, including the actual PDF of the figure.

     $ cp report.tex my-figure.tex build
     $ cd build
     $ rm -f *-figure0*
     $ pdflatex -shell-escape -halt-on-error report.tex

   You now have the full “report” in ‘report.pdf’.  Try adding some
extra text on top of the figure, or in the caption and re-running the
last four commands.  Also try changing the 20kpc scale line length to
50kpc, or try changing the redshift, to see how the length and text of
the thick scale-line will automatically change.  But the good news is
that you also have the raw PDF of the figure that you can use in other
places.  You can see that file in ‘report-figure0.pdf’.

   In a larger paper, you can add multiple such figures (with different
‘.tex’ files that are placed in different ‘figure’ environments with
different captions).  Each figure will get a number in the build
directory.  TiKZ also allows setting a file name for each “external”
figure (to avoid such numbers that can be annoying if the image orders
are changed).  PGFPlots is also highly customizable, you can make a lot
of changes and customizations.  Both TiKZ(6) and PGFPLots(7) have
wonderful manuals, so have a look trough them.

* Menu:

* Full script of annotations on figure::  All the steps in one script

   ---------- Footnotes ----------

   (1)
<http://mirrors.ctan.org/graphics/pgf/contrib/pgfplots/doc/pgfplots.pdf>

   (2) <http://mirrors.ctan.org/graphics/pgf/base/doc/pgfmanual.pdf>

   (3) To build slides, LaTeX has packages like Beamer, see
<http://mirrors.ctan.org/macros/latex/contrib/beamer/doc/beameruserguide.pdf>

   (4) See Figure 1 of Alliez et al.  2020 at
<https://arxiv.org/pdf/1905.11123.pdf>

   (5) <http://research.iac.es/proyecto/abyss>

   (6) <http://mirrors.ctan.org/graphics/pgf/base/doc/pgfmanual.pdf>

   (7)
<http://mirrors.ctan.org/graphics/pgf/contrib/pgfplots/doc/pgfplots.pdf>


File: gnuastro.info,  Node: Full script of annotations on figure,  Prev: Annotations for figure in paper,  Up: Annotations for figure in paper

5.2.4.1 Full script of annotations on figure
............................................

In *note Annotations for figure in paper::, we each one of the steps to
add annotations over an image were described in detail.  So if you have
understood the steps, but need to add annotations over an image,
repeating those steps individually will be annoying.  Therefore in this
section, we’ll summarize all the steps in a single script that you can
simply copy-paste into a text editor, configure, and run.

*Necessary files:* To run this script, you will need an image to crop
your object from (here assuming its called ‘ah_f160w.fits’ with a
certain zeropoint) and two ‘my-figure.tex’ and ‘report.tex’ files that
were fully included in *note Annotations for figure in paper::.  Also,
we have brought the redshift as a parameter here.  But if the center of
your image always points to your main object, you can also include the
Query command to automatically find the object’s redshift from NED.
Alternatively, your image may already be cropped, in this case, you can
remove the cropping step and

# Parameters.
sblow=22                 # Minimum surface brightness.
sbhigh=30                # Maximum surface brightness.
bdir=build               # Build directory location on filesystem.
numticks=7               # Number of major ticks in each axis.
redshift=0.619           # Redshift of object of interest.
zeropoint=25.94          # Zeropoint of input image.
scalelineinkpc=20        # Length of scale-line (in kilo parsecs).
input=ah_f160w.fits      # Name of input (to crop).

# Stop the script in case of a crash.
set -e

# Build directory
if ! [ -d $bdir ]; then mkdir $bdir; fi

# Crop out the desired region.
crop=$bdir/crop.fits
astcrop $input --center=53.1616278,-27.7802446 --mode=wcs \
        --width=20/3600 --output=$crop

# Warp the image to larger pixels to show surface brightness better.
scaled=$bdir/scaled.fits
astwarp $crop --centeroncorner --scale=1/3 --output=$scaled

# Calculate the pixel area and convert image to Surface brightness.
sb=$bdir/sb.fits
pixarea=$(astfits $scaled --pixelscale --quiet \
              | awk '{print $1*3600*$2*3600}')
astarithmetic $scaled abs $zeropoint counts-to-mag \
              $pixarea log10 2.5 x + --output=$sb

# Convert the Surface brightness image into PDF.
sbpdf=$bdir/sb.pdf
astconvertt $sb --colormap=gray --borderwidth=0 \
	    --fluxhigh=$sbhigh --fluxlow=$sblow --output=$sbpdf

# Specify the coordinates of the scale line (specifying a certain
# width in kpc). We will put it on the top-right side of the image (5%
# of the full width of the image away from the edge).
coverage=$(astfits $sb --skycoverage --quiet | awk 'NR==2')
scalelinedec=$(echo $coverage | awk '{print $4-($4-$3)*0.05}')
scalelinerastart=$(echo  $coverage | awk '{print $1+($2-$1)*0.05}')
scalelineraend=$(astcosmiccal --redshift=$redshift --arcsectandist \
		     | awk '{start='$scalelinerastart'; \
                             width='$scalelineinkpc'/$1/3600; \
                             print start+width}')

# Write the LaTeX macros to use in plot. Start with the thick line
# showing tangential distance.
macros=$bdir/macros.tex
printf '\\newcommand{\\maScaleDec}'"{$scalelinedec}\n" > $macros
printf '\\newcommand{\\maScaleRAa}'"{$scalelinerastart}\n" >> $macros
printf '\\newcommand{\\maScaleRAb}'"{$scalelineraend}\n" >> $macros
printf '\\newcommand{\\maScaleKpc}'"{$scalelineinkpc}\n" >> $macros
printf '\\newcommand{\\maCenterZ}'"{$redshift}\n" >> $macros

# Add image extrema for the coordinates.
v=$(echo $coverage | awk '{print $1}')
printf '\\newcommand{\maCropRAMin}'"{$v}\n" >> $macros
v=$(echo $coverage | awk '{print $2}')
printf '\\newcommand{\maCropRAMax}'"{$v}\n" >> $macros
v=$(echo $coverage | awk '{print $3}')
printf '\\newcommand{\maCropDecMin}'"{$v}\n" >> $macros
v=$(echo $coverage | awk '{print $4}')
printf '\\newcommand{\maCropDecMax}'"{$v}\n" >> $macros

# Distance between each tick value.
v=$(echo $coverage | awk '{print ($2-$1)/'$numticks'}')
printf '\\newcommand{\maTickDist}'"{$v}\n" >> $macros
printf '\\newcommand{\maSBlow}'"{$sblow}\n" >> $macros
printf '\\newcommand{\maSBhigh}'"{$sbhigh}\n" >> $macros

# Copy the LaTeX source into the build directory and go there to run
# it and have all the temporary LaTeX files there.
cp report.tex my-figure.tex $bdir
cd $bdir
rm -f *-figure0*
pdflatex -shell-escape -halt-on-error report.tex


File: gnuastro.info,  Node: Invoking astconvertt,  Prev: Annotations for figure in paper,  Up: ConvertType

5.2.5 Invoking ConvertType
--------------------------

ConvertType will convert any recognized input file type to any specified
output type.  The executable name is ‘astconvertt’ with the following
general template

     $ astconvertt [OPTION...] InputFile [InputFile2] ... [InputFile4]

One line examples:

     ## Convert an image in FITS to PDF:
     $ astconvertt image.fits --output=pdf

     ## Similar to before, but use the Viridis color map:
     $ astconvertt image.fits --colormap=viridis --output=pdf

     ## Convert an image in JPEG to FITS (with multiple extensions
     ## if its color):
     $ astconvertt image.jpg -oimage.fits

     ## Use three plain text 2D arrays to create an RGB JPEG output:
     $ astconvertt f1.txt f2.txt f3.fits -o.jpg

     ## Use two images and one blank for an RGB EPS output:
     $ astconvertt M31_r.fits M31_g.fits blank -oeps

     ## Directly pass input from output of another program through Standard
     ## input (not a file).
     $ cat 2darray.txt | astconvertt -oimg.fits

The output’s file format will be interpreted from the value given to the
‘--output’ option.  It can either be given on the command-line or in any
of the configuration files (see *note Configuration files::).  Note that
if the output suffix is not recognized, it will default to plain text
format, see *note Recognized file formats::.

   At most four input files (one for each color channel for formats that
allow it) are allowed in ConvertType.  The first input dataset can
either be a file or come from Standard input (see *note Standard
input::).  The order of multiple input files is important.  After
reading the input file(s) the number of color channels in all the inputs
will be used to define which color space to use for the outputs and how
each color channel is interpreted.

   Some formats can allow more than one color channel (for example in
the JPEG format, see *note Recognized file formats::).  If there is one
input dataset (color channel) the output will be gray-scale, if three
input datasets (color channels) are given, they are respectively
considered to be the red, green and blue color channels.  Finally, if
there are four color channels they will be be cyan, magenta, yellow and
black (CMYK colors).

   The value to ‘--output’ (or ‘-o’) can be either a full file name or
just the suffix of the desired output format.  In the former case, it
will used for the output.  In the latter case, the name of the output
file will be set based on the automatic output guidelines, see *note
Automatic output::.  Note that the suffix name can optionally start a
‘.’ (dot), so for example ‘--output=.jpg’ and ‘--output=jpg’ are
equivalent.  See *note Recognized file formats::.

   Besides the common set of options explained in *note Common
options::, the options to ConvertType can be classified into input,
output and flux related options.  The majority of the options are to do
with the flux range.  Astronomical data usually have a very large
dynamic range (difference between maximum and minimum value) and
different subjects might be better demonstrated with a limited flux
range.

Input:
‘-h STR/INT’
‘--hdu=STR/INT’
     Input HDU name or counter (counting from 0) for each input FITS
     file.  If the same HDU should be used from all the FITS files, you
     can use the ‘--globalhdu’ option described below.  In ConvertType,
     it is possible to call the HDU option multiple times for the
     different input FITS or TIFF files in the same order that they are
     called on the command-line.  Note that in the TIFF standard, one
     ‘directory’ (similar to a FITS HDU) may contain multiple color
     channels (for example when the image is in RGB).

     Except for the fact that multiple calls are possible, this option
     is identical to the common ‘--hdu’ in *note Input output options::.
     The number of calls to this option cannot be less than the number
     of input FITS or TIFF files, but if there are more, the extra HDUs
     will be ignored, note that they will be read in the order described
     in *note Configuration file precedence::.

     Unlike CFITSIO, libtiff (which is used to read TIFF files) only
     recognizes numbers (counting from zero, similar to CFITSIO) for
     ‘directory’ identification.  Hence the concept of names is not
     defined for the directories and the values to this option for TIFF
     files must be numbers.

‘-g STR/INT’
‘--globalhdu=STR/INT’
     Use the value given to this option (a HDU name or a counter,
     starting from 0) for the HDU identifier of all the input FITS
     files.  This is useful when all the inputs are distributed in
     different files, but have the same HDU in those files.

Output:

‘-w FLT’
‘--widthincm=FLT’
     The width of the output in centimeters.  This is only relevant for
     those formats that accept such a width (not plain text for
     example).  For most digital purposes, the number of pixels is far
     more important than the value to this parameter because you can
     adjust the absolute width (in inches or centimeters) in your
     document preparation program.

‘-b INT’
‘--borderwidth=INT’
     The width of the border to be put around the EPS and PDF outputs in
     units of PostScript points.  There are 72 or 28.35 PostScript
     points in an inch or centimeter respectively.  In other words,
     there are roughly 3 PostScript points in every millimeter.  If you
     are planning on adding a border, its significance is highly
     correlated with the value you give to the ‘--widthincm’ parameter.

     Unfortunately in the document structuring convention of the
     PostScript language, the “bounding box” has to be in units of
     PostScript points with no fractions allowed.  So the border values
     only have to be specified in integers.  To have a final border that
     is thinner than one PostScript point in your document, you can ask
     for a larger width in ConvertType and then scale down the output
     EPS or PDF file in your document preparation program.  For example
     by setting ‘width’ in your ‘includegraphics’ command in TeX or
     LaTeX.  Since it is vector graphics, the changes of size have no
     effect on the quality of your output quality (pixels don’t get
     different values).

‘-x’
‘--hex’
     Use Hexadecimal encoding in creating EPS output.  By default the
     ASCII85 encoding is used which provides a much better compression
     ratio.  When converted to PDF (or included in TeX or LaTeX which is
     finally saved as a PDF file), an efficient binary encoding is used
     which is far more efficient than both of them.  The choice of EPS
     encoding will thus have no effect on the final PDF.

     So if you want to transfer your EPS files (for example if you want
     to submit your paper to arXiv or journals in PostScript), their
     storage might become important if you have large images or lots of
     small ones.  By default ASCII85 encoding is used which offers a
     much better compression ratio (nearly 40 percent) compared to
     Hexadecimal encoding.

‘-u INT’
‘--quality=INT’
     The quality (compression) of the output JPEG file with values from
     0 to 100 (inclusive).  For other formats the value to this option
     is ignored.  Note that only in gray-scale (when one input color
     channel is given) will this actually be the exact quality (each
     pixel will correspond to one input value).  If it is in color mode,
     some degradation will occur.  While the JPEG standard does support
     loss-less graphics, it is not commonly supported.

‘--colormap=STR[,FLT,...]’
     The color map to visualize a single channel.  The first value given
     to this option is the name of the color map, which is shown below.
     Some color maps can be configured.  In this case, the configuration
     parameters are optionally given as numbers following the name of
     the color map for example see ‘hsv’.  The table below contains the
     usable names of the color maps that are currently supported:

     ‘gray’
     ‘grey’
          Grayscale color map.  This color map doesn’t have any
          parameters.  The full dataset range will be scaled to 0 and
          $2^8-1=255$ to be stored in the requested format.

     ‘hsv’
          Hue, Saturation, Value(1) color map.  If no values are given
          after the name (‘--colormap=hsv’), the dataset will be scaled
          to 0 and 360 for hue covering the full spectrum of colors.
          However, you can limit the range of hue (to show only a
          special color range) by explicitly requesting them after the
          name (for example ‘--colormap=hsv,20,240’).

          The mapping of a single-channel dataset to HSV is done through
          the Hue and Value elements: Lower dataset elements have lower
          “value” _and_ lower “hue”.  This creates darker colors for
          fainter parts, while also respecting the range of colors.

     ‘viridis’
          Viridis is the default colormap of the popular Matplotlib
          module of Python and available in many other visualization
          tools like PGFPlots.

     ‘sls’
          The SLS color range, taken from the commonly used SAO DS9
          (http://ds9.si.edu).  The advantage of this color range is
          that it starts with black, going into dark blue and finishes
          with the brighter colors of red and white.  So unlike the HSV
          color range, it includes black and white and brighter colors
          (like yellow, red) show the larger values.

     ‘sls-inverse’
          The inverse of the SLS color map (see above), where the lowest
          value corresponds to white and the highest value is black.
          While SLS is good for visualizing on the monitor, SLS-inverse
          is good for printing.

‘--rgbtohsv’
     When there are three input channels and the output is in the FITS
     format, interpret the three input channels as red, green and blue
     channels (RGB) and convert them to the hue, saturation, value (HSV)
     color space.

     The currently supported output formats of ConvertType don’t have
     native support for HSV. Therefore this option is only supported
     when the output is in FITS format and each of the hue, saturation
     and value arrays can be saved as one FITS extension in the output
     for further analysis (for example to select a certain color).

Flux range:

‘-c STR’
‘--change=STR’
     (‘=STR’) Change pixel values with the following format ‘"from1:to1,
     from2:to2,..."’.  This option is very useful in displaying labeled
     pixels (not actual data images which have noise) like segmentation
     maps.  In labeled images, usually a group of pixels have a fixed
     integer value.  With this option, you can manipulate the labels
     before the image is displayed to get a better output for print or
     to emphasize on a particular set of labels and ignore the rest.
     The labels in the images will be changed in the same order given.
     By default first the pixel values will be converted then the pixel
     values will be truncated (see ‘--fluxlow’ and ‘--fluxhigh’).

     You can use any number for the values irrespective of your final
     output, your given values are stored and used in the double
     precision floating point format.  So for example if your input
     image has labels from 1 to 20000 and you only want to display those
     with labels 957 and 11342 then you can run ConvertType with these
     options:

          $ astconvertt --change=957:50000,11342:50001 --fluxlow=5e4 \
             --fluxhigh=1e5 segmentationmap.fits --output=jpg

     While the output JPEG format is only 8 bit, this operation is done
     in an intermediate step which is stored in double precision
     floating point.  The pixel values are converted to 8-bit after all
     operations on the input fluxes have been complete.  By placing the
     value in double quotes you can use as many spaces as you like for
     better readability.

‘-C’
‘--changeaftertrunc’
     Change pixel values (with ‘--change’) after truncation of the flux
     values, by default it is the opposite.

‘-L FLT’
‘--fluxlow=FLT’
     The minimum flux (pixel value) to display in the output image, any
     pixel value below this value will be set to this value in the
     output.  If the value to this option is the same as ‘--fluxhigh’,
     then no flux truncation will be applied.  Note that when multiple
     channels are given, this value is used for all the color channels.

‘-H FLT’
‘--fluxhigh=FLT’
     The maximum flux (pixel value) to display in the output image, see
     ‘--fluxlow’.

‘-m INT’
‘--maxbyte=INT’
     This is only used for the JPEG and EPS output formats which have an
     8-bit space for each channel of each pixel.  The maximum value in
     each pixel can therefore be $2^8-1=255$.  With this option you can
     change (decrease) the maximum value.  By doing so you will decrease
     the dynamic range.  It can be useful if you plan to use those
     values for other purposes.

‘-A INT’
‘--forcemin=INT’
     Enforce the value of ‘--fluxlow’ (when its given), even if its
     smaller than the minimum of the dataset and the output is format
     supporting color.  This is particularly useful when you are
     converting a number of images to a common image format like JPEG or
     PDF with a single command and want them all to have the same range
     of colors, independent of the contents of the dataset.  Note that
     if the minimum value is smaller than ‘--fluxlow’, then this option
     is redundant.

     By default, when the dataset only has two values, _and_ the output
     format is PDF or EPS, ConvertType will use the PostScript
     optimization that allows setting the pixel values per bit, not byte
     (*note Recognized file formats::).  This can greatly help reduce
     the file size.  However, when ‘--fluxlow’ or ‘--fluxhigh’ are
     called, this optimization is disabled: even though there are only
     two values (is binary), the difference between them does not
     correspond to the full contrast of black and white.

‘-B INT’
‘--forcemax=INT’
     Similar to ‘--forcemin’, but for the maximum.

‘-i’
‘--invert’
     For 8-bit output types (JPEG, EPS, and PDF for example) the final
     value that is stored is inverted so white becomes black and vice
     versa.  The reason for this is that astronomical images usually
     have a very large area of blank sky in them.  The result will be
     that a large are of the image will be black.  Note that this
     behavior is ideal for gray-scale images, if you want a color image,
     the colors are going to be mixed up.

   ---------- Footnotes ----------

   (1) <https://en.wikipedia.org/wiki/HSL_and_HSV>


File: gnuastro.info,  Node: Table,  Next: Query,  Prev: ConvertType,  Up: Data containers

5.3 Table
=========

Tables are the high-level products of processing on low-leveler data
like images or spectra.  For example in Gnuastro, MakeCatalog will
process the pixels over an object and produce a catalog (or table) with
the properties of each object like magnitudes, positions and etc (see
*note MakeCatalog::).  Each one of these properties is a column in its
output catalog (or table) and for each input object, we have a row.

   When there are only a small number of objects (rows) and not too many
properties (columns), then a simple plain text file is mainly enough to
store, transfer, or even use the produced data.  However, to be more
efficient, astronomers have defined the FITS binary table standard to
store data in a binary format (which cannot be seen in a text editor
text).  This can offer major advantages: the file size will be greatly
reduced and the reading and writing will also be faster (because the RAM
and CPU also work in binary).  The acceptable table formats are fully
described in *note Tables::.

   Binary tables are not easily readable with basic plain-text editors.
There is no fixed/unified standard on how the zero and ones should be
interpreted.  Unix-like operating systems have flourished because of a
simple fact: communication between the various tools is based on human
readable characters(1).  So while the FITS table standards are very
beneficial for the tools that recognize them, they are hard to use in
the vast majority of available software.  This creates limitations for
their generic use.

   Table is Gnuastro’s solution to this problem.  Table has a large set
of operations that you can directly do on any recognized table (like
selecting certain rows, doing arithmetic on the columns and etc).  For
operations that Table doesn’t do internally, FITS tables (ASCII or
binary) are directly accessible to the users of Unix-like operating
systems (in particular those working the command-line or shell, see
*note Command-line interface::).  With Table, a FITS table (in binary or
ASCII formats) is only one command away from AWK (or any other tool you
want to use).  Just like a plain text file that you read with the ‘cat’
command.  You can pipe the output of Table into any other tool for
higher-level processing, see the examples in *note Invoking asttable::
for some simple examples.

   In the sections below we describe how to effectively use the Table
program.  We start with *note Column arithmetic::, where the basic
concept and methods of applying arithmetic operations on one or more
columns are discussed.  Afterwards, in *note Operation precedence in
Table::, we review the various types of operations available and their
precedence in an instance of calling Table.  This is a good place to get
a general feeling of all the things you can do with Table.  Finally, in
*note Invoking asttable::, we give some examples and describe each
option in Table.

* Menu:

* Column arithmetic::           How to do operations on table columns.
* Operation precedence in Table::  Order of running options in Table.
* Invoking asttable::           Options and arguments to Table.

   ---------- Footnotes ----------

   (1) In “The art of Unix programming”, Eric Raymond makes this
suggestion to programmers: “When you feel the urge to design a complex
binary file format, or a complex binary application protocol, it is
generally wise to lie down until the feeling passes.”.  This is a great
book and strongly recommended, give it a look if you want to truly enjoy
your work/life in this environment.


File: gnuastro.info,  Node: Column arithmetic,  Next: Operation precedence in Table,  Prev: Table,  Up: Table

5.3.1 Column arithmetic
-----------------------

In many scenarios, you want to apply some kind of operation on the
columns and save them in another table or feed them into another
program.  With Table you can do a rich set of operations on the contents
of one or more columns in a table, and save the resulting values as new
column(s) in the output table.  For seeing the precedence of Column
arithmetic in relation to other Table operators, see *note Operation
precedence in Table::.

   To enable column arithmetic, the first 6 characters of the value to
‘--column’ (‘-c’) should be the activation word ‘‘arith ’’ (note the
space character in the end, after ‘‘arith’’).  After the activation
word, you can use reverse polish notation to identify the operators and
their operands, see *note Reverse polish notation::.  Just note that
white-space characters are used between the tokens of the arithmetic
expression and that they are meaningful to the command-line environment.
Therefore the whole expression (including the activation word) has to be
quoted on the command-line or in a shell script (see the examples
below).

   To identify a column you can directly use its name, or specify its
number (counting from one, see *note Selecting table columns::).  When
you are giving a column number, it is necessary to prefix the number
with a ‘$’, similar to AWK. Otherwise the number is not distinguishable
from a constant number to use in the arithmetic operation.

   For example with the command below, the first two columns of
‘table.fits’ will be printed along with a third column that is the
result of multiplying the first column with $10^{10}$ (for example to
convert wavelength from Meters to Angstroms).  Note that without the
‘<$>’, it is not possible to distinguish between “1” as a
column-counter, or “1” as a constant number to use in the arithmetic
operation.  Also note that because of the significance of <$> for the
command-line environment, the single-quotes are the recommended quoting
method (as in an AWK expression), not double-quotes (for the
significance of using single quotes see the box below).

     $ asttable table.fits -c1,2 -c'arith $1 1e10 x'

*Single quotes when string contains <$>*: On the command-line, or in
shell-scripts, <$> is used to expand variables, for example ‘echo $PATH’
prints the value (a string of characters) in the variable ‘PATH’, it
will not simply print ‘$PATH’.  This operation is also permitted within
double quotes, so ‘echo "$PATH"’ will produce the same output.  This is
good when printing values, for example in the command below, ‘$PATH’
will expand to the value within it.

     $ echo "My path is: $PATH"

   If you actually want to return the literal string ‘$PATH’, not the
value in the ‘PATH’ variable (like the scenario here in column
arithmetic), you should put it in single quotes like below.  The printed
value here will include the ‘$’, please try it to see for your self and
compare to above.

     $ echo 'My path is: $PATH'

   Therefore, when your column arithmetic involves the <$> sign (to
specify columns by number), quote your ‘arith ’ string with a single
quotation mark.  Otherwise you can use both single or double quotes.

   Alternatively, if the columns have meta-data and the first two are
respectively called ‘AWAV’ and ‘SPECTRUM’, the command above is
equivalent to the command below.  Note that the character ‘<$>’ is no
longer necessary in this scenario (because names will not be confused
with numbers):

     $ asttable table.fits -cAWAV,SPECTRUM -c'arith AWAV 1e10 x'

   Comparison of the two commands above clearly shows why it is
recommended to use column names instead of numbers.  When the columns
have descriptive names, the command/script actually becomes much more
readable, describing the intent of the operation.  It is also
independent of the low-level table structure: for the second command,
the column numbers of the ‘AWAV’ and ‘SPECTRUM’ columns in ‘table.fits’
is irrelevant.

   Column arithmetic changes the values of the data within the column.
So the old column meta data can’t be used any more.  By default the
output column of the arithmetic operation will be given a generic
metadata (for example its name will be ‘ARITH_1’, which is hardly
useful!).  But meta data are critically important and it is good
practice to always have short, but descriptive, names for each columns,
units and also some comments for more explanation.  To add metadata to a
column, you can use the ‘--colmetadata’ option that is described in
*note Invoking asttable:: and *note Operation precedence in Table::.

   Finally, since the arithmetic expressions are a value to ‘--column’,
it doesn’t necessarily have to be a separate option, so the commands
above are also identical to the command below (note that this only has
one ‘-c’ option).  Just be very careful with the quoting!

     $ asttable table.fits -cAWAV,SPECTRUM,'arith AWAV 1e10 x'

   Almost all the arithmetic operators of *note Arithmetic operators::
are also supported for column arithmetic in Table.  In particular, the
few that are not present in the Gnuastro library(1) aren’t yet supported
for column arithmetic.  Besides the operators in *note Arithmetic
operators::, several operators are only available in Table to use on
table columns.

‘wcs-to-img’
     Convert the given WCS positions to image/dataset coordinates based
     on the number of dimensions in the WCS structure of ‘--wcshdu’
     extension/HDU in ‘--wcsfile’.  It will output the same number of
     columns.  The first popped operand is the last FITS dimension.

     For example the two commands below (which have the same output)
     will produce 5 columns.  The first three columns are the input
     table’s ID, RA and Dec columns.  The fourth and fifth columns will
     be the pixel positions in ‘image.fits’ that correspond to each RA
     and Dec.

          $ asttable table.fits -cID,RA,DEC,'arith RA DEC wcs-to-img' \
                     --wcsfile=image.fits
          $ asttable table.fits -cID,RA -cDEC \
                     -c'arith RA DEC wcs-to-img' --wcsfile=image.fits

‘img-to-wcs’
     Similar to ‘wcs-to-img’, except that image/dataset coordinates are
     converted to WCS coordinates.

‘distance-flat’
     Return the distance between two points assuming they are on a flat
     surface.  Note that each point needs two coordinates, so this
     operator needs four operands (currently it only works for 2D
     spaces).  The first and second popped operands are considered to
     belong to one point and the third and fourth popped operands to the
     second point.

     Each of the input points can be a single coordinate or a full table
     column (containing many points).  In other words, the following
     commands are all valid:

          $ asttable table.fits \
                     -c'arith X1 Y1 X2 Y2 distance-flat'
          $ asttable table.fits \
                     -c'arith X Y 12.345 6.789 distance-flat'
          $ asttable table.fits \
                     -c'arith 12.345 6.789 X Y distance-flat'

     In the first case we are assuming that ‘table.fits’ has the
     following four columns ‘X1’, ‘Y1’, ‘X2’, ‘Y2’.  The returned column
     by this operator will be the difference between two points in each
     row with coordinates like the following (‘X1’, ‘Y1’) and (‘X2’,
     ‘Y2’).  In other words, for each row, the distance between
     different points is calculated.  In the second and third cases
     (which are identical), it is assumed that ‘table.fits’ has the two
     columns ‘X’ and ‘Y’.  The returned column by this operator will be
     the difference of each row with the fixed point at (12.345, 6.789).

‘distance-on-sphere’
     Return the spherical angular distance (along a great circle, in
     degrees) between the given two points.  Note that each point needs
     two coordinates (in degrees), so this operator needs four operands.
     The first and second popped operands are considered to belong to
     one point and the third and fourth popped operands to the second
     point.

     Each of the input points can be a single coordinate or a full table
     column (containing many points).  In other words, the following
     commands are all valid:

          $ asttable table.fits \
                     -c'arith RA1 DEC1 RA2 DEC2 distance-on-sphere'
          $ asttable table.fits \
                     -c'arith RA DEC 9.876 5.432 distance-on-sphere'
          $ asttable table.fits \
                     -c'arith 9.876 5.432 RA DEC distance-on-sphere'

     In the first case we are assuming that ‘table.fits’ has the
     following four columns ‘RA1’, ‘DEC1’, ‘RA2’, ‘DEC2’.  The returned
     column by this operator will be the difference between two points
     in each row with coordinates like the following (‘RA1’, ‘DEC1’) and
     (‘RA2’, ‘DEC2’).  In other words, for each row, the angular
     distance between different points is calculated.  In the second and
     third cases (which are identical), it is assumed that ‘table.fits’
     has the two columns ‘RA’ and ‘DEC’.  The returned column by this
     operator will be the difference of each row with the fixed point at
     (9.876, 5.432).

     The distance (along a great circle) on a sphere between two points
     is calculated with the equation below, where $r_1$, $r_2$, $d_1$
     and $d_2$ are the right ascensions and declinations of points 1 and
     2.

    $$\cos(d)=\sin(d_1)\sin(d_2)+\cos(d_1)\cos(d_2)\cos(r_1-r_2)$$

‘ra-to-degree’
     Convert the hour-wise Right Ascension (RA) string, in the
     sexagesimal format of ‘_h_m_s’ or ‘_:_:_’, to degrees.  Note that
     the input column has to have a string format.  In FITS tables,
     string columns are well-defined.  For plain-text tables, please
     follow the standards defined in *note Gnuastro text table format::,
     otherwise the string column won’t be read.
          $ asttable catalog.fits -c'arith RA ra-to-degree'
          $ asttable catalog.fits -c'arith $5 ra-to-degree'

‘dec-to-degree’
     Convert the sexagesimal Declination (Dec) string, in the format of
     ‘_d_m_s’ or ‘_:_:_’, to degrees (a single floating point number).
     For more details please see the ‘ra-to-degree’ operator.

‘degree-to-ra’
     Convert degrees (a column with a single floating point number) to
     the Right Ascension, RA, string (in the sexagesimal format hours,
     minutes and seconds, written as ‘_h_m_s’).  The output will be a
     string column so no further mathematical operations can be done on
     it.  The output file can be in any format (for example FITS or
     plain-text).  If it is plain-text, the string column will be
     written following the standards described in *note Gnuastro text
     table format::.

‘degree-to-dec’
     Convert degrees (a column with a single floating point number) to
     the Declination, Dec, string (in the format of ‘_d_m_s’).  See the
     ‘degree-to-ra’ for more on the format of the output.

‘date-to-sec’
     The popped operand should be a string column in the FITS date
     format (most generally: ‘YYYY-MM-DDThh:mm:ss.ddd...’).  This
     operator will return the corresponding Unix epoch time (number of
     seconds that have passed since 00:00:00 Thursday, January 1st,
     1970).  The returned operand will be named ‘UNIXSEC’ (short for
     Unix-seconds).  If any of the times have sub-second precision, the
     numeric datatype of the column will be 64-bit floating point type.
     Otherwise it will be a 64-bit, signed integer, see *note Numeric
     data types::.

     For example, in the example below we are using this operator, in
     combination with the ‘--keyvalue’ option of the Fits program, to
     sort your desired FITS files by observation date (value in the
     ‘DATE-OBS’ keyword in example below):

          $ astfits *.fits --keyvalue=DATE-OBS --colinfoinstdout \
                    | asttable -cFILENAME,'arith DATE-OBS date-to-sec' \
                               --colinfoinstdout \
                    | asttable --sort=UNIXSEC

     If you don’t need to see the Unix-seconds any more, you can add a
     ‘-cFILENAME’ (short for ‘--column=FILENAME’) at the end.  For more
     on ‘--keyvalue’, see *note Keyword inspection and manipulation::.

   ---------- Footnotes ----------

   (1) For a list of the Gnuastro library arithmetic operators, please
see the macros starting with ‘GAL_ARITHMETIC_OP’ and ending with the
operator name in *note Arithmetic on datasets::.


File: gnuastro.info,  Node: Operation precedence in Table,  Next: Invoking asttable,  Prev: Column arithmetic,  Up: Table

5.3.2 Operation precedence in Table
-----------------------------------

The Table program can do many operations on the rows and columns of the
input tables and they aren’t always applied in the order you call the
operation on the command-line.  In this section we will describe which
operation is done before/after which operation.  Knowing this precedence
table is important to avoid confusion when you ask for more than one
operation.  For a description of each option, please see *note Invoking
asttable::.

Column information (‘--information’ or ‘-i’)
     When given this option, the column data are not read at all.  Table
     simply reads the column metadata (name, units, numeric data type
     and comments), and the number of rows and prints them.  Table then
     terminates and no other operation is done.  This can therefore be
     called at the end of an arbitrarily long Table command only to
     remember the column metadata, then deleted to continue writing the
     command (using the shell’s history to retrieve the previous command
     with an up-arrow key).

Columns from other files (‘--catcolumns’ and ‘--catcolumnfile’)
     With this feature, you can import columns from other tables (in
     other files).  The rest of the operations below are done on the
     rows, therefore you can merge the columns of various tables into
     one table, then start limiting the rows to have in the output.

     If any of the row-based operations below are requested in the same
     Table command, they will also be applied to the rows of these added
     columns.  However, the conditions to keep/reject rows can only be
     applied to the rows of the main input table.

Row selection by value in a column
        • ‘--range’: only keep rows within a certain interval in given
          column.
        • ‘--inpolygon’: only keep rows within the polygon of
          ‘--polygon’.
        • ‘--outpolygon’: only keep rows outside the polygon of
          ‘--polygon’.
        • ‘--equal’: only keep rows with specified value in given
          column.
        • ‘--notequal’: only keep rows without specified value in given
          column.
     These options take certain column(s) as input and remove some rows
     from the full table (all columns), based on the given limitations.
     They can be called any number of times (to limit the final rows
     based on values in different columns for example).  Since these are
     row-rejection operations, their internal order is irrelevant.  In
     other words, it makes no difference if ‘--equal’ is called before
     or after ‘--range’ for example.

     As a side-effect, because NaN/blank values are defined to fail on
     any condition, these operations will also remove rows with
     NaN/blank values in the specified column they are checking.  Also,
     the columns that are used for these operations don’t necessarily
     have to be in the final output table (you may not need the column
     after doing the selection based on it).

     Even though these options are applied after merging columns from
     other tables, currently their condition-columns can only come from
     the main input table.  In other words, even though the rows of the
     added columns (from another file) will also be selected with these
     options, the condition to keep/reject rows cannot be taken from the
     newly added columns.

     These options are applied first because the speed of later
     operations can be greatly affected by the number of rows.  For
     example, if you also call the ‘--sort’ option, and your row
     selection will result in 50 rows (from an input of 1000 rows),
     limiting the number of rows can greatly speed up the sorting in
     your final output.

Sorting (‘--sort’)
     Sort of the rows based on values in a certain column.  The column
     to sort by can only come from the main input table columns (not
     columns that may have been added with ‘--catcolumnfile’).

Row selection (by position)
        • ‘--head’: keep only requested number of top rows.
        • ‘--tail’: keep only requested number of bottom rows.
        • ‘--rowrandom’: keep only a random number of rows.
        • ‘--rowlimit’: keep only rows within a certain positional
          interval.

     These options limit/select rows based on their position within the
     table (not their value in any certain column).

Column arithmetic
     Once the final rows are selected in the requested order, column
     arithmetic is done (if requested).  For more on column arithmetic,
     see *note Column arithmetic::.

Column metadata (‘--colmetadata’)
     Changing column metadata is necessary after column arithmetic or
     adding new columns from other tables (that were done above).

Row selection (‘--noblank’)
     Theoretically, this method of row selection by value should have
     been done with the other logically similar options like ‘--range’
     or ‘--equal’.  However, those options are applied to the raw
     (input) column value.  In some scenarios, you need to apply
     arithmetic operations on the columns (through *note Column
     arithmetic::) before rejecting the undesired rows.  After the
     arithmetic operation is done, you can use the ‘where’ operator to
     set the non-desired columns to NaN/blank and use ‘--noblank’ to
     remove them at the end.  See the example below for applying any
     generic value-based row selection based on ‘--noblank’.

   As an example, let’s review how Table interprets the command below.
We are assuming that ‘table.fits’ contains at least three columns: ‘RA’,
‘DEC’ and ‘PARAM’ and you only want the RA and Dec of the rows where
$p\times 2<5$ ($p$ is the value of each row in the ‘PARAM’ column).

     asttable table.fits -cRA,DEC --noblank=MULTIP \
              -c'arith PARAM 2 x set-i i i 5 gt nan where' \
              --colmetadata=3,MULTIP,unit,"Description of column"

Due to the precedence described in this section, Table does these
operations (which are independent of the order of the operations written
on the command-line):

  1. At the start (with ‘-cRA,DEC’), Table reads the ‘RA’ and ‘DEC’
     columns.
  2. In between all the operations in the command above, Column
     arithmetic (with ‘-c'arith ...'’) has the highest precedence.  So
     the arithmetic operation is done and stored as a new (third)
     column.  In this arithmetic operation, we multiply all the values
     of the ‘PARAM’ column by 2, then set all those with a value larger
     than 5 to NaN (for more on understanding this operation, see the
     ‘‘set-’’ and ‘‘where’’ operators in *note Arithmetic operators::).
  3. Updating column metadata (with ‘--colmetadata’) is then done to
     give a name (‘MULTIP’) to the newly calculated (third) column.
     During the process, besides a name, we also set a unit and
     description for the new column.  These metadata entries are _very
     important_, so always be sure to add metadata after doing column
     arithmetic.
  4. The lowest precedence operation is ‘--noblank=MULTIP’.  So only
     rows that aren’t blank/NaN in the ‘MULTIP’ column are kept.
  5. Finally, the output table (with three columns) is written to the
     command-line.  If you also want to print the column metadata, you
     can use the ‘--colinfoinstdout’ option.  Alternatively, if you want
     the output in a file, you can use the ‘--output’ option to save the
     table in FITS or plain-text format.

*Out of precedence:* It may happen that your desired operation needs a
separate precedence.  In this case you can pipe the output of Table into
another call of Table and use the ‘--colinfoinstdout’ option to preserve
the metadata between the two calls.

   For example, let’s assume that you want to sort the output table from
the example command above based on the new ‘MULTIP’ column.  Since
sorting is done prior to column arithmetic, you can’t do it in one
command, but you can circumvent this limitation by simply piping the
output (including metadata) to another call to Table:

     asttable table.fits -cRA,DEC --noblank=MULTIP --colinfoinstdout \
              -c'arith PARAM 2 x set-i i i 5 gt nan where' \
              --colmetadata=3,MULTIP,unit,"Description of column" \
              | asttable --sort=MULTIP --output=selected.fits


File: gnuastro.info,  Node: Invoking asttable,  Prev: Operation precedence in Table,  Up: Table

5.3.3 Invoking Table
--------------------

Table will read/write, select, modify, or show the information of the
rows and columns in recognized Table formats (including FITS binary,
FITS ASCII, and plain text table files, see *note Tables::).  Output
columns can also be determined by number or regular expression matching
of column names, units, or comments.  The executable name is ‘asttable’
with the following general template

     $ asttable [OPTION...] InputFile

One line examples:

     ## Get the table column information (name, data type, or units):
     $ asttable bintab.fits --information

     ## Print columns named RA and DEC, followed by all the columns where
     ## the name starts with "MAG_":
     $ asttable bintab.fits --column=RA --column=DEC --column=/^MAG_/

     ## Similar to the above, but with one call to `--column' (or `-c'),
     ## also sort the rows by the input's photometric redshift (`Z_PHOT')
     ## column. To confirm the sort, you can add `Z_PHOT' to the columns
     ## to print.
     $ asttable bintab.fits -cRA,DEC,/^MAG_/ --sort=Z_PHOT

     ## Similar to the above, but only print rows that have a photometric
     ## redshift between 2 and 3.
     $ asttable bintab.fits -cRA,DEC,/^MAG_/ --range=Z_PHOT,2:3

     ## Only print rows with a value in the 10th column above 100000:
     $ asttable bintab.fits --range=10,10e5,inf

     ## Only print the 2nd column, and the third column multiplied by 5,
     ## Save the resulting two columns in `table.txt'
     $ asttable bintab.fits -c2,'arith $2 5 x' -otable.fits

     ## Sort the output columns by the third column, save output:
     $ asttable bintab.fits --sort=3 -ooutput.txt

     ## Subtract the first column from the second in `cat.fits' (can also
     ## be a text table) and keep the third and fourth columns.
     $ asttable cat.txt -c'arith $2 $1 -',3,4 -ocat.fits

     ## Convert sexagesimal coordinates to degrees (same can be done in a
     ## large table given as argument).
     $ echo "7h34m35.5498 31d53m14.352s" | asttable

     ## Convert RA and Dec in degrees to sexagesimal (same can be done in a
     ## large table given as argument).
     echo "113.64812416667 31.88732" \
          | asttable -c'arith $1 degree-to-ra $2 degree-to-dec'

   Table’s input dataset can be given either as a file or from Standard
input (piped from another program, see *note Standard input::).  In the
absence of selected columns, all the input’s columns and rows will be
written to the output.  The full set of operations Table can do are
described in detail below, but for a more high-level introduction to the
various operations, and their precedence, see *note Operation precedence
in Table::.

   If any output file is explicitly requested (with ‘--output’) the
output table will be written in it.  When no output file is explicitly
requested the output table will be written to the standard output.  If
the specified output is a FITS file, the type of FITS table (binary or
ASCII) will be determined from the ‘--tabletype’ option.  If the output
is not a FITS file, it will be printed as a plain text table (with space
characters between the columns).  When the columns are accompanied by
meta-data (like column name, units, or comments), this information will
also printed in the plain text file before the table, as described in
*note Gnuastro text table format::.

   For the full list of options common to all Gnuastro programs please
see *note Common options::.  Options can also be stored in directory,
user or system-wide configuration files to avoid repeating on the
command-line, see *note Configuration files::.  Table does not follow
Automatic output that is common in most Gnuastro programs, see *note
Automatic output::.  Thus, in the absence of an output file, the
selected columns will be printed on the command-line with no column
information, ready for redirecting to other tools like ‘awk’.

*Sexagesimal coordinates as floats in plain-text tables:* When a column
is determined to be a floating point type (32-bit or 64-bit) in a
plain-text table, it can contain sexagesimal values in the format of
‘‘_h_m_s’’ (for RA) and ‘‘_d_m_s’’ (for Dec), where the ‘‘_’’s are
place-holders for numbers.  In this case, the string will be immediately
converted to a single floating point number (in units of degrees) and
stored in memory with the rest of the column or table.  Besides being
useful in large tables, with this feature, conversion to sexagesimal
coordinates to degrees becomes very easy, for example:
     echo "7h34m35.5498 31d53m14.352s" | asttable

The inverse can also be done with the more general column arithmetic
operators:
     echo "113.64812416667 31.88732" \
          | asttable -c'arith $1 degree-to-ra $2 degree-to-dec'

If you want to preserve the sexagesimal contents of a column, you should
store that column as a string, see *note Gnuastro text table format::.

‘-i’
‘--information’
     Only print the column information in the specified table on the
     command-line and exit.  Each column’s information (number, name,
     units, data type, and comments) will be printed as a row on the
     command-line.  Note that the FITS standard only requires the data
     type (see *note Numeric data types::), and in plain text tables, no
     meta-data/information is mandatory.  Gnuastro has its own
     convention in the comments of a plain text table to store and
     transfer this information as described in *note Gnuastro text table
     format::.

     This option will take precedence over all other operations in
     Table, so when it is called along with other operations, they will
     be ignored, see *note Operation precedence in Table::.  This can be
     useful if you forget the identifier of a column after you have
     already typed some on the command-line.  You can simply add a ‘-i’
     to your already-written command (without changing anything) and run
     Table, to see the whole list of column names and information.  Then
     you can use the shell history (with the up arrow key on the
     keyboard), and retrieve the last command with all the previously
     typed columns present, delete ‘-i’ and add the identifier you had
     forgot.

‘-c STR/INT’
‘--column=STR/INT’
     Set the output columns either by specifying the column number, or
     name.  For more on selecting columns, see *note Selecting table
     columns::.  If a value of this option starts with ‘‘arith ’’,
     column arithmetic will be activated, allowing you to
     edit/manipulate column contents.  For more on column arithmetic see
     *note Column arithmetic::.

     To ask for multiple columns this option can be used in two ways: 1)
     multiple calls to this option, 2) using a comma between each column
     specifier in one call to this option.  These different solutions
     may be mixed in one call to Table: for example, ‘‘-cRA,DEC,MAG’’,
     or ‘‘-cRA,DEC -cMAG’’ are both equivalent to ‘‘-cRA -cDEC -cMAG’’.
     The order of the output columns will be the same order given to the
     option or in the configuration files (see *note Configuration file
     precedence::).

     This option is not mandatory, if no specific columns are requested,
     all the input table columns are output.  When this option is called
     multiple times, it is possible to output one column more than once.

‘-w FITS’
‘--wcsfile=FITS’
     FITS file that contains the WCS to be used in the ‘wcs-to-img’ and
     ‘img-to-wcs’ operators of *note Column arithmetic::.  The extension
     name/number within the FITS file can be specified with ‘--wcshdu’.

     If the value to this option is ‘‘none’’, no WCS will be written in
     the output.

‘-W STR’
‘--wcshdu=STR’
     FITS extension/HDU in the FITS file given to ‘--wcsfile’ (see the
     description of ‘--wcsfile’ for more).

‘-L FITS/TXT’
‘--catcolumnfile=FITS/TXT’
     Concatenate (or add, or append) the columns of this option’s value
     (a filename) to the output columns.  This option may be called
     multiple times (to add columns from more than one file into the
     final output), the columns from each file will be added in the same
     order that this option is called.  The number of rows in the
     file(s) given to this option has to be the same as the input table
     (before any type of row-selection), see *note Operation precedence
     in Table::.

     By default all the columns of the given file will be appended, if
     you only want certain columns to be appended, use the
     ‘--catcolumns’ option to specify their name or number (see *note
     Selecting table columns::).  Note that the columns given to
     ‘--catcolumns’ must be present in all the given files (if this
     option is called more than once with more than one file).

     If the file given to this option is a FITS file, its necessary to
     also define the corresponding HDU/extension with ‘--catcolumnhdu’.
     Also note that no operation (for example row selection, arithmetic
     or etc) is applied to the table given to this option.

     If the appended columns have a name, the column names of each file
     will be appended with a ‘-N’, where ‘N’ is a counter starting from
     1 for each appended file.  This is done because when concatenating
     columns from multiple tables (more than two) into one, they may
     have the same name, and its not good practice to have multiple
     columns with the same name.  You can disable this feature with
     ‘--catcolumnrawname’.  Generally, you can use the ‘--colmetadata’
     option to update column metadata.

     For example, let’s assume you have two catalogs of the same objects
     (same number of rows) in different filters.  Such that
     ‘f160w-cat.fits’ has a ‘MAGNITUDE’ column that has the magnitude of
     each object in the ‘F160W’ filter and similarly ‘f105w-cat.fits’,
     also has a ‘MAGNITUDE’ column, but for the ‘F105W’ filter.  You can
     use column concatenation like below to import the ‘MAGNITUDE’
     column from the ‘F105W’ catalog into the ‘F160W’ catalog, while
     giving each magnitude column a different name:

          asttable f160w-cat.fits --output=both.fits \
            --catcolumnfile=f105w-cat.fits --catcolumns=MAGNITUDE \
            --colmetadata=MAGNITUDE,MAG-F160W,log,"Magnitude in F160W" \
            --colmetadata=MAGNITUDE-1,MAG-F105W,log,"Magnitude in F105W"

     For a more complete example, see *note Working with catalogs
     estimating colors::.

‘-u STR/INT’
‘--catcolumnhdu=STR/INT’
     The HDU/extension of the FITS file(s) that should be concatenated,
     or appended, with ‘--catcolumnfile’.  If ‘--catcolumn’ is called
     more than once with more than one FITS file, its necessary to call
     this option more than once.  The HDUs will be loaded in the same
     order as the FITS files given to ‘--catcolumnfile’.

‘-C STR/INT’
‘--catcolumns=STR/INT’
     The column(s) in the file(s) given to ‘--catcolumnfile’ to append.
     When this option is not given, all the columns will be
     concatenated.  See ‘--catcolumnfile’ for more.

‘--catcolumnrawname’
     Don’t modify the names of the concatenated (appended) columns, see
     description in ‘--catcolumnfile’.

‘-O’
‘--colinfoinstdout’
     Add column metadata when the output is printed in the standard
     output.  Usually the standard output is used for a fast visual
     check, or to pipe into other metadata-agnostic programs (like AWK)
     for further processing.  So by default meta-data aren’t included.
     But when piping to other Gnuastro programs (where metadata can be
     interpreted and used) it is recommended to use this option and use
     column names in the next program.

‘-r STR,FLT:FLT’
‘--range=STR,FLT:FLT’
     Only output rows that have a value within the given range in the
     ‘STR’ column (can be a name or counter).  Note that the range is
     only inclusive in the lower-limit.  For example with
     ‘--range=sn,5:20’ the output’s columns will only contain rows that
     have a value in the ‘sn’ column (not case-sensitive) that is
     greater or equal to 5, and less than 20.  For the precedence of
     this operation in relation to others, see *note Operation
     precedence in Table::.

     This option can be called multiple times (different ranges for
     different columns) in one run of the Table program.  This is very
     useful for selecting the final rows from multiple criteria/columns.

     The chosen column doesn’t have to be in the output columns.  This
     is good when you just want to select using one column’s values, but
     don’t need that column anymore afterwards.

     For one example of using this option, see the example under
     ‘--sigclip-median’ in *note Invoking aststatistics::.

‘--inpolygon=STR1,STR2’
     Only return rows where the given coordinates are inside the polygon
     specified by the ‘--polygon’ option.  The coordinate columns are
     the given ‘STR1’ and ‘STR2’ columns, they can be a column name or
     counter (see *note Selecting table columns::).  For the precedence
     of this operation in relation to others, see *note Operation
     precedence in Table::.

     Note that the chosen columns doesn’t have to be in the output
     columns (which are specified by the ‘--column’ option).  For
     example if we want to select rows in the polygon specified in *note
     Dataset inspection and cropping::, this option can be used like
     this (you can remove the double quotations and write them all in
     one line if you remove the white-spaces around the colon separating
     the column vertices):

          asttable table.fits --inpolygon=RA,DEC      \
                   --polygon="53.187414,-27.779152    \
                              : 53.159507,-27.759633  \
                              : 53.134517,-27.787144  \
                              : 53.161906,-27.807208" \

     *Flat/Euclidean space: * The ‘--inpolygon’ option assumes a
     flat/Euclidean space so it is only correct for RA and Dec when the
     polygon size is very small like the example above.  If your polygon
     is a degree or larger, it may not return correct results.  Please
     get in touch if you need such a feature (see *note Suggest new
     feature::).

‘--outpolygon=STR1,STR2’
     Only return rows where the given coordinates are outside the
     polygon specified by the ‘--polygon’ option.  This option is very
     similar to the ‘--inpolygon’ option, so see the description there
     for more.

‘--polygon=STR’
‘--polygon=FLT,FLT:FLT,FLT:...’
     The polygon to use for the ‘--inpolygon’ and ‘--outpolygon’
     options.  This option behaves identically to the same option in the
     Crop program, so for more information on how to use it, see *note
     Crop options::.

‘-e STR,INT/FLT,...’
‘--equal=STR,INT/FLT,...’
     Only output rows that are equal to the given number(s) in the given
     column.  The first argument is the column identifier (name or
     number, see *note Selecting table columns::), after that you can
     specify any number of values.  For the precedence of this operation
     in relation to others, see *note Operation precedence in Table::.

     For example ‘--equal=ID,5,6,8’ will only print the rows that have a
     value of 5, 6, or 8 in the ‘ID’ column.  This option can also be
     called multiple times, so ‘--equal=ID,4,5 --equal=ID,6,7’ has the
     same effect as ‘--equal=4,5,6,7’.

     The ‘--equal’ and ‘--notequal’ options also work when the given
     column has a string type.  In this case the given value to the
     option will also be parsed as a string, not as a number.  When
     dealing with string columns, be careful with trailing white space
     characters (the actual value maybe adjusted to the right, left, or
     center of the column’s width).  If you need to account for such
     white spaces, you can use shell quoting.  For example
     ‘--equal=NAME," myname "’.

     *Equality and floating point numbers:* Floating point numbers are
     only approximate values (see *note Numeric data types::).  In this
     context, their equality depends on how the the input table was
     originally stored (as a plain text table or as an ASCII/binary FITS
     table).  If you want to select floating point numbers, it is
     strongly recommended to use the ‘--range’ option and set a very
     small interval around your desired number, don’t use ‘--equal’ or
     ‘--notequal’.

‘-n STR,INT/FLT,...’
‘--notequal=STR,INT/FLT,...’
     Only output rows that are _not_ equal to the given number(s) in the
     given column.  The first argument is the column identifier (name or
     number, see *note Selecting table columns::), after that you can
     specify any number of values.  For example ‘--notequal=ID,5,6,8’
     will only print the rows where the ‘ID’ column doesn’t have value
     of 5, 6, or 8.  This option can also be called multiple times, so
     ‘--notequal=ID,4,5 --notequal=ID,6,7’ has the same effect as
     ‘--notequal=4,5,6,7’.

     Be very careful if you want to use the non-equality with floating
     point numbers, see the special note under ‘--equal’ for more.  This
     option also works when the given column has a string type, see the
     description under ‘--equal’ (above) for more.

‘-s STR’
‘--sort=STR’
     Sort the output rows based on the values in the ‘STR’ column (can
     be a column name or number).  By default the sort is done in
     ascending/increasing order, to sort in a descending order, use
     ‘--descending’.  For the precedence of this operation in relation
     to others, see *note Operation precedence in Table::.

     The chosen column doesn’t have to be in the output columns.  This
     is good when you just want to sort using one column’s values, but
     don’t need that column anymore afterwards.

‘-d’
‘--descending’
     When called with ‘--sort’, rows will be sorted in descending order.

‘-H INT’
‘--head=INT’
     Only print the given number of rows from the _top_ of the final
     table.  Note that this option only affects the _output_ table.  For
     example if you use ‘--sort’, or ‘--range’, the printed rows are the
     first _after_ applying the sort sorting, or selecting a range of
     the full input.  This option cannot be called with ‘--tail’,
     ‘--rowlimit’ or ‘--rowrandom’.  For the precedence of this
     operation in relation to others, see *note Operation precedence in
     Table::.

     If the given value to ‘--head’ is 0, the output columns won’t have
     any rows and if its larger than the number of rows in the input
     table, all the rows are printed (this option is effectively
     ignored).  This behavior is taken from the ‘head’ program in GNU
     Coreutils.

‘-t INT’
‘--tail=INT’
     Only print the given number of rows from the _bottom_ of the final
     table.  See ‘--head’ for more.  This option cannot be called with
     ‘--head’, ‘--rowlimit’ or ‘--rowrandom’.

‘--rowlimit=INT,INT’
     Only return the rows within the requested positional range
     (inclusive on both sides).  Therefore, ‘--rowlimit=5,7’ will return
     3 of the input rows, row 5, 6 and 7.  This option will abort if any
     of the given values is larger than the total number of rows in the
     table.  For the precedence of this operation in relation to others,
     see *note Operation precedence in Table::.

     With the ‘--head’ or ‘--tail’ options you can only see the top or
     bottom few rows.  However, with this option, you can limit the
     returned rows to a contiguous set of rows in the middle of the
     table.  Therefore this option cannot be called with ‘--head’,
     ‘--tail’, or ‘--rowrandom’.

‘--rowrandom=INT’
     Select ‘INT’ rows from the input table by random (assuming a
     uniform distribution).  This option is applied _after_ the
     value-based selection options (like ‘--sort’, ‘--range’,
     ‘--polygon’ and etc).  On the other hand, only the row counters are
     randomly selected, this option doesn’t change the order.
     Therefore, if ‘--rowrandom’ is called together with ‘--sort’, the
     returned rows are still sorted.  This option cannot be called with
     ‘--head’, ‘--tail’, or ‘--rowlimit’.  For the precedence of this
     operation in relation to others, see *note Operation precedence in
     Table::.

     This option will only have an effect if ‘INT’ is larger than the
     number of rows when it is activated (after the value-based
     selection options have been applied).  When there are fewer rows, a
     warning is printed, saying that this option has no effect.  The
     warning can be disabled with the ‘--quiet’ option.

     Due to its nature (to be random), the output of this option differs
     in each run.  Therefore 5 calls to Table with ‘--rowrandom’ on the
     same input table will generate 5 different outputs.  If you want a
     reproducible random selection, set the ‘GSL_RNG_SEED’ environment
     variable and also use the ‘--envseed’ option, for more see *note
     Generating random numbers::.

‘--envseed’
     Read the random number generator seed from the ‘GSL_RNG_SEED’
     environment variable for ‘--rowrandom’ (instead of generating a
     different seed internally on every run).  This is useful if you
     want a reproducible random selection of the input rows.  For more,
     see *note Generating random numbers::.

‘-b STR[,STR[,STR]]’
‘--noblank=STR[,STR[,STR]]’
     Remove all rows in the given _output_ columns that have a blank
     value.  Like above, the columns can be specified by their name or
     number (counting from 1).

     For example if ‘table.fits’ has blank values (NaN in floating point
     types) in the ‘magnitude’ and ‘sn’ columns, with
     ‘--noblank=magnitude,sn’, the output will not contain any rows with
     blank values in these columns.

     If you want _all_ columns to be checked, simply set the value to
     ‘_all’ (in other words: ‘--noblank=_all’).  This mode is useful
     when there are many columns in the table and you want a “clean”
     output table (with no blank values in any column): entering their
     name or number one-by-one can be buggy and frustrating.  In this
     mode, no other column name should be given.  For example if you
     give ‘--noblank=_all,magnitude’, then Table will assume that your
     table actually has a column named ‘_all’ and ‘magnitude’, and if it
     doesn’t, it will abort with an error.

     This option is applied just before writing the final table (after
     ‘--colmetadata’ has finished).  So in case you changed the column
     metadata, or added new columns, you can use the new names, or the
     newly defined column numbers.  For the precedence of this operation
     in relation to others, see *note Operation precedence in Table::.

‘-m STR/INT,STR[,STR[,STR]]’
‘--colmetadata=STR/INT,STR[,STR[,STR]]’
     Update the specified column metadata in the output table.  This
     option is applied after all other column-related operations are
     complete, for example column arithmetic, or column concatenation.
     For the precedence of this operation in relation to others, see
     *note Operation precedence in Table::.

     The first value (before the first comma) given to this option is
     the column’s identifier.  It can either be a counter (positive
     integer, counting from 1), or a name (the column’s name in the
     output if this option wasn’t called).

     After the to-be-updated column is identified, at least one other
     string should be given, with a maximum of three strings.  The first
     string after the original name will the the selected column’s new
     name.  The next (optional) string will be the selected column’s
     unit and the third (optional) will be its comments.  If the two
     optional strings aren’t given, the original column’s units or
     comments will remain unchanged.  Some examples of this option are
     available in the tutorials, in particular *note Working with
     catalogs estimating colors::.  Here are some more specific examples

     ‘--colmetadata=MAGNITUDE,MAG_F160W’
          This will convert name of the original ‘MAGNITUDE’ column to
          ‘MAG_F160W’, leaving the unit and comments unchanged.

     ‘--colmetadata=3,MAG_F160W,mag’
          This will convert name of the third column of the final output
          to ‘MAG_F160W’ and the units to ‘mag’, while leaving the
          comments untouched.

     ‘--colmetadata=MAGNITUDE,MAG_F160W,mag,"Magnitude in F160W filter"’
          This will convert name of the original ‘MAGNITUDE’ column to
          ‘MAG_F160W’, and the units to ‘mag’ and the comments to
          ‘Magnitude in F160W filter’.  Note the double quotations
          around the comment string, they are necessary to preserve the
          white-space characters within the column comment from the
          command-line, into the program (otherwise, upon reaching a
          white-space character, the shell will consider this option to
          be finished and cause un-expected behavior).

     If your table is large and generated by a script, you can first do
     all your operations on your table’s data and write it into a
     temporary file (maybe called ‘temp.fits’).  Then, look into that
     file’s metadata (with ‘asttable temp.fits -i’) to see the exact
     column positions and possible names, then add the necessary calls
     to this option to your previous call to ‘asttable’, so it writes
     proper metadata in the same run (for example in a script or
     Makefile).  Recall that when a name is given, this option will
     update the metadata of the first column that matches, so if you
     have multiple columns with the same name, you can call this options
     multiple times with the same first argument to change them all to
     different names.

     Finally, if you already have a FITS table by other means (for
     example by downloading) and you merely want to update the column
     metadata and leave the data intact, it is much more efficient to
     directly modify the respective FITS header keywords with ‘astfits’,
     using the keyword manipulation features described in *note Keyword
     inspection and manipulation::.  ‘--colmetadata’ is mainly intended
     for scenarios where you want to edit the data so it will always
     load the full/partial dataset into memory, then write out the
     resulting datasets with updated/corrected metadata.


File: gnuastro.info,  Node: Query,  Prev: Table,  Up: Data containers

5.4 Query
=========

There are many astronomical databases available for downloading
astronomical data.  Most follow the International Virtual Observatory
Alliance (IVOA, <https://ivoa.net>) standards (and in particular the
Table Access Protocol, or TAP(1)). With TAP, it is possible to submit
your queries via a command-line downloader (for example ‘curl’) to only
get specific tables, targets (rows in a table) or measurements (columns
in a table): you don’t have to download the full table (which can be
very large in some cases)!  These customizations are done through the
Astronomical Data Query Language (ADQL(2)).

   Therefore, if you are sufficiently familiar with TAP and ADQL, you
can easily custom-download any part of an online dataset.  However, you
also need to keep a record of the URLs of each database and in many
cases, the commands will become long and hard/buggy to type on the
command-line.  On the other hand, most astronomers don’t know TAP or
ADQL at all, and are forced to go to the database’s web page which is
slow (it needs to download so many images, and has too much annoying
information), requires manual interaction (further making it slow and
buggy), and can’t be automated.

   Gnuastro’s Query program is designed to be the middle-man in this
process: it provides a simple high-level interface to let you specify
your constraints on what you want to download.  It then internally
constructs the command to download the data based on your inputs and
runs it to download your desired data.  Query also prints the full
command before it executes it (if not called with ‘--quiet’).  Also, if
you ask for a FITS output table, the full command is written into its
0-th extension along with other input parameters to query (all Gnuastro
programs generally keep their input configuration parameters as FITS
keywords in the zero-th output).  You can see it with Gnuastro’s Fits
program, like below:

     $ astfits query-output.fits -h0

   With the full command used to download the dataset, you only need a
minimal knowledge of ADQL to do lower-level customizations on your
downloaded dataset.  You can simply copy that command and change the
parts of the query string you want: ADQL is very powerful!  For example
you can ask the server to do mathematical operations on the columns and
apply selections after those operations, or combine/match multiple
datasets and etc.  We will try to add high-level interfaces for such
capabilities, but generally, don’t limit yourself to the high-level
operations (that can’t cover everything!).

* Menu:

* Available databases::         List of available databases to Query.
* Invoking astquery::           Inputs, outputs and configuration of Query.

   ---------- Footnotes ----------

   (1) <https://ivoa.net/documents/TAP>

   (2) <https://ivoa.net/documents/ADQL>


File: gnuastro.info,  Node: Available databases,  Next: Invoking astquery,  Prev: Query,  Up: Query

5.4.1 Available databases
-------------------------

The current list of databases supported by Query are listed at the end
of this section.  To get the list of available datasets within each
database, you can use the ‘--information’ option.  For example with the
command below you can get a list of the roughly 100 datasets that are
available within the ESA Gaia server with their description:

     $ astquery gaia --information

However, other databases like VizieR host many more datasets (tens of
thousands!).  Therefore it is very inconvenient to get the _full_
information every time you want to find your dataset of interest (the
full metadata file VizieR is more than 20Mb).  In such cases, you can
limit the downloaded and displayed information with the ‘--limitinfo’
option.  For example with the first command below, you can get all
datasets relating to the MUSE (an instrument on the Very Large
Telescope), and those that include Roland Bacon (Principle Investigator
of MUSE) as an author (‘Bacon, R.’).  Recall that ‘-i’ is the short
format of ‘--information’.

     $ astquery vizier -i --limitinfo=MUSE
     $ astquery vizier -i --limitinfo="Bacon R."

   Once you find the recognized name of your desired dataset, you can
see the column information of that dataset with adding the dataset name.
For example, with the command below you can see the column metadata in
the ‘J/A+A/608/A2/udf10’ dataset (one of the datasets in the search
above) using this command:

     $ astquery vizier --dataset=J/A+A/608/A2/udf10 -i

   For very popular datasets of a database, Query provides an
easier-to-remember short name that you can feed to ‘--dataset’.  This
short name will map to the officially recognized name of the dataset on
the server.  In this mode, Query will also set positional columns
accordingly.  For example most VizieR datasets have an ‘RAJ2000’ column
(the RA and the epoch of 2000) so it is the default RA column name for
coordinate search (using ‘--center’ or ‘--overlapwith’).  However, some
datasets don’t have this column (for example SDSS DR12).  So when you
use the short name and Query knows about this dataset, it will
internally set the coordinate columns that SDSS DR12 has: ‘RA_ICRS’ and
‘DEC_ICRS’.  Recall that you can always change the coordinate columns
with ‘--ccol’.

   For example in the VizieR and Gaia databases, the recognized name for
the early data release 3 data is respectively ‘I/350/gaiaedr3’ and
‘gaiaedr3.gaia_source’.  These technical names can be hard to remember.
Therefore Query provides ‘gaiaedr3’ (for VizieR) and ‘edr3’ (for ESA’s
Gaia) shortcuts which you can give to ‘--dataset’ instead.  They will be
directly mapped to the fully recognized name by Query.  In the list
below that describes the available databases, the available short names
are also listed.

*Not all datasets support TAP:* Large databases like VizieR have TAP
access for all their datasets.  However, smaller databases haven’t
implemented TAP for all their tables.  Therefore some datasets that are
searchable in their web interface may not be available for a TAP search.
To see the full list of TAP-ed datasets in a database, use the
‘--information’ (or ‘-i’) option with the dataset name like the command
below.

     $ astquery astron -i

If your desired dataset isn’t in this list, but has web-access, contact
the database maintainers and ask them to add TAP access for it.  After
they do it, you should see the name added to the output list of the
command above.

   The list of databases recognized by Query (and their names in Query)
is described below.  Since Query is a new member of the Gnuastro family
(first available in Gnuastro 0.14), this list will hopefully grow
significantly in the next releases.  If you have any particular datasets
in mind, please let us know by sending an email to
‘bug-gnuastro@gnu.org’.  If the dataset supports IVOA’s TAP (Table
Access Protocol), it should be very easy to add.

‘astron’
     The ASTRON Virtual Observatory service (<https://vo.astron.nl>) is
     a database focused on radio astronomy data and images, primarily
     those collected by ASTRON itself.  A query to ‘astron’ is submitted
     to ‘https://vo.astron.nl/__system__/tap/run/tap/sync’.

     Here is the list of short names for dataset(s) in ASTRON’s VO
     service:
        • ‘tgssadr --> tgssadr.main’

‘gaia’
     The Gaia project (<https://www.cosmos.esa.int/web/gaia>) database
     which is a large collection of star positions on the celestial
     sphere, as well as peculiar velocities, parallaxes and magnitudes
     in some bands among many others.  Besides scientific studies (like
     studying resolved stellar populations in the Galaxy and its halo),
     Gaia is also invaluable for raw data calibrations, like astrometry.
     A query to ‘gaia’ is submitted to
     ‘https://gea.esac.esa.int/tap-server/tap/sync’.

     Here is the list of short names for popular datasets within Gaia:
        • ‘edr3 --> gaiaedr3.gaia_source’
        • ‘dr2 --> gaiadr2.gaia_source’
        • ‘dr1 --> gaiadr1.gaia_source’
        • ‘tycho2 --> public.tycho2’
        • ‘hipparcos --> public.hipparcos’

‘ned’
     The NASA/IPAC Extragalactic Database (NED,
     <http://ned.ipac.caltech.edu>) is a fusion database, integrating
     the information about extra-galactic sources from many large sky
     surveys into a single catalog.  It covers the full spectrum, from
     Gamma rays to radio frequencies and is updated when new data
     arrives.  A TAP query to ‘ned’ is submitted to
     ‘https://ned.ipac.caltech.edu/tap/sync’.

        • ‘objdir --> NEDTAP.objdir’: default TAP-based dataset in NED.

        • ‘extinction’: A command-line interface to the NED Extinction
          Calculator
          (https://ned.ipac.caltech.edu/extinction_calculator).  It only
          takes a central coordinate and returns a VOTable of the
          calculated extinction in many commonly used filters at that
          point.  As a result, options like ‘--width’ or ‘--radius’ are
          not supported.  However, Gnuastro doesn’t yet support the
          VOTable format.  Therefore, if you specify an ‘--output’ file,
          it should have an ‘.xml’ suffix and the downloaded file will
          not be checked.

          Until VOTable support is added to Gnuastro, you can use GREP,
          AWK and SED to convert the VOTable data into a FITS table with
          a command like below (assuming the queried VOTable is called
          ‘ned-extinction.xml’):

          grep '^<TR><TD>' ned-extinction.xml \
           | sed -e's|<TR><TD>||' \
                 -e's|</TD></TR>||' \
                 -e's|</TD><TD>|@|g' \
           | awk 'BEGIN{FS="@"; \
               print "# Column 1: FILTER [name,str15] Filter name"; \
               print "# Column 2: CENTRAL [um,f32] Central Wavelength"; \
               print "# Column 3: EXTINCTION [mag,f32] Galactic Ext."; \
               print "# Column 4: ADS_REF [ref,str50] ADS reference"} \
                  {printf "%-15s %g %g %s\n", $1, $2, $3, $4}' \
           | asttable -oned-extinction.fits

          Once the table is in FITS, you can easily get the extinction
          for a certain filter (for example the ‘SDSS r’ filter) like
          the command below:

               asttable ned-extinction.fits --equal=FILTER,"SDSS r" \
                        -cEXTINCTION

‘vizier’
     Vizier (<https://vizier.u-strasbg.fr>) is arguably the largest
     catalog database in astronomy: containing more than 20500 catalogs
     as of mid January 2021.  Almost all published catalogs in major
     projects, and even the tables in many papers are archived and
     accessible here.  For example VizieR also has a full copy of the
     Gaia database mentioned below, with some additional standardized
     columns (like RA and Dec in J2000).

     The current implementation of ‘--limitinfo’ only looks into the
     description of the datasets, but since VizieR is so large, there is
     still a lot of room for improvement.  Until then, if ‘--limitinfo’
     isn’t sufficient, you can use VizieR’s own web-based search for
     your desired dataset: <http://cdsarc.u-strasbg.fr/viz-bin/cat>

     Because VizieR curates such a diverse set of data from tens of
     thousands of projects and aims for interoperability between them,
     the column names in VizieR may not be identical to the column names
     in the surveys’ own databases (Gaia in the example above).  A query
     to ‘vizier’ is submitted to
     ‘http://tapvizier.u-strasbg.fr/TAPVizieR/tap/sync’.

     Here is the list of short names for popular datasets within VizieR
     (sorted alphabetically by their short name).  Please feel free to
     suggest other major catalogs (covering a wide area or commonly used
     in your field)..  For details on each dataset with necessary
     citations, and links to web pages, look into their details with
     their ViziR names in <https://vizier.u-strasbg.fr/viz-bin/VizieR>.
        • ‘2mass --> II/246/out’ (2MASS All-Sky Catalog)
        • ‘akarifis --> II/298/fis’ (AKARI/FIS All-Sky Survey)
        • ‘allwise --> II/328/allwise’ (AllWISE Data Release)
        • ‘apass9 --> II/336/apass9’ (AAVSO Photometric All Sky Survey,
          DR9)
        • ‘catwise --> II/365/catwise’ (CatWISE 2020 catalog)
        • ‘des1 --> II/357/des_dr1’ (Dark Energy Survey data release 1)
        • ‘gaiadr2 --> I/345/gaia2’ (GAIA Data Release 2)
        • ‘gaiaedr3 --> I/350/gaiaedr3’ (GAIA early Data Release 3)
        • ‘galex5 --> II/312/ais’ (All-sky Survey of GALEX DR5)
        • ‘nomad --> I/297/out’ (Naval Observatory Merged Astrometric
          Dataset)
        • ‘panstarrs1 --> II/349/ps1’ (Pan-STARRS Data Release 1).
        • ‘ppmxl --> I/317/sample’ (Positions and proper motions on the
          ICRS)
        • ‘sdss12 --> V/147/sdss12’ (SDSS Photometric Catalogue, Release
          12)
        • ‘usnob1 --> I/284/out’ (Whole-Sky USNO-B1.0 Catalog)
        • ‘ucac5 --> I/340/ucac5’ (5th U.S. Naval Obs.  CCD Astrograph
          Catalog)
        • ‘unwise --> II/363/unwise’ (Band-merged unWISE Catalog)
        • ‘wise --> II/311/wise’ (WISE All-Sky data Release)


File: gnuastro.info,  Node: Invoking astquery,  Prev: Available databases,  Up: Query

5.4.2 Invoking Query
--------------------

Query provides a high-level interface to downloading subsets of data
from databases.  The executable name is ‘astquery’ with the following
general template

     $ astquery DATABASE-NAME [OPTION...] ...

One line examples:


     ## Information about all datasets in ESA's GAIA database:
     $ astquery gaia --information

     ## Only show catalogs in VizieR that have 'MUSE' in their
     ## description. The '-i' is short for '--information'.
     $ astquery vizier -i --limitinfo=MUSE

     ## List of columns in 'J/A+A/608/A2/udf10' (one of the above).
     $ astquery vizier --dataset=J/A+A/608/A2/udf10 -i

     ## ID, RA and Dec of all Gaia sources within an image.
     $ astquery gaia --dataset=edr3 --overlapwith=image.fits \
                -csource_id,ra,dec

     ## RA, Dec and Spectroscopic redshifts of objects in SDSS DR12
     ## spectroscopic redshift that overlap with 'image.fits'.
     $ astquery vizier --dataset=sdss12 --overlapwith=image.fits \
                -cRA_ICRS,DE_ICRS,zsp --range=zsp,1e-10,inf

     ## All columns of all entries in the Gaia eDR3 catalog (hosted at
     ## VizieR) within 1 arc-minute of the given coordinate.
     $ astquery vizier --dataset=I/350/gaiaedr3 --output=my-gaia.fits \
                --center=113.8729761,31.9027152 --radius=1/60 \

     ## Similar to above, but only ID, RA and Dec columns for objects with
     ## magnitude range 10 to 15. In VizieR, this column is called 'Gmag'.
     $ astquery vizier --dataset=I/350/gaiaedr3 --output=my-gaia.fits \
                --center=113.8729761,31.9027152 --radius=1/60 \
                --range=Gmag,10:15 -cEDR3Name,RAJ2000,DEJ2000

   Query takes a single argument which is the name of the database.  For
the full list of available databases and accessing them, see *note
Available databases::.  There are two methods to query the databases,
each is more fully discussed in its option’s description below.
   • *Low-level:* With ‘--query’ you can directly give a raw query
     statement that is recognized by the database.  This is very low
     level and will require a good knowledge of the database’s query
     language, but of course, it is much more powerful.  If this option
     is given, the raw string is directly passed to the server and all
     other constraints/options (for Query’s high-level interface) are
     ignored.
   • *High-level:* With the high-level options (like ‘--column’,
     ‘--center’, ‘--radius’, ‘--range’ and other constraining options
     below), the low-level query will be constructed automatically for
     the particular database.  This method is only limited to the
     generic capabilities that Query provides for all servers.  So
     ‘--query’ is more powerful, however, in this mode, you don’t need
     any knowledge of the database’s query language.  You can see the
     internally generated query on the terminal (if ‘--quiet’ is not
     used) or in the 0-th extension of the output (if it is a FITS
     file).  This full command contains the internally generated query.

   The name of the downloaded output file can be set with ‘--output’.
The requested output format can have any of the *note Recognized table
formats:: (currently ‘.txt’ or ‘.fits’).  Like all Gnuastro programs, if
the output is a FITS file, the zero-th/first HDU of the output will
contain all the command-line options given to Query as well as the full
command used to access the server.  When ‘--output’ is not set, the
output name will be in the format of ‘NAME-STRING.fits’, where ‘NAME’ is
the name of the database and ‘STRING’ is a randomly selected 6-character
set of numbers and alphabetic characters.  With this feature, a second
run of ‘astquery’ that isn’t called with ‘--output’ will not over-write
an already downloaded one.  Generally, when calling Query more than
once, it is recommended to set an output name for each call based on
your project’s context.

   The outputs of Query will have a common output format, irrespective
of the used database.  To achieve this, Query will ask the databases to
provide a FITS table output (for larger tables, FITS can consume much
less download volume).  After downloading is complete, the raw
downloaded file will be read into memory once by Query, and written into
the file given to ‘--output’.  The raw downloaded file will be deleted
by default, but can be preserved with the ‘--keeprawdownload’ option.
This strategy avoids unnecessary surprises depending on database.  For
example some databases can download a compressed FITS table, even though
we ask for FITS. But with the strategy above, the final output will be
an uncompressed FITS file.  The metadata that is added by Query
(including the full download command) is also very useful for future
usage of the downloaded data.  Unfortunately many databases don’t write
the input queries into their generated tables.

‘--dry-run’
     Only print the final download command to contact the server, don’t
     actually run it.  This option is good when you want to check the
     finally constructed query or download options given to the download
     program.  You may also want to use the constructed command as a
     base to do further customizations on it and run it yourself.

‘-k’
‘--keeprawdownload’
     Don’t delete the raw downloaded file from the database.  The name
     of the raw download will have a ‘OUTPUT-raw-download.fits’ format.
     Where ‘OUTPUT’ is either the base-name of the final output file
     (without a suffix).

‘-i’
‘--information’
     Print the information of all datasets (tables) within a database or
     all columns within a database.  When ‘--dataset’ is specified, the
     latter mode (all column information) is downloaded and printed and
     when its not defined, all dataset information (within the database)
     is printed.

     Some databases (like VizieR) contain tens of thousands of datasets,
     so you can limit the downloaded and printed information for
     available databases with the ‘--limitinfo’ option (described
     below).  Dataset descriptions are often large and contain a lot of
     text (unlike column descriptions).  Therefore when printing the
     information of all datasets within a database, the information
     (e.g., database name) will be printed on separate lines before the
     description.  However, when printing column information, the output
     has the same format as a similar option in Table (see *note
     Invoking asttable::).

     Important note to consider: the printed order of the datasets or
     columns is just for displaying in the printed output.  You cannot
     ask for datasets or columns based on the printed order, you need to
     use dataset or column names.

‘-L STR’
‘--limitinfo=STR’
     Limit the information that is downloaded and displayed (with
     ‘--information’) to those that have the string given to this option
     in their description.  Note that _this is case-sensitive_.  This
     option is only relevant when ‘--information’ is also called.

     Databases may have thousands (or tens of thousands) of datasets.
     Therefore just the metadata (information) to show with
     ‘--information’ can be tens of megabytes (for example the full
     VizieR metadata file is about 23Mb as of January 2021).  Once
     downloaded, it can also be hard to parse manually.  With
     ‘--limitinfo’, only the metadata of datasets that contain this
     string _in their description_ will be downloaded and displayed,
     greatly improving the speed of finding your desired dataset.

‘-Q "STR"’
‘--query="STR"’
     Directly specify the query to be passed onto the database.  The
     queries will generally contain space and other meta-characters, so
     we recommend placing the query within quotations.

‘-s STR’
‘--dataset=STR’
     The dataset to query within the database (not compatible with
     ‘--query’).  This option is mandatory when ‘--query’ or
     ‘--information’ aren’t provided.  You can see the list of available
     datasets within a database using ‘--information’ (possibly
     supplemented by ‘--limitinfo’).  The output of ‘--information’ will
     contain the recognized name of the datasets within that database.
     You can pass the recognized name directly to this option.  For more
     on finding and using your desired database, see *note Available
     databases::.

‘-c STR’
‘--column=STR[,STR[,...]]’
     The column name(s) to retrieve from the dataset in the given order
     (not compatible with ‘--query’).  If not given, all the dataset’s
     columns for the selected rows will be queried (which can be
     large!).  This option can take multiple values in one instance (for
     example ‘--column=ra,dec,mag’), or in multiple instances (for
     example ‘-cra -cdec -cmag’), or mixed (for example ‘-cra,dec
     -cmag’).

     In case, you don’t know the full list of the dataset’s column names
     a-priori, and you don’t want to download all the columns (which can
     greatly decrease your download speed), you can use the
     ‘--information’ option combined with the ‘--dataset’ option, see
     *note Available databases::.

‘-H INT’
‘--head=INT’
     Only ask for the first ‘INT’ rows of the finally selected columns,
     not all the rows.  This can be good when your search can result a
     large dataset, but before downloading the full volume, you want to
     see the top rows and get a feeling of what the whole dataset looks
     like.

‘-v FITS’
‘--overlapwith=FITS’
     File name of FITS file containing an image (in the HDU given by
     ‘--hdu’) to use for identifying the region to query in the give
     database and dataset.  Based on the image’s WCS and pixel size, the
     sky coverage of the image is estimated and values to the
     ‘--center’, ‘--width’ will be calculated internally.  Hence this
     option cannot be used with ‘--center’, ‘--width’ or ‘--radius’.
     Also, since it internally generates the query, it can’t be used
     with ‘--query’.

     Note that if the image has WCS distortions and the reference point
     for the WCS is not within the image, the WCS will not be
     well-defined.  Therefore the resulting catalog may not overlap, or
     correspond to a larger/small area in the sky.

‘-C FLT,FLT’
‘--center=FLT,FLT’
     The spatial center position (mostly RA and Dec) to use for the
     automatically generated query (not compatible with ‘--query’).  The
     given values will be compared to two columns in the database to
     find/return rows within a certain region around this center
     position will be requested and downloaded.  Pre-defined RA and Dec
     column names are defined in Query for every database, however you
     can use ‘--ccol’ to select other columns to use instead.  The
     region can either be a circle and the point (configured with
     ‘--radius’) or a box/rectangle around the point (configured with
     ‘--width’).

‘--ccol=STR,STR’
     The name of the coordinate-columns in the dataset to compare with
     the values given to ‘--center’.  Query will use its internal
     defaults for each dataset (for example ‘RAJ2000’ and ‘DEJ2000’ for
     VizieR data).  But each dataset is treated separately and it isn’t
     guaranteed that these columns exist in all datasets.  Also, more
     than one coordinate system/epoch may be present in a dataset and
     you can use this option to construct your spatial constraint based
     on the others coordinate systems/epochs.

‘-r FLT’
‘--radius=FLT’
     The radius about the requested center to use for the automatically
     generated query (not compatible with ‘--query’).  The radius is in
     units of degrees, but you can use simple division with this option
     directly on the command-line.  For example if you want a radius of
     20 arc-minutes or 20 arc-seconds, you can use ‘--radius=20/60’ or
     ‘--radius=20/3600’ respectively (which is much more human-friendly
     than ‘0.3333’ or ‘0.005556’).

‘-w FLT[,FLT]’
‘--width=FLT[,FLT]’
     The square (or rectangle) side length (width) about the requested
     center to use for the automatically generated query (not compatible
     with ‘--query’).  If only one value is given to ‘--width’ the
     region will be a square, but if two values are given, the widths of
     the query box along each dimension will be different.  The value(s)
     is (are) in the same units as the coordinate column (see ‘--ccol’,
     usually RA and Dec which are degrees).  You can use simple division
     for each value directly on the command-line if you want relatively
     small (and more human-friendly) sizes.  For example if you want
     your box to be 1 arc-minutes along the RA and 2 arc-minutes along
     Dec, you can use ‘--width=1/60,2/60’.

‘-g STR,FLT,FLT’
‘--range=STR,FLT,FLT’
     The column name and numerical range (inclusive) of acceptable
     values in that column (not compatible with ‘--query’).  This option
     can be called multiple times for applying range limits on many
     columns in one call (thus greatly reducing the download size).  For
     example when used on the ESA gaia database, you can use
     ‘--range=phot_g_mean_mag,10:15’ to only get rows that have a value
     between 10 and 15 (inclusive on both sides) in the
     ‘phot_g_mean_mag’ column.

     If you want all rows larger, or smaller, than a certain number, you
     can use ‘inf’, or ‘-inf’ as the first or second values
     respectively.  For example, if you want objects with SDSS
     spectroscopic redshifts larger than 2 (from the VizieR ‘sdss12’
     database), you can use ‘--range=zsp,2,inf’

     If you want the interval to not be inclusive on both sides, you can
     run ‘astquery’ once and get the command that it executes.  Then you
     can edit it to be non-inclusive on your desired side.

‘-b STR[,STR]’
‘--noblank=STR[,STR]’
     Only ask for rows that don’t have a blank value in the ‘STR’
     column.  This option can be called many times, and each call can
     have multiple column names (separated by a comma or <,>).  For
     example if you want the retrieved rows to not have a blank value in
     columns ‘A’, ‘B’, ‘C’ and ‘D’, you can use ‘--noblank=A -bB,C,D’.

‘--sort=STR[,STR]’
     Ask for the server to sort the downloaded data based on the given
     columns.  For example let’s assume your desired catalog has column
     ‘Z’ for redshift and column ‘MAG_R’ for magnitude in the R band.
     When you call ‘--sort=Z,MAG_R’, it will primarily sort the columns
     based on the redshift, but if two objects have the same redshift,
     they will be sorted by magnitude.  You can add as many columns as
     you like for higher-level sorting.


File: gnuastro.info,  Node: Data manipulation,  Next: Data analysis,  Prev: Data containers,  Up: Top

6 Data manipulation
*******************

Images are one of the major formats of data that is used in astronomy.
The functions in this chapter explain the GNU Astronomy Utilities which
are provided for their manipulation.  For example cropping out a part of
a larger image or convolving the image with a given kernel or applying a
transformation to it.

* Menu:

* Crop::                        Crop region(s) from a dataset.
* Arithmetic::                  Arithmetic on input data.
* Convolve::                    Convolve an image with a kernel.
* Warp::                        Warp/Transform an image to a different grid.


File: gnuastro.info,  Node: Crop,  Next: Arithmetic,  Prev: Data manipulation,  Up: Data manipulation

6.1 Crop
========

Astronomical images are often very large, filled with thousands of
galaxies.  It often happens that you only want a section of the image,
or you have a catalog of sources and you want to visually analyze them
in small postage stamps.  Crop is made to do all these things.  When
more than one crop is required, Crop will divide the crops between
multiple threads to significantly reduce the run time.

   Astronomical surveys are usually extremely large.  So large in fact,
that the whole survey will not fit into a reasonably sized file.
Because of this, surveys usually cut the final image into separate tiles
and store each tile in a file.  For example the COSMOS survey’s Hubble
space telescope, ACS F814W image consists of 81 separate FITS images,
with each one having a volume of 1.7 Giga bytes.

   Even though the tile sizes are chosen to be large enough that too
many galaxies/targets don’t fall on the edges of the tiles, inevitably
some do.  So when you simply crop the image of such targets from one
tile, you will miss a large area of the surrounding sky (which is
essential in estimating the noise).  Therefore in its WCS mode, Crop
will stitch parts of the tiles that are relevant for a target (with the
given width) from all the input images that cover that region into the
output.  Of course, the tiles have to be present in the list of input
files.

   Besides cropping postage stamps around certain coordinates, Crop can
also crop arbitrary polygons from an image (or a set of tiles by
stitching the relevant parts of different tiles within the polygon), see
‘--polygon’ in *note Invoking astcrop::.  Alternatively, it can crop out
rectangular regions through the ‘--section’ option from one image, see
*note Crop section syntax::.

* Menu:

* Crop modes::                  Basic modes to define crop region.
* Crop section syntax::         How to define a section to crop.
* Blank pixels::                Pixels with no value.
* Invoking astcrop::            Calling Crop on the command-line


File: gnuastro.info,  Node: Crop modes,  Next: Crop section syntax,  Prev: Crop,  Up: Crop

6.1.1 Crop modes
----------------

In order to be comprehensive, intuitive, and easy to use, there are two
ways to define the crop:

  1. From its center and side length.  For example if you already know
     the coordinates of an object and want to inspect it in an image or
     to generate postage stamps of a catalog containing many such
     coordinates.

  2. The vertices of the crop region, this can be useful for larger
     crops over many targets, for example to crop out a uniformly deep,
     or contiguous, region of a large survey.

   Irrespective of how the crop region is defined, the coordinates to
define the crop can be in Image (pixel) or World Coordinate System (WCS)
standards.  All coordinates are read as floating point numbers (not
integers, except for the ‘--section’ option, see below).  By setting the
_mode_ in Crop, you define the standard that the given coordinates must
be interpreted.  Here, the different ways to specify the crop region are
discussed within each standard.  For the full list options, please see
*note Invoking astcrop::.

   When the crop is defined by its center, the respective (integer)
central pixel position will be found internally according to the FITS
standard.  To have this pixel positioned in the center of the cropped
region, the final cropped region will have an add number of pixels (even
if you give an even number to ‘--width’ in image mode).

   Furthermore, when the crop is defined as by its center, Crop allows
you to only keep crops what don’t have any blank pixels in the vicinity
of their center (your primary target).  This can be very convenient when
your input catalog/coordinates originated from another survey/filter
which is not fully covered by your input image, to learn more about this
feature, please see the description of the ‘--checkcenter’ option in
*note Invoking astcrop::.

Image coordinates
     In image mode (‘--mode=img’), Crop interprets the pixel coordinates
     and widths in units of the input data-elements (for example pixels
     in an image, not world coordinates).  In image mode, only one image
     may be input.  The output crop(s) can be defined in multiple ways
     as listed below.

     Center of multiple crops (in a catalog)
          The center of (possibly multiple) crops are read from a text
          file.  In this mode, the columns identified with the
          ‘--coordcol’ option are interpreted as the center of a crop
          with a width of ‘--width’ pixels along each dimension.  The
          columns can contain any floating point value.  The value to
          ‘--output’ option is seen as a directory which will host (the
          possibly multiple) separate crop files, see *note Crop
          output:: for more.  For a tutorial using this feature, please
          see *note Finding reddest clumps and visual inspection::.

     Center of a single crop (on the command-line)
          The center of the crop is given on the command-line with the
          ‘--center’ option.  The crop width is specified by the
          ‘--width’ option along each dimension.  The given coordinates
          and width can be any floating point number.

     Vertices of a single crop
          In Image mode there are two options to define the vertices of
          a region to crop: ‘--section’ and ‘--polygon’.  The former is
          lower-level (doesn’t accept floating point vertices, and only
          a rectangular region can be defined), it is also only
          available in Image mode.  Please see *note Crop section
          syntax:: for a full description of this method.

          The latter option (‘--polygon’) is a higher-level method to
          define any polygon (with any number of vertices) with floating
          point values.  Please see the description of this option in
          *note Invoking astcrop:: for its syntax.

WCS coordinates
     In WCS mode (‘--mode=wcs’), the coordinates and widths are
     interpreted using the World Coordinate System (WCS, that must
     accompany the dataset), not pixel coordinates.  In WCS mode, Crop
     accepts multiple datasets as input.  When the cropped region
     (defined by its center or vertices) overlaps with multiple of the
     input images/tiles, the overlapping regions will be taken from the
     respective input (they will be stitched when necessary for each
     output crop).

     In this mode, the input images do not necessarily have to be the
     same size, they just need to have the same orientation and pixel
     resolution.  Currently only orientation along the celestial
     coordinates is accepted, if your input has a different orientation
     you can use Warp’s ‘--align’ option to align the image before
     cropping it (see *note Warp::).

     Each individual input image/tile can even be smaller than the final
     crop.  In any case, any part of any of the input images which
     overlaps with the desired region will be used in the crop.  Note
     that if there is an overlap in the input images/tiles, the pixels
     from the last input image read are going to be used for the
     overlap.  Crop will not change pixel values, so it assumes your
     overlapping tiles were cutout from the same original image.  There
     are multiple ways to define your cropped region as listed below.

     Center of multiple crops (in a catalog)
          Similar to catalog inputs in Image mode (above), except that
          the values along each dimension are assumed to have the same
          units as the dataset’s WCS information.  For example, the
          central RA and Dec value for each crop will be read from the
          first and second calls to the ‘--coordcol’ option.  The width
          of the cropped box (in units of the WCS, or degrees in RA and
          Dec mode) must be specified with the ‘--width’ option.

     Center of a single crop (on the command-line)
          You can specify the center of only one crop box with the
          ‘--center’ option.  If it exists in the input images, it will
          be cropped similar to the catalog mode, see above also for
          ‘--width’.

     Vertices of a single crop
          The ‘--polygon’ option is a high-level method to define any
          convex polygon (with any number of vertices).  Please see the
          description of this option in *note Invoking astcrop:: for its
          syntax.

     *CAUTION:* In WCS mode, the image has to be aligned with the
     celestial coordinates, such that the first FITS axis is parallel
     (opposite direction) to the Right Ascension (RA) and the second
     FITS axis is parallel to the declination.  If these conditions
     aren’t met for an image, Crop will warn you and abort.  You can use
     Warp’s ‘--align’ option to align the input image with these
     coordinates, see *note Warp::.

   As a summary, if you don’t specify a catalog, you have to define the
cropped region manually on the command-line.  In any case the mode is
mandatory for Crop to be able to interpret the values given as
coordinates or widths.


File: gnuastro.info,  Node: Crop section syntax,  Next: Blank pixels,  Prev: Crop modes,  Up: Crop

6.1.2 Crop section syntax
-------------------------

When in image mode, one of the methods to crop only one rectangular
section from the input image is to use the ‘--section’ option.  Crop has
a powerful syntax to read the box parameters from a string of
characters.  If you leave certain parts of the string to be empty, Crop
can fill them for you based on the input image sizes.

   To define a box, you need the coordinates of two points: the first
(‘X1’, ‘Y1’) and the last pixel (‘X2’, ‘Y2’) pixel positions in the
image, or four integer numbers in total.  The four coordinates can be
specified with one string in this format: ‘‘X1:X2,Y1:Y2’’.  This string
is given to the ‘--section’ option.  Therefore, the pixels along the
first axis that are $\geq$‘X1’ and $\leq$‘X2’ will be included in the
cropped image.  The same goes for the second axis.  Note that each
different term will be read as an integer, not a float.

   The reason it only accepts integers is that ‘--section’ is a
low-level option (which is also very fast!).  For a higher-level way to
specify region (any polygon, not just a box), please see the ‘--polygon’
option in *note Crop options::.  Also note that in the FITS standard,
pixel indexes along each axis start from unity(1) not zero(0).

   You can omit any of the values and they will be filled automatically.
The left hand side of the colon (‘:’) will be filled with ‘1’, and the
right side with the image size.  So, ‘2:,:’ will include the full range
of pixels along the second axis and only those with a first axis index
larger than ‘2’ in the first axis.  If the colon is omitted for a
dimension, then the full range is automatically used.  So the same
string is also equal to ‘2:,’ or ‘2:’ or even ‘2’.  If you want such a
case for the second axis, you should set it to: ‘,2’.

   If you specify a negative value, it will be seen as before the
indexes of the image which are outside the image along the bottom or
left sides when viewed in SAO DS9.  In case you want to count from the
top or right sides of the image, you can use an asterisk (‘*’).  When
confronted with a ‘*’, Crop will replace it with the maximum length of
the image in that dimension.  So ‘*-10:*+10,*-20:*+20’ will mean that
the crop box will be 20\times40 pixels in size and only include the top
corner of the input image with 3/4 of the image being covered by blank
pixels, see *note Blank pixels::.

   If you feel more comfortable with space characters between the
values, you can use as many space characters as you wish, just be
careful to put your value in double quotes, for example
‘--section="5:200, 123:854"’.  If you forget the quotes, anything after
the first space will not be seen by ‘--section’ and you will most
probably get an error because the rest of your string will be read as a
filename (which most probably doesn’t exist).  See *note Command-line::
for a description of how the command-line works.


File: gnuastro.info,  Node: Blank pixels,  Next: Invoking astcrop,  Prev: Crop section syntax,  Up: Crop

6.1.3 Blank pixels
------------------

The cropped box can potentially include pixels that are beyond the image
range.  For example when a target in the input catalog was very near the
edge of the input image.  The parts of the cropped image that were not
in the input image will be filled with the following two values
depending on the data type of the image.  In both cases, SAO DS9 will
not color code those pixels.
   • If the data type of the image is a floating point type (float or
     double), IEEE NaN (Not a number) will be used.
   • For integer types, pixels out of the image will be filled with the
     value of the ‘BLANK’ keyword in the cropped image header.  The
     value assigned to it is the lowest value possible for that type, so
     you will probably never need it any way.  Only for the unsigned
     character type (‘BITPIX=8’ in the FITS header), the maximum value
     is used because it is unsigned, the smallest value is zero which is
     often meaningful.
   You can ask for such blank regions to not be included in the output
crop image using the ‘--noblank’ option.  In such cases, there is no
guarantee that the image size of your outputs are what you asked for.

   In some survey images, unfortunately they do not use the ‘BLANK’ FITS
keyword.  Instead they just give all pixels outside of the survey area a
value of zero.  So by default, when dealing with float or double image
types, any values that are 0.0 are also regarded as blank regions.  This
can be turned off with the ‘--zeroisnotblank’ option.


File: gnuastro.info,  Node: Invoking astcrop,  Prev: Blank pixels,  Up: Crop

6.1.4 Invoking Crop
-------------------

Crop will crop a region from an image.  If in WCS mode, it will also
stitch parts from separate images in the input files.  The executable
name is ‘astcrop’ with the following general template

     $ astcrop [OPTION...] [ASCIIcatalog] ASTRdata ...

One line examples:

     ## Crop all objects in cat.txt from image.fits:
     $ astcrop --catalog=cat.txt image.fits

     ## Crop all options in catalog (with RA,DEC) from all the files
     ## ending in `_drz.fits' in `/mnt/data/COSMOS/':
     $ astcrop --mode=wcs --catalog=cat.txt /mnt/data/COSMOS/*_drz.fits

     ## Crop the outer 10 border pixels of the input image:
     $ astcrop --section=10:*-10,10:*-10 --hdu=2 image.fits

     ## Crop region around RA and Dec of (189.16704, 62.218203):
     $ astcrop --mode=wcs --center=189.16704,62.218203 goodsnorth.fits

     ## Crop region around pixel coordinate (568.342, 2091.719):
     $ astcrop --mode=img --center=568.342,2091.719 --width=201 image.fits

Crop has one mandatory argument which is the input image name(s), shown
above with ‘ASTRdata ...’.  You can use shell expansions, for example
‘*’ for this if you have lots of images in WCS mode.  If the crop box
centers are in a catalog, you can use the ‘--catalog’ option.  In other
cases, you have to provide the single cropped output parameters must be
given with command-line options.  See *note Crop output:: for how the
output file name(s) can be specified.  For the full list of general
options to all Gnuastro programs (including Crop), please see *note
Common options::.

   Floating point numbers can be used to specify the crop region (except
the ‘--section’ option, see *note Crop section syntax::).  In such
cases, the floating point values will be used to find the desired
integer pixel indices based on the FITS standard.  Hence, Crop
ultimately doesn’t do any sub-pixel cropping (in other words, it doesn’t
change pixel values).  If you need such crops, you can use *note Warp::
to first warp the image to the a new pixel grid, then crop from that.
For example, let’s assume you want a crop from pixels 12.982 to 80.982
along the first dimension.  You should first translate the image by
$-0.482$ (note that the edge of a pixel is at integer multiples of
$0.5$).  So you should run Warp with ‘--translate=-0.482,0’ and then
crop the warped image with ‘--section=13:81’.

   There are two ways to define the cropped region: with its center or
its vertices.  See *note Crop modes:: for a full description.  In the
former case, Crop can check if the central region of the cropped image
is indeed filled with data or is blank (see *note Blank pixels::), and
not produce any output when the center is blank, see the description
under ‘--checkcenter’ for more.

   When in catalog mode, Crop will run in parallel unless you set
‘--numthreads=1’, see *note Multi-threaded operations::.  Note that when
multiple outputs are created with threads, the outputs will not be
created in the same order.  This is because the threads are asynchronous
and thus not started in order.  This has no effect on each output, see
*note Finding reddest clumps and visual inspection:: for a tutorial on
effectively using this feature.

* Menu:

* Crop options::                A list of all the options with explanation.
* Crop output::                 The outputs of Crop.
* Crop known issues::           Known issues in running Crop.


File: gnuastro.info,  Node: Crop options,  Next: Crop output,  Prev: Invoking astcrop,  Up: Invoking astcrop

6.1.4.1 Crop options
....................

The options can be classified into the following contexts: Input, Output
and operating mode options.  Options that are common to all Gnuastro
program are listed in *note Common options:: and will not be repeated
here.

   When you are specifying the crop vertices your self (through
‘--section’, or ‘--polygon’) on relatively small regions (depending on
the resolution of your images) the outputs from image and WCS mode can
be approximately equivalent.  However, as the crop sizes get large, the
curved nature of the WCS coordinates have to be considered.  For
example, when using ‘--section’, the right ascension of the bottom left
and top left corners will not be equal.  If you only want regions within
a given right ascension, use ‘--polygon’ in WCS mode.

Input image parameters:

‘--hstartwcs=INT’
     Specify the first keyword card (line number) to start finding the
     input image world coordinate system information.  This is useful
     when certain header keywords of the input may cause bad conflicts
     with your crop (see an example described below).  To get line
     numbers of the header keywords, you can pipe the fully printed
     header into ‘cat -n’ like below:

          $ astfits image.fits -h1 | cat -n

     For example, distortions have only been present in WCSLIB from
     version 5.15 (released in mid 2016).  Therefore some pipelines
     still apply their own specific set of WCS keywords for distortions
     and put them into the image header along with those that WCSLIB
     does recognize.  So now that WCSLIB recognizes most of the standard
     distortion parameters, they will get confused with the old ones and
     give wrong results.  For example in the CANDELS-GOODS South images
     that were created before WCSLIB 5.15(1).

     The two ‘--hstartwcs’ and ‘--hendwcs’ are thus provided so when
     using older datasets, you can specify what region in the FITS
     headers you want to use to read the WCS keywords.  Note that this
     is only relevant for reading the WCS information, basic data
     information like the image size are read separately.  These two
     options will only be considered when the value to ‘--hendwcs’ is
     larger than that of ‘--hstartwcs’.  So if they are equal or
     ‘--hstartwcs’ is larger than ‘--hendwcs’, then all the input
     keywords will be parsed to get the WCS information of the image.

‘--hendwcs=INT’
     Specify the last keyword card to read for specifying the image
     world coordinate system on the input images.  See ‘--hstartwcs’

Crop box parameters:

‘-c FLT[,FLT[,...]]’
‘--center=FLT[,FLT[,...]]’
     The central position of the crop in the input image.  The positions
     along each dimension must be separated by a comma (<,>) and
     fractions are also acceptable.  The number of values given to this
     option must be the same as the dimensions of the input dataset.
     The width of the crop should be set with ‘--width’.  The units of
     the coordinates are read based on the value to the ‘--mode’ option,
     see below.

‘-w FLT[,FLT[,...]]’
‘--width=FLT[,FLT[,...]]’
     Width of the cropped region about coordinate given to ‘--center’.
     If in WCS mode, value(s) given to this option will be read in the
     same units as the dataset’s WCS information along this dimension.
     This option may take either a single value (to be used for all
     dimensions: ‘--width=10’ in image-mode will crop a $10\times10$
     pixel image) or multiple values (a specific value for each
     dimension: ‘--width=10,20’ in image-mode will crop a $10\times20$
     pixel image).

     The ‘--width’ option also accepts fractions.  For example if you
     want the width of your crop to be 3 by 5 arcseconds along RA and
     Dec respectively and you are in wcs-mode, you can use:
     ‘--width=3/3600,5/3600’.

     The final output will have an odd number of pixels to allow easy
     identification of the pixel which keeps your requested coordinate
     (from ‘--center’ or ‘--catalog’).  If you want an even sided crop,
     you can run Crop afterwards with ‘--section=":*-1,:*-1"’ or
     ‘--section=2:,2:’ (depending on which side you don’t need), see
     *note Crop section syntax::.

     The basic reason for making an odd-sided crop is that your given
     central coordinate will ultimately fall within a discrete pixel in
     the image (defined by the FITS standard).  When the crop has an odd
     number of pixels in each dimension, that pixel can be very well
     defined as the “central” pixel of the crop, making it unambiguously
     easy to identify.  However, for an even-sided crop, it will be very
     hard to identify the central pixel (it can be on any of the four
     pixels adjacent to the central point of the image!).

‘-l STR’
‘-l FLT:FLT,...’
‘--polygon=STR’
‘--polygon=FLT,FLT:FLT,FLT:...’
     Polygon vertice coordinates (when value is in ‘FLT,FLT:FLT,FLT:...’
     format) or the filename of a SAO DS9 region file (when the value
     has no ‘,’ or ‘:’ characters).  The vertices are used to define the
     polygon: in the same order given to this option.  When the vertices
     are not necessarily ordered in the proper order (for example one
     vertice in a square comes after its diagonal opposite), you can add
     the ‘--polygonsort’ option which will attempt to sort the vertices
     before cropping.  Note that for concave polygons, sorting is not
     recommended because there is no unique solution, for more, see the
     description under ‘--polygonsort’.

     This option can be used both in the image and WCS modes, see *note
     Crop modes::.  If a SAO DS9 region file is used, the coordinate
     mode of Crop will be determined by the contents of the file and any
     value given to ‘--mode’ is ignored.  The cropped image will be the
     size of the rectangular region that completely encompasses the
     polygon.  By default all the pixels that are outside of the polygon
     will be set as blank values (see *note Blank pixels::).  However,
     if ‘--polygonout’ is called all pixels internal to the vertices
     will be set to blank.  In WCS-mode, you may provide many FITS
     images/tiles: Crop will stitch them to produce this cropped region,
     then apply the polygon.

     The syntax for the polygon vertices is similar to, and simpler
     than, that for ‘--section’.  In short, the dimensions of each
     coordinate are separated by a comma (<,>) and each vertex is
     separated by a colon (<:>).  You can define as many vertices as you
     like.  If you would like to use space characters between the
     dimensions and vertices to make them more human-readable, then you
     have to put the value to this option in double quotation marks.

     For example, let’s assume you want to work on the deepest part of
     the WFC3/IR images of Hubble Space Telescope eXtreme Deep Field
     (HST-XDF). According to the web page
     (https://archive.stsci.edu/prepds/xdf/)(2) the deepest part is
     contained within the coordinates:

          [ (53.187414,-27.779152), (53.159507,-27.759633),
            (53.134517,-27.787144), (53.161906,-27.807208) ]

     They have provided mask images with only these pixels in the
     WFC3/IR images, but what if you also need to work on the same
     region in the full resolution ACS images?  Also what if you want to
     use the CANDELS data for the shallow region?  Running Crop with
     ‘--polygon’ will easily pull out this region of the image for you,
     irrespective of the resolution.  If you have set the operating mode
     to WCS mode in your nearest configuration file (see *note
     Configuration files::), there is no need to call ‘--mode=wcs’ on
     the command line.

          $ astcrop --mode=wcs desired-filter-image(s).fits           \
             --polygon="53.187414,-27.779152 : 53.159507,-27.759633 : \
                        53.134517,-27.787144 : 53.161906,-27.807208"

     More generally, you have an image and want to define the polygon
     yourself (it isn’t already published like the example above).  As
     the number of vertices increases, checking the vertex coordinates
     on a FITS viewer (for example SAO DS9) and typing them in, one by
     one, can be very tedious and prone to typo errors.  In such cases,
     you can make a polygon “region” in DS9 and using your mouse, easily
     define (and visually see) it.  Given that SAO DS9 has a graphic
     user interface (GUI), if you don’t have the polygon vertices
     before-hand, it is much more easier build your polygon there and
     pass it onto Crop through the region file.

     You can take the following steps to make an SAO DS9 region file
     containing your polygon.  Open your desired FITS image with SAO DS9
     and activate its “region” mode with Edit→Region.  Then define the
     region as a polygon with Region→Shape→Polygon.  Click on the
     approximate center of the region you want and a small square will
     appear.  By clicking on the vertices of the square you can shrink
     or expand it, clicking and dragging anywhere on the edges will
     enable you to define a new vertex.  After the region has been
     nicely defined, save it as a file with Region→“Save Regions”.  You
     can then select the name and address of the output file, keep the
     format as ‘REG (*.reg)’ and press the “OK” button.  In the next
     window, keep format as “ds9” and “Coordinate System” as “fk5” for
     RA and Dec (or “Image” for pixel coordinates).  A plain text file
     is now created (let’s call it ‘ds9.reg’) which you can pass onto
     Crop with ‘--polygon=ds9.reg’.

     For the expected format of the region file, see the description of
     ‘gal_ds9_reg_read_polygon’ in *note SAO DS9 library::.  However,
     since SAO DS9 makes this file for you, you don’t usually need to
     worry about its internal format unless something un-expected
     happens and you find a bug.

‘--polygonout’
     Keep all the regions outside the polygon and mask the inner ones
     with blank pixels (see *note Blank pixels::).  This is practically
     the inverse of the default mode of treating polygons.  Note that
     this option only works when you have only provided one input image.
     If multiple images are given (in WCS mode), then the full area
     covered by all the images has to be shown and the polygon excluded.
     This can lead to a very large area if large surveys like COSMOS are
     used.  So Crop will abort and notify you.  In such cases, it is
     best to crop out the larger region you want, then mask the smaller
     region with this option.

‘--polygonsort’
     Sort the given set of vertices to the ‘--polygon’ option.  For a
     concave polygon it will sort the vertices correctly, however for a
     convex polygon it there is no unique sorting, so be careful because
     the crop may not be what you expected.

     Polygons come in two classes: convex and concave (or generally,
     non-convex!), see below for a demonstration.  Convex polygons are
     those where all inner angles are less than 180 degrees.  By
     contrast, a convex polygon is one where an inner angle may be more
     than 180 degrees.

                      Concave Polygon        Convex Polygon

                       D --------C          D------------- C
                        \        |        E /              |
                         \E      |          \              |
                         /       |           \             |
                        A--------B             A ----------B

‘-s STR’
‘--section=STR’
     Section of the input image which you want to be cropped.  See *note
     Crop section syntax:: for a complete explanation on the syntax
     required for this input.

‘-C FITS/TXT’
‘--catalog=FITS/TXT’
     File name of catalog for making multiple crops from the input
     images/cubes.  The catalog can be in any of Gnuastro’s recognized
     *note Recognized table formats::.  The columns containing the
     coordinates for the crop centers can be specified with the
     ‘--coordcol’ option (using column names or numbers, see *note
     Selecting table columns::).  The catalog can also contain the name
     of each crop, you can specify the column containing the name with
     the ‘--namecol’.

‘--cathdu=STR/INT’
     The HDU (extension) containing the catalog (if the file given to
     ‘--catalog’ is a FITS file).  This can either be the HDU name (if
     it has one) or number (counting from 0).  By default (if this
     option is not given), the second HDU will be used (equivalent to
     ‘--cathdu=1’.  For more on how to specify the HDU, see the
     explanation of the ‘--hdu’ option in *note Input output options::.

‘-x STR/INT’
‘--coordcol=STR/INT’
     The column in a catalog to read as a coordinate.  The value can be
     either the column number (starting from 1), or a match/search in
     the table meta-data, see *note Selecting table columns::.  This
     option must be called multiple times, depending on the number of
     dimensions in the input dataset.  If it is called more than
     necessary, the extra columns (later calls to this option on the
     command-line or configuration files) will be ignored, see *note
     Configuration file precedence::.

‘-n STR/INT’
‘--namecol=STR/INT’
     Column selection of crop file name.  The value can be either the
     column number (starting from 1), or a match/search in the table
     meta-data, see *note Selecting table columns::.  This option can be
     used both in Image and WCS modes, and not a mandatory.  When a
     column is given to this option, the final crop base file name will
     be taken from the contents of this column.  The directory will be
     determined by the ‘--output’ option (current directory if not
     given) and the value to ‘--suffix’ will be appended.  When this
     column isn’t given, the row number will be used instead.

Output options:

‘-c FLT/INT’
‘--checkcenter=FLT/INT’
     Square box width of region in the center of the image to check for
     blank values.  If any of the pixels in this central region of a
     crop (defined by its center) are blank, then it will not be stored
     in an output file.  If the value to this option is zero, no
     checking is done.  This check is only applied when the cropped
     region(s) are defined by their center (not by the vertices, see
     *note Crop modes::).

     The units of the value are interpreted based on the ‘--mode’ value
     (in WCS or pixel units).  The ultimate checked region size (in
     pixels) will be an odd integer around the center (converted from
     WCS, or when an even number of pixels are given to this option).
     In WCS mode, the value can be given as fractions, for example if
     the WCS units are in degrees, ‘0.1/3600’ will correspond to a check
     size of 0.1 arcseconds.

     Because survey regions don’t often have a clean square or rectangle
     shape, some of the pixels on the sides of the survey FITS image
     don’t commonly have any data and are blank (see *note Blank
     pixels::).  So when the catalog was not generated from the input
     image, it often happens that the image does not have data over some
     of the points.

     When the given center of a crop falls in such regions or outside
     the dataset, and this option has a non-zero value, no crop will be
     created.  Therefore with this option, you can specify a width of a
     small box (3 pixels is often good enough) around the central pixel
     of the cropped image.  You can check which crops were created and
     which weren’t from the command-line (if ‘--quiet’ was not called,
     see *note Operating mode options::), or in Crop’s log file (see
     *note Crop output::).

‘-p STR’
‘--suffix=STR’
     The suffix (or post-fix) of the output files for when you want all
     the cropped images to have a special ending.  One case where this
     might be helpful is when besides the science images, you want the
     weight images (or exposure maps, which are also distributed with
     survey images) of the cropped regions too.  So in one run, you can
     set the input images to the science images and ‘--suffix=_s.fits’.
     In the next run you can set the weight images as input and
     ‘--suffix=_w.fits’.

‘--primaryimghdu’
     Write the output into the primary (0-th) HDU/extension of the
     output.  By default, like all Gnuastro’s default outputs, no data
     is written in the primary extension because the FITS standard
     suggests keeping that extension free of data and only for meta
     data.

‘-b’
‘--noblank’
     Pixels outside of the input image that are in the crop box will not
     be used.  By default they are filled with blank values (depending
     on type), see *note Blank pixels::.  This option only applies only
     in Image mode, see *note Crop modes::.

‘-z’
‘--zeroisnotblank’
     In float or double images, it is common to give the value of zero
     to blank pixels.  If the input image type is one of these two
     types, such pixels will also be considered as blank.  You can
     disable this behavior with this option, see *note Blank pixels::.

Operating mode options:

‘-O STR’
‘--mode=STR’
     Operate in Image mode or WCS mode when the input coordinates can be
     both image or WCS. The value must either be ‘img’ or ‘wcs’, see
     *note Crop modes:: for a full description.

   ---------- Footnotes ----------

   (1) <https://archive.stsci.edu/pub/hlsp/candels/goods-s/gs-tot/v1.0/>

   (2) <https://archive.stsci.edu/prepds/xdf/>


File: gnuastro.info,  Node: Crop output,  Next: Crop known issues,  Prev: Crop options,  Up: Invoking astcrop

6.1.4.2 Crop output
...................

The string given to ‘--output’ option will be interpreted depending on
how many crops were requested, see *note Crop modes:::

   • When a catalog is given, the value of the ‘--output’ (see *note
     Common options::) will be read as the directory to store the output
     cropped images.  Hence if it doesn’t already exist, Crop will abort
     with an “No such file or directory” error.

     The crop file names will consist of two parts: a variable part (the
     row number of each target starting from 1) along with a fixed
     string which you can set with the ‘--suffix’ option.  Optionally,
     you may also use the ‘--namecol’ option to define a column in the
     input catalog to use as the file name instead of numbers.

   • When only one crop is desired, the value to ‘--output’ will be read
     as a file name.  If no output is specified or if it is a directory,
     the output file name will follow the automatic output names of
     Gnuastro, see *note Automatic output::: The string given to
     ‘--suffix’ will be replaced with the ‘.fits’ suffix of the input.

   By default, as suggested by the FITS standard and implemented in all
Gnuastro programs, the first/primary extension of the output files will
only contain meta data.  The cropped images/cubes will be written into
the 2nd HDU of their respective FITS file (which is actually counted as
‘1’ because HDU counting starts from ‘0’).  However, if you want the
cropped data to be written into the primary (0-th) HDU, run Crop with
the ‘--primaryimghdu’ option.

   The header of each output cropped image will contain the names of the
input image(s) it was cut from.  If a name is longer than the 70
character space that the FITS standard allows for header keyword values,
the name will be cut into several keywords from the nearest slash (</>).
The keywords have the following format: ‘ICFn_m’ (for Crop File).  Where
‘n’ is the number of the image used in this crop and ‘m’ is the part of
the name (it can be broken into multiple keywords).  Following the name
is another keyword named ‘ICFnPIX’ which shows the pixel range from that
input image in the same syntax as *note Crop section syntax::.  So this
string can be directly given to the ‘--section’ option later.

   Once done, a log file can be created in the current directory with
the ‘--log’ option.  This file will have three columns and the same
number of rows as the number of cropped images.  There are also comments
on the top of the log file explaining basic information about the run
and descriptions for the columns.  A short description of the columns is
also given below:

  1. The cropped image file name for that row.
  2. The number of input images that were used to create that image.
  3. A ‘0’ if the central few pixels (value to the ‘--checkcenter’
     option) are blank and ‘1’ if they aren’t.  When the crop was not
     defined by its center (see *note Crop modes::), or ‘--checkcenter’
     was given a value of 0 (see *note Invoking astcrop::), the center
     will not be checked and this column will be given a value of ‘-1’.


File: gnuastro.info,  Node: Crop known issues,  Prev: Crop output,  Up: Invoking astcrop

6.1.4.3 Crop known issues
.........................

When running Crop, you may encounter strange errors and bugs.  In these
cases, please report a bug and we will try to fix it as soon as
possible, see *note Report a bug::.  However, some things are beyond our
control, or may take too long to fix directly.  In this section we list
such known issues that may occur in known cases and suggest the hack (or
work-around) to fix the problem:

Crash with ‘Killed’ when cropping catalog from ‘.fits.gz’
     This happens because CFISTIO (that reads and writes FITS files)
     will internally decompress the file in a temporary place (possibly
     in the RAM), then start reading from it.  On the other hand, by
     default when given a catalog (with many crops) and not specifying
     ‘--numthreads’, Crop will use the maximum number of threads
     available on your system to do each crop faster.  On an normal (not
     compressed) file, parallel access will not cause a problem,
     however, when attempting parallel access with the maximum number of
     threads on a compressed file, CFITSIO crashes with ‘Killed’.
     Therefore the following solutions can be used to fix this crash:

        • Decrease the number of threads (at the minimum, set
          ‘--numthreads=1’).  Since this solution doesn’t attempt to
          change any of your previous Crop command components or doesn’t
          change your local file structure, it is the preferred way.

        • Decompress the file (with the command below) and feed the
          ‘.fits’ file into Crop without changing the number of threads.

               $ gunzip -k image.fits.gz


File: gnuastro.info,  Node: Arithmetic,  Next: Convolve,  Prev: Crop,  Up: Data manipulation

6.2 Arithmetic
==============

It is commonly necessary to do operations on some or all of the elements
of a dataset independently (pixels in an image).  For example, in the
reduction of raw data it is necessary to subtract the Sky value (*note
Sky value::) from each image image.  Later (once the images as warped
into a single grid using Warp for example, see *note Warp::), the images
are co-added (the output pixel grid is the average of the pixels of the
individual input images).  Arithmetic is Gnuastro’s program for such
operations on your datasets directly from the command-line.  It
currently uses the reverse polish or post-fix notation, see *note
Reverse polish notation:: and will work on the native data types of the
input images/data to reduce CPU and RAM resources, see *note Numeric
data types::.  For more information on how to run Arithmetic, please see
*note Invoking astarithmetic::.

* Menu:

* Reverse polish notation::     The current notation style for Arithmetic
* Arithmetic operators::        List of operators known to Arithmetic
* Invoking astarithmetic::      How to run Arithmetic: options and output


File: gnuastro.info,  Node: Reverse polish notation,  Next: Arithmetic operators,  Prev: Arithmetic,  Up: Arithmetic

6.2.1 Reverse polish notation
-----------------------------

The most common notation for arithmetic operations is the infix notation
(https://en.wikipedia.org/wiki/Infix_notation) where the operator goes
between the two operands, for example $4+5$.  The infix notation is the
preferred way in most programming languages which come with scripting
features for large programs.  This is because the infix notation
requires a way to define precedence when more than one operator is
involved.

   For example consider the statement ‘5 + 6 / 2’.  Should 6 first be
divided by 2, then added by 5?  Or should 5 first be added with 6, then
divided by 2?  Therefore we need parenthesis to show precedence:
‘5+(6/2)’ or ‘(5+6)/2’.  Furthermore, if you need to leave a value for
later processing, you will need to define a variable for it; for example
‘a=(5+6)/2’.

   Gnuastro provides libraries where you can also use infix notation in
C or C++ programs.  However, Gnuastro’s programs are primarily designed
to be run on the command-line and the level of complexity that infix
notation requires can be annoying/confusing to write on the command-line
(where they can get confused with the shell’s parenthesis or variable
definitions).  Therefore Gnuastro’s Arithmetic and Table (when doing
column arithmetic) programs use the the post-fix notation, also known as
reverse polish notation
(https://en.wikipedia.org/wiki/Reverse_Polish_notation).  For example,
instead of writing ‘5+6’, we write ‘5 6 +’.

   The Wikipedia article on the reverse polish notation provides some
excellent explanation on this notation but here we will give a short
summary here for self-sufficiency.  In short, in the reverse polish
notation, the operator is placed after the operands.  As we will see
below this removes the need to define parenthesis and lets you use
previous values without needing to define a variable.  In the future(1)
we do plan to also optionally allow infix notation when arithmetic
operations on datasets are desired, but due to time constraints on the
developers we can’t do it immediately.

   To easily understand how the reverse polish notation works, you can
think of each operand (‘5’ and ‘6’ in the example above) as a node in a
“last-in-first-out” stack.  One such stack in daily life is a stack of
dishes in the kitchen: you put a clean dish, on the top of a stack of
dishes when its ready for later usage.  Later, when you need a dish, you
pick the top one (hence the “last” dish placed “in” the stack is the
“first” dish that comes “out” when necessary).

   Each operator will need a certain number of operands (in the example
above, the ‘+’ operator needs two operands: ‘5’ and ‘6’).  In the
kitchen metaphor, an operator can be an oven.  Every time an operator is
confronted, the operator takes (or “pops”) the number of operands it
needs from the top of the stack (so they don’t exist in the stack any
more), does its operation, and places (or “pushes”) the result back on
top of the stack.  So if you want the average of 5 and 6, you would
write: ‘5 6 + 2 /’.  The operations that are done are:

  1. ‘5’ is an operand, so Arithmetic pushes it to the top of the stack
     (which is initially empty).  In the kitchen metaphor, you can
     visualize this as taking a new dish from the cabinet, putting the
     number 5 inside of the dish, and putting the dish on top of the
     (empty) cooking table in front of you.  You now have a stack of one
     dish on the table in front of you.
  2. ‘6’ is also an operand, so it is pushed to the top of the stack.
     Like before, you can visualize this as taking a new dish from the
     cabinet, putting the number 6 in it and placing it on top of the
     previous dish.  You now have a stack of two dishes on the table in
     front of you.
  3. ‘+’ is a _binary_ operator, so it will pop the top two elements of
     the stack out of it, and perform addition on them (the order is
     $5+6$ in the example above).  The result is ‘11’ which is pushed to
     the top of the stack.

     To visualize this, you can think of the ‘+’ operator as an oven
     with a place for two dishes.  You pick up the top-most dish (that
     has the number 6 in it) and put it in the oven.  The top dish is
     now the one that has has the number 5.  You also pick it up and put
     it in the oven, and close the oven door.  When the oven has
     finished its cooking, it produces a single output (in one dish,
     with the number 11 inside of it).  You take that output dish and
     put it back on the table.  You now have a stack of one dish on the
     table in front of you.
  4. ‘2’ is an operand so push it onto the top of the stack.  In the
     kitchen metaphor, you again go to the cabinet, pick up a dish and
     put the number 2 inside of it and put the dish over the previous
     dish (that has the number 11).  You now have a stack of two dishes
     on the table in front of you.
  5. ‘/’ (division) is a binary operator, so pull out the top two
     elements of the stack (top-most is ‘2’, then ‘11’) and divide the
     second one by the first.  In the kitchen metaphor, the ‘/’ operator
     can be visualized as a microwave that takes two dishes.  But unlike
     the oven (‘+’ operator) before, the order of inputs matters (they
     are on top of each other: with the top dish holder being the
     nominator and the bottom one being the denominator).  Again, you
     look to your stack of dishes on the table.

     You pick up the top one (with value 2 inside of it) and put it in
     the microwave’s bottom (denominator) dish holder.  Then you go back
     to your stack of dishes on the table and pick up the top dish (with
     value 11 inside of it) and put that in the top (nominator) dish
     holder.  The microwave will do its work and when its finished,
     returns a new dish with the single value 5.5 inside of it.  You
     pick up the dish from the microwave and place it back on the table.

  6. There are no more operands or operators, so simply return the
     remaining operand in the output.  In the kitchen metaphor, you see
     that your recipe has no more steps, so you just pick up the
     remaining dish and take it to the dining room to enjoy a good
     dinner.

   In the Arithmetic program, the operands can be FITS images of any
dimensionality, or numbers (see *note Invoking astarithmetic::).  In
Table’s column arithmetic, they can be any column in the table (a series
of numbers in an array) or a single number (see *note Column
arithmetic::).

   With this notation, very complicated procedures can be created
without the need for parenthesis or worrying about precedence.  Even
functions which take an arbitrary number of arguments can be defined in
this notation.  This is a very powerful notation and is used in
languages like Postscript (2) which produces PDF files when compiled.

   ---------- Footnotes ----------

   (1) <https://savannah.gnu.org/task/index.php?13867>

   (2) See the EPS and PDF part of *note Recognized file formats:: for a
little more on the Postscript language.


File: gnuastro.info,  Node: Arithmetic operators,  Next: Invoking astarithmetic,  Prev: Reverse polish notation,  Up: Arithmetic

6.2.2 Arithmetic operators
--------------------------

In this section, list of recognized operators in Arithmetic (and the
Table program’s *note Column arithmetic::) and discussed in detail with
examples.  As mentioned before, to be able to easily do complex
operations on the command-line, the Reverse Polish Notation is used
(where you write ‘$4\quad5\quad+$’ instead of ‘$4 + 5$’), if you aren’t
already familiar with it, before continuing, please see *note Reverse
polish notation::.

   The operands to all operators can be a data array (for example a FITS
image or data cube) or a number, the output will be an array or number
according to the inputs.  For example a number multiplied by an array
will produce an array.  The numerical data type of the output of each
operator is described within it.

*Blank pixels in Arithmetic:* Blank pixels in the image (see *note Blank
pixels::) will be stored based on the data type.  When the input is
floating point type, blank values are NaN. One aspect of NaN values is
that by definition they will fail on _any_ comparison.  Hence both equal
and not-equal operators will fail when both their operands are NaN!
Therefore, the only way to guarantee selection of blank pixels is
through the ‘isblank’ operator explained above.

   One way you can exploit this property of the NaN value to your
advantage is when you want a fully zero-valued image (even over the
blank pixels) based on an already existing image (with same size and
world coordinate system settings).  The following command will produce
this for you:

     $ astarithmetic input.fits nan eq --output=all-zeros.fits

Note that on the command-line you can write NaN in any case (for example
‘NaN’, or ‘NAN’ are also acceptable).  Reading NaN as a floating point
number in Gnuastro isn’t case-sensitive.

* Menu:

* Basic mathematical operators::  For example +, -, /, log, pow, and etc.
* Trigonometric and hyperbolic operators::  sin, cos, atan, asinh, and etc
* Unit conversion operators::   magnitudes to counts, or parsecs to AUs, and etc.
* Statistical operators::       Statistics of a single dataset (for example mean).
* Stacking operators::          Coadding or combining multiple datasets into one.
* Filtering operators::         Smoothing a dataset through mixing pixel with neighbors.
* Interpolation operators::     Giving blank pixels a value.
* Dimensionality changing operators::  Collapse or expand a dataset.
* Conditional operators::       Select certain pixels within the dataset.
* Mathematical morphology operators::  Work on binary images, for example erode.
* Bitwise operators::           Work on bits within one pixel.
* Numerical type conversion operators::  Convert the numeric datatype of a dataset.
* Adding noise operators::      Add noise to a dataset.
* Elliptical shape operators::  Operations that are focused on an ellipse.
* Building new dataset::        How to construct an empty dataset from scratch.
* Operand storage in memory or a file::  Tools for complex operations in one command.


File: gnuastro.info,  Node: Basic mathematical operators,  Next: Trigonometric and hyperbolic operators,  Prev: Arithmetic operators,  Up: Arithmetic operators

6.2.2.1 Basic mathematical operators
....................................

These are some of the most common operations you will be doing on your
data and include, so no further explanation is necessary.  If you are
new to Gnuastro, just read the description of each carefully.

‘+’
     Addition, so “‘4 5 +’” is equivalent to $4+5$.  For example in the
     command below, the value 20000 is added to each pixel’s value in
     ‘image.fits’:
          $ astarithmetic 20000 image.fits +
     You can also use this operator is to sum the values of one pixel in
     two images (which have to be the same size).  For example in the
     commands below (which are identical, see paragraph after the
     commands), each pixel of ‘sum.fits’ is the sum of the same pixel’s
     values in ‘a.fits’ and ‘b.fits’.
          $ astarithmetic a.fits b.fits + -h1 -h1 --output=sum.fits
          $ astarithmetic a.fits b.fits + -g1     --output=sum.fits
     The HDU/extension has to be specified for each image with ‘-h’.
     However, if the HDUs are the same in all inputs, you can use ‘-g’
     to only specify the HDU once

     If you need to add more than one dataset, one way is to use this
     operator multiple times, for example see the two commands below
     that are identical in the Reverse Polish Notation (*note Reverse
     polish notation::):
          $ astarithmetic a.fits b.fits + c.fits + -osum.fits
          $ astarithmetic a.fits b.fits c.fits + + -osum.fits

     However, this can get annoying/buggy if you have more than three or
     four images, in that case, a better way to sum data is to use the
     ‘sum’ operator (which also ignores blank pixels), that is discussed
     below.

‘-’
     Subtraction, so “‘4 5 -’” is equivalent to $4-5$.  Usage of this
     operator is similar to ‘+’ operator, for example:
          $ astarithmetic 20000 image.fits -
          $ astarithmetic a.fits b.fits - -g1 --output=sub.fits

‘x’
     Multiplication, so “‘4 5 x’” is equivalent to $4\times5$.  For
     example in the command below, the value of each output pixel is 5
     times its value in ‘image.fits’:
          $ astarithmetic image.fits 5 x
     And you can multiply the value of each pixel in two images, like
     this:
          $ astarithmetic a.fits a.fits x -g1 –output=multip.fits

‘/’
     Division, so “‘4 5 /’” is equivalent to $4/5$.  Like the
     multiplication, for example
          $ astarithmetic image.fits 5 -h1 /
          $ astarithmetic a.fits b.fits / -g1 –output=div.fits

‘%’
     Modulo (remainder), so “‘3 2 %’” will return $1$.  Note that the
     modulo operator only works on integer types (see *note Numeric data
     types::).  This operator is therefore not defined for most
     processed astronomical astronomical images that have floating-point
     value.  However it is useful in labeled images, for example *note
     Segment output::).  In such cases, each pixel is the integer label
     of the object it is associated with hence with the example command
     below, we can change the labels to only be between 1 and 4 and
     decrease all objects on the image to 4/5th (all objects with a
     label that is a multiple of 5 will be set to 0).
          $ astarithmetic label.fits 5 1 %

‘abs’
     Absolute value of first operand, so “‘4 abs’” is equivalent to
     $|4|$.  For example the output of the command bellow will not have
     any negative pixels (all negative pixels will be multiplied by $-1$
     to become positive)
          $ astarithmetic image.fits abs

‘pow’
     First operand to the power of the second, so “‘4.3 5 pow’” is
     equivalent to $4.3^{5}$.  For example with the command below all
     pixels will be squared
          $ astarithmetic image.fits 2 pow

‘sqrt’
     The square root of the first operand, so “‘5 sqrt’” is equivalent
     to $\sqrt{5}$.  Since the square root is only defined for positive
     values, any negative-valued pixel will become NaN (blank).  The
     output will have a floating point type, but its precision is
     determined from the input: if the input is a 64-bit floating point,
     the output will also be 64-bit.  Otherwise, the output will be
     32-bit floating point (see *note Numeric data types:: for the
     respective precision).  Therefore if you require 64-bit precision
     in estimating the square root, convert the input to 64-bit floating
     point first, for example with ‘5 float64 sqrt’.  For example each
     pixel of the output of the command below will be the square root of
     that pixel in the input.
          $ astarithmetic image.fits sqrt

     If you just want to scale an image with negative values using this
     operator (for better visual inspection, and the actual values don’t
     matter for you), you can subtract the image from its minimum value,
     then take its square root:

          $ astarithmetic image.fits image.fits minvalue - sqrt -g1

     Alternatively, to avoid reading the image into memory two times,
     you can use the ‘set-’ operator to read it into the variable ‘i’
     and use ‘i’ two times to speed up the operation (described below):

          $ astarithmetic image.fits set-i i i minvalue - sqrt

‘log’
     Natural logarithm of first operand, so “‘4 log’” is equivalent to
     $ln(4)$.  Negative pixels will become NaN, and the output type is
     determined from the input, see the explanation under ‘sqrt’ for
     more on these features.  For example the command below will take
     the natural logarithm of every pixel in the input.
          $ astarithmetic image.fits log --output=log.fits

‘log10’
     Base-10 logarithm of first popped operand, so “‘4 log’” is
     equivalent to $log_{10}(4)$.  Negative pixels will become NaN, and
     the output type is determined from the input, see the explanation
     under ‘sqrt’ for more on these features.  For example the command
     below will take the base-10 logarithm of every pixel in the input.
          $ astarithmetic image.fits log10


File: gnuastro.info,  Node: Trigonometric and hyperbolic operators,  Next: Unit conversion operators,  Prev: Basic mathematical operators,  Up: Arithmetic operators

6.2.2.2 Trigonometric and hyperbolic operators
..............................................

All the trigonometric and hyperbolic functions are described here.  One
good thing with these operators is that they take inputs and outputs in
degrees (which we usually need as input or output), not radians (like
most other programs/libraries).

‘sin’
‘cos’
‘tan’
     Basic trigonometric functions.  They take one operand, in units of
     degrees.

‘asin’
‘acos’
‘atan’
     Inverse trigonometric functions.  They take one operand and the
     returned values are in units of degrees.

‘atan2’
     Inverse tangent (output in units of degrees) that uses the signs of
     the input coordinates to distinguish between the quadrants.  This
     operator therefore needs two operands: the first popped operand is
     assumed to be the X axis position of the point, and the second
     popped operand is its Y axis coordinate.

     For example see the commands below.  To be more clear, we are using
     Table’s *note Column arithmetic:: which uses exactly the same
     internal library function as the Arithmetic program for images.  We
     are showing the results for four points in the four quadrants of
     the 2D space (if you want to try running them, you don’t need to
     type/copy the parts after <#>).  The first point (2,2) is in the
     first quadrant, therefore the returned angle is 45 degrees.  But
     the second, third and fourth points are in the quadrants of the
     same order, and the returned angles reflect the quadrant.

          $ echo " 2  2" | asttable -c'arith $2 $1 atan2'   # -->   45
          $ echo " 2 -2" | asttable -c'arith $2 $1 atan2'   # -->  -45
          $ echo "-2 -2" | asttable -c'arith $2 $1 atan2'   # --> -135
          $ echo "-2  2" | asttable -c'arith $2 $1 atan2'   # -->  135

     However, if you simply use the classic arc-tangent operator
     (‘atan’) for the same points, the result will only be in two
     quadrants as you see below:

          $ echo " 2  2" | asttable -c'arith $2 $1 / atan'  # -->   45
          $ echo " 2 -2" | asttable -c'arith $2 $1 / atan'  # -->  -45
          $ echo "-2 -2" | asttable -c'arith $2 $1 / atan'  # -->   45
          $ echo "-2  2" | asttable -c'arith $2 $1 / atan'  # -->  -45

‘sinh’
‘cosh’
‘tanh’
     Hyperbolic sine, cosine, and tangent.  These operators take a
     single operand.

‘asinh’
‘acosh’
‘atanh’
     Inverse Hyperbolic sine, cosine, and tangent.  These operators take
     a single operand.


File: gnuastro.info,  Node: Unit conversion operators,  Next: Statistical operators,  Prev: Trigonometric and hyperbolic operators,  Up: Arithmetic operators

6.2.2.3 Unit conversion operators
.................................

It often happens that you have data in one unit (for example magnitudes
to measure the brightness of a galaxy), but would like to convert it
into another (for example electron counts on your CCD). While the
equations for the unit conversions can be easily found on the internet,
the operators in this section are designed to simplify the process and
let you do it easily.

‘counts-to-mag’
     Convert counts (usually CCD outputs) to magnitudes using the given
     zeropoint.  The zero point is the first popped operand and the
     count image or value is the second popped operand.

     For example assume you have measured the standard deviation of the
     noise in an image to be ‘0.1’ counts, and the image’s zero point is
     ‘22.5’ and you want to measure the _per-pixel_ surface brightness
     limit of the dataset(1).  To apply this operator on an image,
     simply replace ‘0.1’ with the image name, as described below.

          $ astarithmetic 0.1 22.5 counts-to-mag --quiet

     Of course, you can also convert every pixel in an image (or table
     column in Table’s *note Column arithmetic::) with this operator if
     you replace the second popped operand with an image/column name.
     For an example of applying this operator on an image, see the
     description of surface brightness in *note Brightness flux
     magnitude::, where we’ll convert an image’s pixel values to surface
     brightness.

‘mag-to-counts’
     Convert magnitudes to counts (usually CCD outputs) using the given
     zeropoint.  The zero point is the first popped operand and the
     magnitude value is the second.  For example if an object has a
     magnitude of 20, you can estimate the counts corresponding to it
     (when the image has a zeropoint of 24.8) with this command: Note
     that because the output is a single number, we are using ‘--quiet’
     to avoid printing extra information.

          $ astarithmetic 20 24.8 mag-to-counts --quiet

‘counts-to-jy’
     Convert counts (usually CCD outputs) to Janskys through an
     AB-magnitude based zeropoint.  The top-popped operand is assumed to
     be the AB-magnitude zero point and the second-popped operand is
     assumed to be a dataset in units of counts (an image in Arithmetic,
     and a column in Table’s *note Column arithmetic::).  For the full
     equation and basic definitions, see *note Brightness flux
     magnitude::.

     For example SDSS images are calibrated in units of nanomaggies,
     with a fixed zero point magnitude of 22.5.  Therefore you can
     convert the units of SDSS image pixels to Janskys with the command
     below:

          $ astarithmetic sdss-image.fits 22.5 counts-to-jy

‘au-to-pc’
     Convert Astronomical Units (AUs) to Parsecs (PCs).  This operator
     takes a single argument which is interpreted to be the input AUs.
     The conversion is based on the definition of Parsecs: $1 \rm{PC} =
     1/tan(1^{\prime\prime}) \rm{AU}$, where $1^{\prime\prime}$ is one
     arcseconds.  In other words, $1 (\rm{PC}) = 648000/\pi (\rm{AU})$.
     For example, if we take Pluto’s average distance to the Sun to be
     40 AUs, we can obtain its distance in Parsecs using this command:

          echo 40 | asttable -c'arith $1 au-to-pc'

‘pc-to-au’
     Convert Parsecs (PCs) to Astronomical Units (AUs).  This operator
     takes a single argument which is interpreted to be the input PCs.
     For more on the conversion equation, see description of ‘au-to-pc’.
     For example, Proxima Centauri (the nearest star to the Solar
     system) is 1.3020 Parsecs from the Sun, we can calculate this
     distance in units of AUs with the command below:

          echo 1.3020 | asttable -c'arith $1 pc-to-au'

‘ly-to-pc’
     Convert Light-years (LY) to Parsecs (PCs).  This operator takes a
     single argument which is interpreted to be the input LYs.  The
     conversion is done from IAU’s definition of the light-year
     (9460730472580800 m $\approx$ 63241.077 AU = 0.306601 PC, for the
     conversion of AU to PC, see the description of ‘au-to-pc’).

     For example the distance of Andromeda galaxy to our galaxy is 2.5
     million light-years, so its distance in kilo-Parsecs can be
     calculated with the command below (note that we want the output in
     kilo-parsecs, so we are dividing the output of this operator by
     1000):

          echo 2.5e6 | asttable -c'arith $1 ly-to-pc 1000 /'

‘pc-to-ly’
     Convert Parsecs (PCs) to Light-years (LY). This operator takes a
     single argument which is interpreted to be the input PCs.  For the
     conversion and an example of the inverse of this operator, see the
     description of ‘ly-to-pc’.

‘ly-to-au’
     Convert Light-years (LY) to Astronomical Units (AUs).  This
     operator takes a single argument which is interpreted to be the
     input LYs.  For the conversion and a similar example, see the
     description of ‘ly-to-pc’.

‘au-to-ly’
     Convert Astronomical Units (AUs) to Light-years (LY). This operator
     takes a single argument which is interpreted to be the input AUs.
     For the conversion and a similar example, see the description of
     ‘ly-to-pc’.

   ---------- Footnotes ----------

   (1) The _per-pixel_ surface brightness limit is the magnitude of the
noise standard deviation.  For more on surface brightness see *note
Brightness flux magnitude::.  In the example command, because the output
is a single number, we are using ‘--quiet’ to avoid printing extra
information.


File: gnuastro.info,  Node: Statistical operators,  Next: Stacking operators,  Prev: Unit conversion operators,  Up: Arithmetic operators

6.2.2.4 Statistical operators
.............................

The operators in this section take a single dataset as input, and will
return the desired statistic as a single value.

‘minvalue’
     Minimum value in the first popped operand, so “‘a.fits minvalue’”
     will push the minimum pixel value in this image onto the stack.
     When this operator acts on a single image, the output (operand that
     is put back on the stack) will no longer be an image, but a number.
     The output of this operand is in the same type as the input.  This
     operator is mainly intended for multi-element datasets (for example
     images or data cubes), if the popped operand is a number, it will
     just return it without any change.

     Note that when the final remaining/output operand is a single
     number, it is printed onto the standard output.  For example with
     the command below the minimum pixel value in ‘image.fits’ will be
     printed in the terminal:
          $ astarithmetic image.fits minvalue

     However, the output above also includes a lot of extra information
     that are not relevant in this context.  If you just want the final
     number, run Arithmetic in quiet mode:
          $ astarithmetic image.fits minvalue -q

     Also see the description of ‘sqrt’ for other example usages of this
     operator.

‘maxvalue’
     Maximum value of first operand in the same type, similar to
     ‘minvalue’, see the description there for more.  For example
          $ astarithmetic image.fits maxvalue -q

‘numbervalue’
     Number of non-blank elements in first operand in the ‘uint64’ type
     (since it is always a positive integer, see *note Numeric data
     types::).  Its usage is similar to ‘minvalue’, for example
          $ astarithmetic image.fits numbervalue -q

‘sumvalue’
     Sum of non-blank elements in first operand in the ‘float32’ type.
     Its usage is similar to ‘minvalue’, for example
          $ astarithmetic image.fits sumvalue -q

‘meanvalue’
     Mean value of non-blank elements in first operand in the ‘float32’
     type.  Its usage is similar to ‘minvalue’, for example
          $ astarithmetic image.fits meanvalue -q

‘stdvalue’
     Standard deviation of non-blank elements in first operand in the
     ‘float32’ type.  Its usage is similar to ‘minvalue’, for example
          $ astarithmetic image.fits stdvalue -q

‘medianvalue’
     Median of non-blank elements in first operand with the same type.
     Its usage is similar to ‘minvalue’, for example
          $ astarithmetic image.fits medianvalue -q

‘unique’
     Remove all duplicate (and blank) elements from the first popped
     operand.  The unique elements of the dataset will be stored in a
     single-dimensional dataset.

     Recall that by default, single-dimensional datasets are stored as a
     table column in the output.  But you can use ‘--onedasimage’ or
     ‘--onedonstdout’ to respectively store them as a single-dimensional
     FITS array/image, or to print them on the standard output.

     Although you can use this operator on the floating point dataset,
     due to floating-point errors it may give non-reasonable values:
     because the tenth digit of the decimal point is also considered
     although it may be statistically meaningless, see *note Numeric
     data types::.  It is therefore better/recommended to use it on the
     integer dataset like the labeled images of *note Segment output::
     where each pixel has the integer label of the object/clump it is
     associated with.  For example let’s assume you have cropped a
     region of a larger labeled image and want to find the
     labels/objects that are within the crop.  With this operator, this
     job is trivial:
          $ astarithmetic seg-crop.fits unique

‘size’
     Size of the dataset along a given FITS (or FORTRAN) dimension
     (counting from 1).  The desired dimension should be the first
     popped operand and the dataset must be the second popped operand.
     The output will be a single unsigned integer (dimensions cannot be
     negative).  For example, the following command will produce the
     size of the first extension/HDU (the default HDU) of ‘a.fits’ along
     the second FITS axis.

          $ astarithmetic a.fits 2 size


File: gnuastro.info,  Node: Stacking operators,  Next: Filtering operators,  Prev: Statistical operators,  Up: Arithmetic operators

6.2.2.5 Stacking operators
..........................

The operators in this section are used when you have multiple datasets
that you would like to merge into one, commonly known as “stacking” or
“coaddition”.  For example, you have taken ten exposures of your
scientific target, and you would like to combine them all into one deep
stacked image that is deeper.

   When calling these operators you should determine how many operands
they should take in (unlike the rest of the operators that have a fixed
number of input operands).  As described in the first operand below, you
do this through their first popped operand (which should be a single
integer number that is larger than one).

‘min’
     For each pixel, find the minimum value in all given datasets.  The
     output will have the same type as the input.

     The first popped operand to this operator must be a positive
     integer number which specifies how many further operands should be
     popped from the stack.  All the subsequently popped operands must
     have the same type and size.  This operator (and all the
     variable-operand operators similar to it that are discussed below)
     will work in multi-threaded mode unless Arithmetic is called with
     the ‘--numthreads=1’ option, see *note Multi-threaded operations::.

     Each pixel of the output of the ‘min’ operator will be given the
     minimum value of the same pixel from all the popped
     operands/images.  For example the following command will produce an
     image with the same size and type as the three inputs, but each
     output pixel value will be the minimum of the same pixel’s values
     in all three input images.

          $ astarithmetic a.fits b.fits c.fits 3 min

     Important notes:

        • NaN/blank pixels will be ignored, see *note Blank pixels::.

        • The output will have the same type as the inputs.  This is
          natural for the ‘min’ and ‘max’ operators, but for other
          similar operators (for example ‘sum’, or ‘average’) the
          per-pixel operations will be done in double precision floating
          point and then stored back in the input type.  Therefore, if
          the input was an integer, C’s internal type conversion will be
          used.

        • The operation will be multi-threaded, greatly speeding up the
          process if you have large and numerous data to stack.  You can
          disable multi-threaded operations with the ‘--numthreads=1’
          option (see *note Multi-threaded operations::).

‘max’
     For each pixel, find the maximum value in all given datasets.  The
     output will have the same type as the input.  This operator is
     called similar to the ‘min’ operator, please see there for more.
     For example
          $ astarithmetic a.fits b.fits c.fits 3 min -omax.fits

‘number’
     For each pixel count the number of non-blank pixels in all given
     datasets.  The output will be an unsigned 32-bit integer datatype
     (see *note Numeric data types::).  This operator is called similar
     to the ‘min’ operator, please see there for more.  For example
          $ astarithmetic a.fits b.fits c.fits 3 number -onum.fits

     Some datasets may have blank values (which are also ignored in all
     similar operators like ‘min’, ‘sum’, ‘mean’ or ‘median’).  Hence,
     the final pixel values of this operator will not, in general, be
     equal to the number of inputs.  This operator is therefore mostly
     called in parallel with those operators to know the “weight” of
     each pixel (in case you want to only keep pixels that had the full
     exposure for example).

‘sum’
     For each pixel, calculate the sum in all given datasets.  The
     output will have the a single-precision (32-bit) floating point
     type.  This operator is called similar to the ‘min’ operator,
     please see there for more.  For example
          $ astarithmetic a.fits b.fits c.fits 3 sum -ostack-sum.fits

‘mean’
     For each pixel, calculate the mean in all given datasets.  The
     output will have the a single-precision (32-bit) floating point
     type.  This operator is called similar to the ‘min’ operator,
     please see there for more.  For example
          $ astarithmetic a.fits b.fits c.fits 3 mean -ocoadd-mean.fits

‘std’
     For each pixel, find the standard deviation in all given datasets.
     The output will have the a single-precision (32-bit) floating point
     type.  This operator is called similar to the ‘min’ operator,
     please see there for more.  For example
          $ astarithmetic a.fits b.fits c.fits 3 std -ostd.fits

‘median’
     For each pixel, find the median in all given datasets.  The output
     will have the a single-precision (32-bit) floating point type.
     This operator is called similar to the ‘min’ operator, please see
     there for more.  For example
          $ astarithmetic a.fits b.fits c.fits 3 mean \
                          --output=stack-median.fits

‘quantile’
     For each pixel, find the quantile from all given datasets.  The
     output will have the same numeric data type and size as the input
     datasets.  Besides the input datasets, the quantile operator also
     needs a single parameter (the requested quantile).  The parameter
     should be the first popped operand, with a value between (and
     including) 0 and 1.  The second popped operand must be the number
     of datasets to use.

     In the example below, the first-popped operand (‘0.7’) is the
     quantile, the second-popped operand (‘3’) is the number of datasets
     to pop.

          astarithmetic a.fits b.fits c.fits 3 0.7 quantile

‘sigclip-number’
     For each pixel, find the sigma-clipped number (after removing
     outliers) in all given datasets.  The output will have the an
     unsigned 32-bit integer type (see *note Numeric data types::).

     This operator will combine the specified number of inputs into a
     single output that contains the number of remaining elements after
     $\sigma$-clipping on each element/pixel (for more on
     $\sigma$-clipping, see *note Sigma clipping::).  This operator is
     very similar to ‘min’, with the exception that it expects two
     operands (parameters for sigma-clipping) before the total number of
     inputs.  The first popped operand is the termination criteria and
     the second is the multiple of $\sigma$.

     For example in the command below, the first popped operand (‘0.2’)
     is the sigma clipping termination criteria.  If the termination
     criteria is larger than, or equal to, 1 it is interpreted as the
     number of clips to do.  But if it is between 0 and 1, then it is
     the tolerance level on the standard deviation (see *note Sigma
     clipping::).  The second popped operand (‘5’) is the multiple of
     sigma to use in sigma-clipping.  The third popped operand (‘10’) is
     number of datasets that will be used (similar to the first popped
     operand to ‘min’).

          astarithmetic a.fits b.fits c.fits 3 5 0.2 sigclip-number

‘sigclip-median’
     For each pixel, find the sigma-clipped median in all given
     datasets.  The output will have the a single-precision (32-bit)
     floating point type.  This operator is called similar to the
     ‘sigclip-number’ operator, please see there for more.  For example
          astarithmetic a.fits b.fits c.fits 3 5 0.2 sigclip-median

‘sigclip-mean’
     For each pixel, find the sigma-clipped mean in all given datasets.
     The output will have the a single-precision (32-bit) floating point
     type.  This operator is called similar to the ‘sigclip-number’
     operator, please see there for more.  For example
          astarithmetic a.fits b.fits c.fits 3 5 0.2 sigclip-mean

‘sigclip-std’
     For each pixel, find the sigma-clipped standard deviation in all
     given datasets.  The output will have the a single-precision
     (32-bit) floating point type.  This operator is called similar to
     the ‘sigclip-number’ operator, please see there for more.  For
     example
          astarithmetic a.fits b.fits c.fits 3 5 0.2 sigclip-std


File: gnuastro.info,  Node: Filtering operators,  Next: Interpolation operators,  Prev: Stacking operators,  Up: Arithmetic operators

6.2.2.6 Filtering (smoothing) operators
.......................................

Image filtering is commonly used for smoothing: every pixel value in the
output image is created by applying a certain statistic to the pixels in
its vicinity.

‘filter-mean’
     Apply mean filtering (or moving average
     (https://en.wikipedia.org/wiki/Moving_average)) on the input
     dataset.  During mean filtering, each pixel (data element) is
     replaced by the mean value of all its surrounding pixels (excluding
     blank values).  The number of surrounding pixels in each dimension
     (to calculate the mean) is determined through the earlier operands
     that have been pushed onto the stack prior to the input dataset.
     The number of necessary operands is determined by the dimensions of
     the input dataset (first popped operand).  The order of the
     dimensions on the command-line is the order in FITS format.  Here
     is one example:

          $ astarithmetic 5 4 image.fits filter-mean

     In this example, each pixel is replaced by the mean of a 5 by 4 box
     around it.  The box is 5 pixels along the first FITS dimension
     (horizontal when viewed in ds9) and 4 pixels along the second FITS
     dimension (vertical).

     Each pixel will be placed in the center of the box that the mean is
     calculated on.  If the given width along a dimension is even, then
     the center is assumed to be between the pixels (not in the center
     of a pixel).  When the pixel is close to the edge, the pixels of
     the box that fall outside the image are ignored.  Therefore, on the
     edge, less points will be used in calculating the mean.

     The final effect of mean filtering is to smooth the input image, it
     is essentially a convolution with a kernel that has identical
     values for all its pixels (is flat), see *note Convolution
     process::.

     Note that blank pixels will also be affected by this operator: if
     there are any non-blank elements in the box surrounding a blank
     pixel, in the filtered image, it will have the mean of the
     non-blank elements, therefore it won’t be blank any more.  If blank
     elements are important for your analysis, you can use the ‘isblank’
     with the ‘where’ operator to set them back to blank after
     filtering.

‘filter-median’
     Apply median filtering
     (https://en.wikipedia.org/wiki/Median_filter) on the input dataset.
     This is very similar to ‘filter-mean’, except that instead of the
     mean value of the box pixels, the median value is used to replace a
     pixel value.  For more on how to use this operator, please see
     ‘filter-mean’.

     The median is less susceptible to outliers compared to the mean.
     As a result, after median filtering, the pixel values will be more
     discontinuous than mean filtering.

‘filter-sigclip-mean’
     Apply a $\sigma$-clipped mean filtering onto the input dataset.
     This is very similar to ‘filter-mean’, except that all outliers
     (identified by the $\sigma$-clipping algorithm) have been removed,
     see *note Sigma clipping:: for more on the basics of this
     algorithm.  As described there, two extra input parameters are
     necessary for $\sigma$-clipping: the multiple of $\sigma$ and the
     termination criteria.  ‘filter-sigclip-mean’ therefore needs to pop
     two other operands from the stack after the dimensions of the box.

     For example the line below uses the same box size as the example of
     ‘filter-mean’.  However, all elements in the box that are
     iteratively beyond $3\sigma$ of the distribution’s median are
     removed from the final calculation of the mean until the change in
     $\sigma$ is less than $0.2$.

          $ astarithmetic 3 0.2 5 4 image.fits filter-sigclip-mean

     The median (which needs a sorted dataset) is necessary for
     $\sigma$-clipping, therefore ‘filter-sigclip-mean’ can be
     significantly slower than ‘filter-mean’.  However, if there are
     strong outliers in the dataset that you want to ignore (for example
     emission lines on a spectrum when finding the continuum), this is a
     much better solution.

‘filter-sigclip-median’
     Apply a $\sigma$-clipped median filtering onto the input dataset.
     This operator and its necessary operands are almost identical to
     ‘filter-sigclip-mean’, except that after $\sigma$-clipping, the
     median value (which is less affected by outliers than the mean) is
     added back to the stack.


File: gnuastro.info,  Node: Interpolation operators,  Next: Dimensionality changing operators,  Prev: Filtering operators,  Up: Arithmetic operators

6.2.2.7 Interpolation operators
...............................

Interpolation is the process of removing blank pixels from a dataset (by
giving them a value based on the non-blank neighbors).

‘interpolate-medianngb’
     Interpolate the blank elements of the second popped operand with
     the median of nearest non-blank neighbors to each.  The number of
     the nearest non-blank neighbors used to calculate the median is
     given by the first popped operand.

     The distance of the nearest non-blank neighbors is irrelevant in
     this interpolation.  The neighbors of each blank pixel will be
     parsed in expanding circular rings (for 2D images) or spherical
     surfaces (for 3D cube) and each non-blank element over them is
     stored in memory.  When the requested number of non-blank neighbors
     have been found, their median is used to replace that blank
     element.  For example the line below replaces each blank element
     with the the median of the nearest 5 pixels.

          $ astarithmetic image.fits 5 interpolate-medianngb

     When you want to interpolate blank regions and you want each blank
     region to have a fixed value (for example the centers of saturated
     stars) this operator is not good.  Because the pixels used to
     interpolate various parts of the region differ.  For such
     scenarios, you may use ‘interpolate-maxofregion’ or
     ‘interpolate-inofregion’ (described below).

‘interpolate-minngb’
     Similar to ‘interpolate-medianngb’, but will fill the blank values
     of the dataset with the minimum value of the nearest neighbors.

‘interpolate-maxngb’
     Similar to ‘interpolate-medianngb’, but will fill the blank values
     of the dataset with the maximum value of the nearest neighbors.
     One useful implementation of this operator is to fill the saturated
     pixels of stars in images.

‘interpolate-minofregion’
     Interpolate all blank regions (consisting of many blank pixels that
     are touching) in the second popped operand with the minimum value
     of the pixels that are immediately bordering that region (a single
     value).  The first popped operand is the connectivity (see
     description in ‘connected-components’).

     For example with the command below all the connected blank regions
     of ‘image.fits’ will be filled.  Its an image (2D dataset), so a 2
     connectivity means that the independent blank regions are defined
     by 8-connected neighbors.  If connectivity was 1, the regions would
     be defined by 4-connectivity: blank regions that may only be
     touching on the corner of one pixel would be identified as separate
     regions.

          $ astarithmetic image.fits 2 interpolate-minofregion

‘interpolate-maxofregion’
     Similar to ‘interpolate-minofregion’, but the maximum is used to
     fill the blank regions.

     This operator can be useful in filling saturated pixels in stars
     for example.  Recall that the ‘interpolate-maxngb’ operator looks
     for the maximum value with a given number of neighboring pixels and
     is more useful in small noisy regions.  Therefore as the blank
     regions become larger, ‘interpolate-maxngb’ can cause a
     fragmentation in the connected blank region because the nearest
     neighbor to one part of the blank region, may not fall within the
     pixels searched for the other regions.  With this option, the size
     of the blank region is irrelevant: all the pixels bordering the
     blank region are parsed and their maximum value is used for the
     whole region.


File: gnuastro.info,  Node: Dimensionality changing operators,  Next: Conditional operators,  Prev: Interpolation operators,  Up: Arithmetic operators

6.2.2.8 Dimensionality changing operators
.........................................

Through these operators you can change the dimensions of the output
through certain statistics on the dimensions that should be removed.
For example, let’s assume you have a 3D data cube that has 300 by 300
pixels in the RA and Dec dimensions (first two dimensions), and 3600
slices along the wavelength (third dimension), so the whole cube is
$300\times300\times3600$ voxels (volume elements).  To create a
narrow-band image that only contains 100 slices around a certain
wavelength, you can crop that section (using *note Crop::), giving you a
$300\times300\times100$ cube.  You can now use the ‘collapse-sum’
operator below to “collapse” all the 100 slices into one 2D image that
has $300\times300$ pixels.  Every pixel in this 2D image will have the
flux of the sum of the 100 slices.

‘collapse-sum’
     Collapse the given dataset (second popped operand), by summing all
     elements along the first popped operand (a dimension in FITS
     standard: counting from one, from fastest dimension).  The returned
     dataset has one dimension less compared to the input.

     The output will have a double-precision floating point type
     irrespective of the input dataset’s type.  Doing the operation in
     double-precision (64-bit) floating point will help the collapse
     (summation) be affected less by floating point errors.  But
     afterwards, single-precision floating points are usually enough in
     real (noisy) datasets.  So depending on the type of the input and
     its nature, it is recommended to use one of the type conversion
     operators on the returned dataset.

     If any WCS is present, the returned dataset will also lack the
     respective dimension in its WCS matrix.  Therefore, when the WCS is
     important for later processing, be sure that the input is aligned
     with the respective axes: all non-diagonal elements in the WCS
     matrix are zero.

     One common application of this operator is the creation of pseudo
     broad-band or narrow-band 2D images from 3D data cubes.  For
     example integral field unit (IFU) data products that have two
     spatial dimensions (first two FITS dimensions) and one spectral
     dimension (third FITS dimension).  The command below will collapse
     the whole third dimension into a 2D array the size of the first two
     dimensions, and then convert the output to single-precision
     floating point (as discussed above).

          $ astarithmetic cube.fits 3 collapse-sum float32

‘collapse-mean’
     Similar to ‘collapse-sum’, but the returned dataset will be the
     mean value along the collapsed dimension, not the sum.

‘collapse-number’
     Similar to ‘collapse-sum’, but the returned dataset will be the
     number of non-blank values along the collapsed dimension.  The
     output will have a 32-bit signed integer type.  If the input
     dataset doesn’t have blank values, all the elements in the returned
     dataset will have a single value (the length of the collapsed
     dimension).  Therefore this is mostly relevant when there are blank
     values in the dataset.

‘collapse-min’
     Similar to ‘collapse-sum’, but the returned dataset will have the
     same numeric type as the input and will contain the minimum value
     for each pixel along the collapsed dimension.

‘collapse-max’
     Similar to ‘collapse-sum’, but the returned dataset will have the
     same numeric type as the input and will contain the maximum value
     for each pixel along the collapsed dimension.

‘add-dimension’
     Build a higher-dimensional dataset from all the input datasets
     stacked after one another (along the slowest dimension).  The first
     popped operand has to be a single number.  It is used by the
     operator to know how many operands it should pop from the stack
     (and the size of the output in the new dimension).  The rest of the
     operands must have the same size and numerical data type.  This
     operator currently only works for 2D input operands, please contact
     us if you want inputs to have different dimensions.

     The output’s WCS (which should have a different dimensionality
     compared to the inputs) can be read from another file with the
     ‘--wcsfile’ option.  If no file is specified for the WCS, the first
     dataset’s WCS will be used, you can later add/change the necessary
     WCS keywords with the FITS keyword modification features of the
     Fits program (see *note Fits::).

     If your datasets don’t have the same type, you can use the type
     transformation operators of Arithmetic that are discussed below.
     Just beware of overflow if you are transforming to a smaller type,
     see *note Numeric data types::.

     For example if you want to put the three ‘img1.fits’, ‘img2.fits’
     and ‘img3.fits’ images (each a 2D dataset) into one 3D datacube,
     you can use this command:

          $ astarithmetic img1.fits img2.fits img3.fits 3 add-dimension


File: gnuastro.info,  Node: Conditional operators,  Next: Mathematical morphology operators,  Prev: Dimensionality changing operators,  Up: Arithmetic operators

6.2.2.9 Conditional operators
.............................

Conditional operators take two inputs and return a binary output that
can only have two values 0 (for pixels where the condition was false) or
1 (for the pixels where the condition was true).  Because of the binary
(2-valued) nature of their outputs, the output is therefore stored in an
‘unsigned char’ data type (see *note Numeric data types::) to speed up
process and take less space in your storage.  There are two exceptions
to the general features above: ‘isblank’ only takes one input, and
‘where’ takes three, while not returning a binary output, see their
description for more.

‘lt’
     Less than: creates a binary output (values either 0 or 1) where
     each pixel will be 1 if the second popped operand is smaller than
     the first popped operand and 0 otherwise.  If both operands are
     images, then all the pixels will be compared with their
     counterparts in the other image.

     For example, the pixels in the output of the command below will
     have a value of 1 (true) if their value in ‘image1.fits’ is less
     than their value in ‘image2.fits’.  Otherwise, their value will be
     0 (false).
          $ astarithmetic image1.fits image2.fits lt
     If only one operand is an image, then all the pixels will be
     compared with the single value (number) of the other operand.  For
     example:
          $ astaithmetic image1.fits 1000 lt
     Finally if both are numbers, then the output is also just one
     number (0 or 1).
          $ astarithmetic 4 5 lt

‘le’
     Less or equal: similar to ‘lt’ (‘less than’ operator), but
     returning 1 when the second popped operand is smaller or equal to
     the first.  For example
          $ astaithmetic image1.fits 1000 le

‘gt’
     Greater than: similar to ‘lt’ (‘less than’ operator), but returning
     1 when the second popped operand is greater than the first.  For
     example
          $ astaithmetic image1.fits 1000 gt

‘ge’
     Greater or equal: similar to ‘lt’ (‘less than’ operator), but
     returning 1 when the second popped operand is larger or equal to
     the first.  For example
          $ astaithmetic image1.fits 1000 ge

‘eq’
     Equality: similar to ‘lt’ (‘less than’ operator), but returning 1
     when the two popped operands are equal (to double precision
     floating point accuracy).
          $ astaithmetic image1.fits 1000 eq

‘ne’
     Non-Equality: similar to ‘lt’ (‘less than’ operator), but returning
     1 when the two popped operands are _not_ equal (to double precision
     floating point accuracy).
          $ astaithmetic image1.fits 1000 ne

‘and’
     Logical AND: returns 1 if both operands have a non-zero value and 0
     if both are zero.  Both operands have to be the same kind: either
     both images or both numbers and it mostly makes meaningful values
     when the inputs are binary (with pixel values of 0 or 1).
          $ astarithmetic image1.fits image2.fits -g1 and

     For example if you only want to see which pixels in an image have a
     value _between_ 50 (greater equal, or inclusive) and 200 (less
     than, or exclusive), you can use this command:
          $ astarithmetic image.fits set-i i 50 ge i 200 lt and

‘or’
     Logical OR: returns 1 if either one of the operands is non-zero and
     0 only when both operators are zero.  Both operands have to be the
     same kind: either both images or both numbers.  The usage is
     similar to ‘and’.

     For example if you only want to see which pixels in an image have a
     value _outside of_ -100 (greater equal, or inclusive) and 200 (less
     than, or exclusive), you can use this command:
          $ astarithmetic image.fits set-i i -100 lt i 200 ge or

‘not’
     Logical NOT: returns 1 when the operand is 0 and 0 when the operand
     is non-zero.  The operand can be an image or number, for an image,
     it is applied to each pixel separately.  For example if you want to
     know which pixels are not blank, you can use not on the output of
     the ‘isblank’ operator described below:
          $ astarithmetic image.fits isblank not

‘isblank’
     Test for a blank value (see *note Blank pixels::).  In essence,
     this is very similar to the conditional operators: the output is
     either 1 or 0 (see the ‘less than’ operator above).  The difference
     is that it only needs one operand.  For example:
          $ astarithmetic image.fits isblank
     Because of the definition of a blank pixel, a blank value is not
     even equal to itself, so you cannot use the equal operator above to
     select blank pixels.  See the “Blank pixels” box below for more on
     Blank pixels in Arithmetic.

‘where’
     Change the input (pixel) value _where_/if a certain condition
     holds.  The conditional operators above can be used to define the
     condition.  Three operands are required for ‘where’.  The input
     format is demonstrated in this simplified example:

          $ astarithmetic modify.fits binary.fits if-true.fits where

     The value of any pixel in ‘modify.fits’ that corresponds to a
     non-zero _and_ non-blank pixel of ‘binary.fits’ will be changed to
     the value of the same pixel in ‘if-true.fits’ (this may also be a
     number).  The 3rd and 2nd popped operands (‘modify.fits’ and
     ‘binary.fits’ respectively, see *note Reverse polish notation::)
     have to have the same dimensions/size.  ‘if-true.fits’ can be
     either a number, or have the same dimension/size as the other two.

     The 2nd popped operand (‘binary.fits’) has to have ‘uint8’ (or
     ‘unsigned char’ in standard C) type (see *note Numeric data
     types::).  It is treated as a binary dataset (with only two values:
     zero and non-zero, hence the name ‘binary.fits’ in this example).
     However, commonly you won’t be dealing with an actual FITS file of
     a condition/binary image.  You will probably define the condition
     in the same run based on some other reference image and use the
     conditional and logical operators above to make a true/false (or
     one/zero) image for you internally.  For example the case below:

          $ astarithmetic in.fits reference.fits 100 gt new.fits where

     In the example above, any of the ‘in.fits’ pixels that has a value
     in ‘reference.fits’ greater than ‘100’, will be replaced with the
     corresponding pixel in ‘new.fits’.  Effectively the ‘reference.fits
     100 gt’ part created the condition/binary image which was added to
     the stack (in memory) and later used by ‘where’.  The command above
     is thus equivalent to these two commands:

          $ astarithmetic reference.fits 100 gt --output=binary.fits
          $ astarithmetic in.fits binary.fits new.fits where

     Finally, the input operands are read and used independently, so you
     can use the same file more than once as any of the operands.

     When the 1st popped operand to ‘where’ (‘if-true.fits’) is a single
     number, it may be a NaN value (or any blank value, depending on its
     type) like the example below (see *note Blank pixels::).  When the
     number is blank, it will be converted to the blank value of the
     type of the 3rd popped operand (‘in.fits’).  Hence, in the example
     below, all the pixels in ‘reference.fits’ that have a value greater
     than 100, will become blank in the natural data type of ‘in.fits’
     (even though NaN values are only defined for floating point types).

          $ astarithmetic in.fits reference.fits 100 gt nan where


File: gnuastro.info,  Node: Mathematical morphology operators,  Next: Bitwise operators,  Prev: Conditional operators,  Up: Arithmetic operators

6.2.2.10 Mathematical morphology operators
..........................................

From Wikipedia: “Mathematical morphology (MM) is a theory and technique
for the analysis and processing of geometrical structures, based on set
theory, lattice theory, topology, and random functions.  MM is most
commonly applied to digital images”.  In theory it extends a very large
body of research and methods in image processing, but currently in
Gnuastro it mainly applies to images that are binary (only have a value
of 0 or 1).  For example you have applied the greater-than operator
(‘gt’, see *note Conditional operators::) to select all pixels in your
image that are larger than a value of 100.  But they will all have a
value of 1, and you want to separate the various groups of pixels that
are connected (for example peaks of stars in your image).  With the
‘connected-components’ operator, you can give each connected region of
the output of ‘gt’ a separate integer label.

‘erode’
     Erode the foreground pixels (with value ‘1’) of the input dataset
     (second popped operand).  The first popped operand is the
     connectivity (see description in ‘connected-components’).  Erosion
     is simply a flipping of all foreground pixels (with value ‘1’) to
     background (with value ‘0’) that are “touching” background pixels.
     “Touching” is defined by the connectivity.

     In effect, this operator “carves off” the outer borders of the
     foreground, making them thinner.  This operator assumes a binary
     dataset (all pixels are ‘0’ or ‘1’).  For example, imagine that you
     have an astronomical image with a mean/sky value of 0 units and a
     standard deviation ($\sigma$) of 100 units and many galaxies in it.
     With the first command below, you can apply a threshold of
     $2\sigma$ on the image (by only keeping pixels that are greater
     than 200 using the ‘gt’ operator).  The output of thresholding the
     image is a binary image (each pixel is either smaller or equal to
     the threshold or larger than it).  You can then erode the binary
     image with the second command below to remove very small false
     positives (one or two pixel peaks).
          $ astarithmetic image.fits 100 gt -obinary.fits
          $ astarithmetic binary.fits 2 erode -oout.fits

     In fact, you can merge these operations into one command thanks to
     the reverse polish notation (see *note Reverse polish notation::):
          $ astarithmetic image.fits 100 gt 2 erode -oout.fits

     To see the effect of connectivity, try this:
          $ astarithmetic image.fits 100 gt 1 erode -oout-con-1.fits

‘dilate’
     Dilate the foreground pixels (with value ‘1’) of the binary input
     dataset (second popped operand).  The first popped operand is the
     connectivity (see description in ‘connected-components’).  Dilation
     is simply a flipping of all background pixels (with value ‘0’) to
     foreground (with value ‘1’) that are “touching” foreground pixels.
     “Touching” is defined by the connectivity.  In effect, this expands
     the outer borders of the foreground.  This operator assumes a
     binary dataset (all pixels are ‘0’ and ‘1’).  The usage is similar
     to ‘erode’, for example:
          $ astarithmetic binary.fits 2 dilate -oout.fits

‘connected-components’
     Find the connected components in the input dataset (second popped
     operand).  The first popped is the connectivity used in the
     connected components algorithm.  The second popped operand is the
     dataset where connected components are to be found.  It is assumed
     to be a binary image (with values of 0 or 1).  It must have an
     8-bit unsigned integer type which is the format produced by
     conditional operators.  This operator will return a labeled dataset
     where the non-zero pixels in the input will be labeled with a
     counter (starting from 1).

     The connectivity is a number between 1 and the number of dimensions
     in the dataset (inclusive).  1 corresponds to the weakest
     (symmetric) connectivity between elements and the number of
     dimensions the strongest.  For example on a 2D image, a
     connectivity of 1 corresponds to 4-connected neighbors and 2
     corresponds to 8-connected neighbors.

     One example usage of this operator can be the identification of
     regions above a certain threshold, as in the command below.  With
     this command, Arithmetic will first separate all pixels greater
     than 100 into a binary image (where pixels with a value of 1 are
     above that value).  Afterwards, it will label all those that are
     connected.

          $ astarithmetic in.fits 100 gt 2 connected-components

     If your input dataset doesn’t have a binary type, but you know all
     its values are 0 or 1, you can use the ‘uint8’ operator (below) to
     convert it to binary.

‘fill-holes’
     Flip background (0) pixels surrounded by foreground (1) in a binary
     dataset.  This operator takes two operands (similar to
     ‘connected-components’): the second is the binary (0 or 1 valued)
     dataset to fill holes in and the first popped operand is the
     connectivity (to define a hole).  Imagine that in your dataset
     there are some holes with zero value inside the objects with one
     value (for example the output of the thresholding example of
     ‘erode’) and you want to fill the holes:
          $ astarithmetic binary.fits 2 fill-holes

‘invert’
     Invert an unsigned integer dataset (won’t work on other data types,
     see *note Numeric data types::).  This is the only operator that
     ignores blank values (which are set to be the maximum values in the
     unsigned integer types).

     This is useful in cases where the target(s) has(have) been imaged
     in absorption as raw formats (which are unsigned integer types).
     With this option, the maximum value for the given type will be
     subtracted from each pixel value, thus “inverting” the image, so
     the target(s) can be treated as emission.  This can be useful when
     the higher-level analysis methods/tools only work on emission
     (positive skew in the noise, not negative).
          $ astarithmetic image.fits invert


File: gnuastro.info,  Node: Bitwise operators,  Next: Numerical type conversion operators,  Prev: Mathematical morphology operators,  Up: Arithmetic operators

6.2.2.11 Bitwise operators
..........................

Astronomical images are usually stored as an array multi-byte pixels
with different sizes for different precision levels (see *note Numeric
data types::).  For example images from CCDs are usually in the unsigned
16-bit integer type (each pixel takes 16 bits, or 2 bytes, of memory)
and fully reduced deep images have a 32-bit floating point type (each
pixel takes 32 bits or 4 bytes).

   On the other hand, during the data reduction, we need to preserve a
lot of meta-data about some pixels.  For example, if a cosmic ray had
hit the pixel during the exposure, or if the pixel was saturated, or is
known to have a problem, or if the optical vignetting is too strong on
it, and etc.  A crude solution is to make a new image when checking for
each one of these things and make a binary image where we flag (set to
1) pixels that satisfy any of these conditions above, and set the rest
to zero.  However, processing pipelines sometimes need more than 20
flags to store important per-pixel meta-data, and recall that the
smallest numeric data type is one byte (or 8 bits, that can store up to
256 different values), while we only need two values for each flag!
This is a major waste of storage space!

   A much more optimal solution is to use the bits within each pixel to
store different flags!  In other words, if you have an 8-bit pixel, use
each bit as as a flag to mark if a certain condition has happened on a
certain pixel or not.  For example, let’s set the following standard
based on the four cases mentioned above: the first bit will show that a
cosmic ray has hit that pixel.  So if a pixel is only affected by cosmic
rays, it will have this sequence of bits (note that the bit-counting
starts from the right): ‘00000001’.  The second bit shows that the pixel
was saturated (‘00000010’), the third bit shows that it has known
problems (‘00000100’) and the fourth bit shows that it was affected by
vignetting (‘00001000’).

   Since each bit is independent, we can thus mark multiple metadata
about that pixel in the actual image, within a single “flag” or “mask”
pixel of a flag or mask image that has the same number of pixels.  For
example a flag-pixel with the following bits ‘00001001’ shows that it
has been affected by cosmic rays _and_ it has been affected by
vignetting at the same time.  The common data type to store these
flagging pixels are unsigned integer types (see *note Numeric data
types::).  Therefore when you open an unsigned 8-bit flag image in a
viewer like DS9, you will see a single integer in each pixel that
actually has 8 layers of metadata in it!  For example the integer you
will see for the bit sequences given above will respectively be: $2^0=1$
(for a pixel that only has cosmic ray), $2^1=2$ (for a pixel that was
only saturated), $2^2=4$ (for a pixel that only has known problems),
$2^3=8$ (for a pixel that is only affected by vignetting) and $2^0 + 2^3
= 9$ (for a pixel that has a cosmic ray _and_ was affected by
vignetting).

   You can later use this bit information to mark objects in your final
analysis or to mask certain pixels.  For example you may want to set all
pixels affected by vignetting to NaN, but can interpolate over cosmic
rays.  You therefore need ways to separate the pixels with a desired
flag(s) from the rest.  It is possible to treat a flag pixel as a single
integer (and try to define certain ranges in value to select certain
flags).  But a much more easier and robust way is to actually look at
each pixel as a sequence of bits (not as a single integer!)  and use the
bitwise operators below for this job.  For more on the theory behind
bitwise operators, see Wikipedia
(https://en.wikipedia.org/wiki/Bitwise_operation).

‘bitand’
     Bitwise AND operator: only bits with values of 1 in both popped
     operands will get the value of 1, the rest will be set to 0.  For
     example (assuming numbers can be written as bit strings on the
     command-line): ‘00101000 00100010 bitand’ will give ‘00100000’.
     Note that the bitwise operators only work on integer type datasets.

‘bitor’
     Bitwise inclusive OR operator: The bits where at least one of the
     two popped operands has a 1 value get a value of 1, the others 0.
     For example (assuming numbers can be written as bit strings on the
     command-line): ‘00101000 00100010 bitand’ will give ‘00101010’.
     Note that the bitwise operators only work on integer type datasets.

‘bitxor’
     Bitwise exclusive OR operator: A bit will be 1 if it differs
     between the two popped operands.  For example (assuming numbers can
     be written as bit strings on the command-line): ‘00101000 00100010
     bitand’ will give ‘00001010’.  Note that the bitwise operators only
     work on integer type datasets.

‘lshift’
     Bitwise left shift operator: shift all the bits of the first
     operand to the left by a number of times given by the second
     operand.  For example (assuming numbers can be written as bit
     strings on the command-line): ‘00101000 2 lshift’ will give
     ‘10100000’.  This is equivalent to multiplication by 4.  Note that
     the bitwise operators only work on integer type datasets.

‘rshift’
     Bitwise right shift operator: shift all the bits of the first
     operand to the right by a number of times given by the second
     operand.  For example (assuming numbers can be written as bit
     strings on the command-line): ‘00101000 2 rshift’ will give
     ‘00001010’.  Note that the bitwise operators only work on integer
     type datasets.

‘bitnot’
     Bitwise not (more formally known as one’s complement) operator:
     flip all the bits of the popped operand (note that this is the only
     unary, or single operand, bitwise operator).  In other words, any
     bit with a value of ‘0’ is changed to ‘1’ and vice-versa.  For
     example (assuming numbers can be written as bit strings on the
     command-line): ‘00101000 bitnot’ will give ‘11010111’.  Note that
     the bitwise operators only work on integer type datasets/numbers.


File: gnuastro.info,  Node: Numerical type conversion operators,  Next: Adding noise operators,  Prev: Bitwise operators,  Up: Arithmetic operators

6.2.2.12 Numerical type conversion operators
............................................

With the operators below you can convert the numerical data type of your
input, see *note Numeric data types::.  For example, let’s assume that
your colleague gives you thousands of single exposure images for
archival, but they have a double-precision floating point type!  You
know that the statistical error a single-exposure image can never exceed
6 or 7 significant digits, so you would prefer to archive them as a
single-precision floating point and save space on your computer (a
double-precision floating point is also double the file size!).  You can
do this with the ‘float32’ operator described below.

‘uint8’
     Convert the type of the popped operand to 8-bit unsigned integer
     type (see *note Numeric data types::).  The internal conversion of
     C will be used.

‘int8’
     Convert the type of the popped operand to 8-bit signed integer type
     (see *note Numeric data types::).  The internal conversion of C
     will be used.

‘uint16’
     Convert the type of the popped operand to 16-bit unsigned integer
     type (see *note Numeric data types::).  The internal conversion of
     C will be used.

‘int16’
     Convert the type of the popped operand to 16-bit signed integer
     (see *note Numeric data types::).  The internal conversion of C
     will be used.

‘uint32’
     Convert the type of the popped operand to 32-bit unsigned integer
     type (see *note Numeric data types::).  The internal conversion of
     C will be used.

‘int32’
     Convert the type of the popped operand to 32-bit signed integer
     type (see *note Numeric data types::).  The internal conversion of
     C will be used.

‘uint64’
     Convert the type of the popped operand to 64-bit unsigned integer
     (see *note Numeric data types::).  The internal conversion of C
     will be used.

‘float32’
     Convert the type of the popped operand to 32-bit (single precision)
     floating point (see *note Numeric data types::).  The internal
     conversion of C will be used.

‘float64’
     Convert the type of the popped operand to 64-bit (double precision)
     floating point (see *note Numeric data types::).  The internal
     conversion of C will be used.


File: gnuastro.info,  Node: Adding noise operators,  Next: Elliptical shape operators,  Prev: Numerical type conversion operators,  Up: Arithmetic operators

6.2.2.13 Adding noise operators
...............................

When you simulate data (for example see *note Sufi simulates a
detection::), everything is ideal and there is no noise!  The final step
of the process is to add simulated noise to the data.  The operators in
this section are designed for that purpose.

‘mknoise-sigma’
     Add a fixed noise (Gaussian standard deviation) to each element of
     the input dataset.  This operator takes two arguments: the
     top/first popped operand is the noise standard deviation, the next
     popped operand is the dataset that the noise should be added to.

     When ‘--quiet’ isn’t given, a statement will be printed on each
     invocation of this operator (if there are multiple calls to the
     ‘mknoise-*’, the statement will be printed multiple times).  It
     will show the random number generator function and seed that was
     used in that invocation, see *note Generating random numbers::.
     Reproducibility of the outputs can be ensured with the ‘--envseed’
     option, see below for more.

     For example with the first command below, ‘image.fits’ will be
     degraded by a noise of standard deviation 3 units.
          $ astarithmetic image.fits 3 mknoise-sigma

     Alternatively, you can use this operator within column arithmetic
     in the Table program, to generate a random number like below
     (centered on 0, with $\sigma=3$) like the first command below.
     With the second command, you can put it into a shell variable for
     later usage.

          $ echo 0 | asttable -c'arith $1 3 mknoise-sigma'
          $ value=$(echo 0 | asttable -c'arith $1 3 mknoise-sigma')
          $ echo $value

     You can also use this operator in combination with AWK to easily
     generate an arbitrarily large table with random columns.  In the
     example below, we’ll create a two column table with 20 rows.  The
     first column will be centered on 5 and $\sigma_1=2$, the second
     will be centered on 10 and $\sigma_2=3$:

          $ echo 5 10 \
                 | awk '{for(i=0;i<20;++i) print $1, $2}' \
                 | asttable -c'arith $1 2 mknoise-sigma' \
                            -c'arith $2 3 mknoise-sigma'

     By adding an extra ‘--output=random.fits’, the table will be saved
     into a file called ‘random.fits’, and you can change the ‘i<20’ to
     ‘i<5000’ to have 5000 rows instead.  Of course, if your input table
     has different values in the desired column the noisy distribution
     will be centered on each input element, but all will have the same
     scatter/sigma.

     You can use the ‘--envseed’ option to fix the random number
     generator seed (and thus get a reproducible result).  For more on
     ‘--envseed’, see *note Generating random numbers::.  When using
     column arithmetic in Table, it may happen that multiple columns
     need random numbers (with any of the ‘mknoise-*’ operators) in one
     call of ‘asttable’.  In such cases, the value given to
     ‘GSL_RNG_SEED’ is incremented by one on every call to the
     ‘mknoise-*’ operators.  Without this increment, when the column
     values are the same (happens a lot, for no-noised datasets), the
     returned values for all columns will be identical.  But this
     feature has a side-effect: that if the order of calling the
     ‘mknoise-*’ operators changes, the seeds used for each operator
     will change(1).

‘mknoise-poisson’
     Add Poisson noise to each element of the input dataset (see *note
     Photon counting noise::).  This operator takes two arguments: the
     top/first popped operand is the background value (in units of
     electron counts), the next popped operand is the dataset that the
     noise should be added to.

     Except for the noise-model, this operator is very similar to
     ‘mknoise-sigma’ and the examples there apply here too.  The main
     difference with ‘mknoise-sigma’ is that in a Poisson distribution
     the scatter/sigma will depend on each element’s value.

     For example, let’s assume you have made a mock image called
     ‘mock.fits’ with *note MakeProfiles:: and its assumed zeropoint is
     22.5 (for more on the zero point, see *note Brightness flux
     magnitude::).  Let’s assume the background level for the Poisson
     noise has a value of 19 magnitudes.  You can first use the
     ‘mag-to-counts’ operator to convert this background magnitude into
     counts, then feed the background value in counts to
     ‘mknoise-poisson’ operator:

          $ astarithmetic mock.fits 19 22.5 mag-to-counts \
                          mknoise-poisson

     Try changing the background value from 19 to 10 to see the effect!
     Recall that the tutorial *note Sufi simulates a detection:: shows
     how you can use MakeProfiles to build mock images.

‘mknoise-uniform’
     Add uniform noise to each element of the input dataset.  This
     operator takes two arguments: the top/first popped operand is the
     width of the interval, the second popped operand is the dataset
     that the noise should be added to (each element will be the center
     of the interval).  The returned random values may happen to be the
     minimum interval value, but will never be the maximum.  Except for
     the noise-model, this operator behaves very similar to
     ‘mknoise-sigma’, see the explanation there for more.

     For example with the command below, a random value will be selected
     between 10 to 14 (centered on 12, which is the only input data
     element, with a total width of 4).

          echo 12 | asttable -c'arith $1 4 mknoise-uniform'

     Similar to the example in ‘mknoise-sigma’, you can pipe the output
     of ‘echo’ to ‘awk’ before passing it to ‘asttable’ to generate a
     full column of uniformly selected values within the same interval.

   ---------- Footnotes ----------

   (1) We have defined Task 15971 (https://savannah.gnu.org/task/?15971)
in Gnuastro’s project management system to address this.  If you need
this feature please send us an email at ‘bug-gnuastro@gnu.org’ (to
motivate us in its implementation).


File: gnuastro.info,  Node: Elliptical shape operators,  Next: Building new dataset,  Prev: Adding noise operators,  Up: Arithmetic operators

6.2.2.14 Elliptical shape operators
...................................

The operators here describe certain functions that will be necessary
when dealing with objects that have a certain elliptical shape.

‘box-around-ellipse’
     Return the width (along horizontal) and height (along vertical) of
     a box that encompasses an ellipse with the same center point.  The
     top-popped operand is assumed to be the position angle (angle from
     the horizontal axis) in _degrees_.  The second and third popped
     operands are the minor and major axis lengths respectively.  This
     operator outputs two operands on the general stack.  The first one
     is the width and the second (which will be the top one when this
     operator finishes) is the height.

     If the value to the second popped operand (minor axis) is larger
     than the third (major axis), a NaN value will be written for both
     the width and height of that element and a warning will be printed
     (the warning can be disabled with the ‘--quiet’ option).

     As an example, if your ellipse has a major axis length of 10 units,
     a minor axis length of 4 units and a position angle of 20 degrees,
     you can estimate the bounding box with this command:

          $ echo "10 4 20" \
                 | asttable -c'arith $1 $2 $3 box-around-ellipse'

     Alternatively if your three values are in separate FITS
     arrays/images, you can use the command below to have the width and
     height in similarly sized fits arrays.  In this example ‘a.fits’
     and ‘b.fits’ are respectively the major and minor axis lengths and
     ‘pa.fits’ is the position angle (in degrees).  Also, in all three,
     we assume the first extension is used.  After its done, the height
     of the box will be put in ‘h.fits’ and the width will be in
     ‘w.fits’.  Just note that because this operator has two output
     datasets, you need to first write the height (top output operand)
     into a file and free it with the ‘tofilefree-’ operator, then write
     the width in the file given to ‘--output’.

          $ astarithmetic a.fits b.fits pa.fits box-around-ellipse \
                          tofilefree-h.fits -ow.fits -g1

     Finally, if you need to treat the width and height separately for
     further processing, you can call the ‘set-’ operator two times
     afterwards like below.  Recall that the ‘set-’ operator will pop
     the top operand, and put it in memory with a certain name, bringing
     the next operand to the top of the stack.

     For example let’s assume ‘catalog.fits’ has at least three columns
     ‘MAJOR’, ‘MINOR’ and ‘PA’ which specify the major axis, minor axis
     and position angle respectively.  But you want the final width and
     height in 32-bit floating point numbers (not the default 64-bit,
     which may be too much precision in many scenarios).  You can do
     this with the command below (note you can also break lines with
     <\>, within the single-quote environment)

          $ asttable catalog.fits \
                     -c'arith MAJOR MINOR PA box-around-ellipse \
                              set-height set-width \
                              width float32 height float32'


File: gnuastro.info,  Node: Building new dataset,  Next: Operand storage in memory or a file,  Prev: Elliptical shape operators,  Up: Arithmetic operators

6.2.2.15 Building new dataset
.............................

With the operator here, you can create a new dataset from scratch to
start certain operations without any input data.

‘makenew’
     Create a new dataset that only has zero values.  The number of
     dimensions is read as the first popped operand and the number of
     elements along each dimension are the next popped operand (in
     reverse of the popping order).  The type of the new dataset is an
     unsigned 8-bit integer and all pixel values have a value of zero.
     For example, if you want to create a new 100 by 200 pixel image,
     you can run this command:

          $ astarithmetic 100 200 2 makenew

     To further extend the example, you can use any of the noise-making
     operators to add noise to this new dataset (see *note Adding noise
     operators::), like the command below:

          $ astarithmetic 100 200 2 makenew 5 mknoise-sigma


File: gnuastro.info,  Node: Operand storage in memory or a file,  Prev: Building new dataset,  Up: Arithmetic operators

6.2.2.16 Operand storage in memory or a file
............................................

In your early days of using Gnuastro, to do multiple operations, it is
likely that you will simply call Arithmetic (or Table, with column
arithmetic) multiple times: feed the output file of the first call to
the second call.  But as you get more proficient in the reverse polish
notation, you will find yourself combining many operations into one
call.  This greatly speeds up your operation, because instead of writing
the dataset to a file in one command, and reading it in the next
command, it will just keep the intermediate dataset in memory!

   But adding more complexity to your operations, can make them much
harder to debug, or extend even further.  Therefore in this section we
have some special operators that behave differently from the rest: they
don’t touch the contents of the data, only where/how they are stored.
They are designed to do complex operations, without necessarily having a
complex command.

‘set-AAA’
     Set the characters after the dash (‘AAA’ in the case shown here) as
     a name for the first popped operand on the stack.  The named
     dataset will be freed from memory as soon as it is no longer
     needed, or if the name is reset to refer to another dataset later
     in the command.  This operator thus enables re-usability of a
     dataset without having to re-read it from a file every time it is
     necessary during a process.  When a dataset is necessary more than
     once, this operator can thus help simplify reading/writing on the
     command-line (thus avoiding potential bugs), while also speeding up
     the processing.

     Like all operators, this operator pops the top operand off of the
     main processing stack, but unlike other operands, it won’t add
     anything back to the stack immediately.  It will keep the popped
     dataset in memory through a separate list of named datasets (not on
     the main stack).  That list will be used to add/copy any requested
     dataset to the main processing stack when the name is called.

     The name to give the popped dataset is part of the operator’s name.
     For example the ‘set-a’ operator of the command below, gives the
     name “‘a’” to the contents of ‘image.fits’.  This name is then used
     instead of the actual filename to multiply the dataset by two.

          $ astarithmetic image.fits set-a a 2 x

     The name can be any string, but avoid strings ending with standard
     filename suffixes (for example ‘.fits’)(1).

     One example of the usefulness of this operator is in the ‘where’
     operator.  For example, let’s assume you want to mask all pixels
     larger than ‘5’ in ‘image.fits’ (extension number 1) with a NaN
     value.  Without setting a name for the dataset, you have to read
     the file two times from memory in a command like this:

          $ astarithmetic image.fits image.fits 5 gt nan where -g1

     But with this operator you can simply give ‘image.fits’ the name
     ‘i’ and simplify the command above to the more readable one below
     (which greatly helps when the filename is long):

          $ astarithmetic image.fits set-i   i i 5 gt nan where

‘tofile-AAA’
     Write the top operand on the operands stack into a file called
     ‘AAA’ (can be any FITS file name) without changing the operands
     stack.  If you don’t need the dataset any more and would like to
     free it, see the ‘tofilefree’ operator below.

     By default, any file that is given to this operator is deleted
     before Arithmetic actually starts working on the input datasets.
     The deletion can be deactivated with the ‘--dontdelete’ option (as
     in all Gnuastro programs, see *note Input output options::).  If
     the same FITS file is given to this operator multiple times, it
     will contain multiple extensions (in the same order that it was
     called.

     For example the operator ‘tofile-check.fits’ will write the top
     operand to ‘check.fits’.  Since it doesn’t modify the operands
     stack, this operator is very convenient when you want to debug, or
     understanding, a string of operators and operands given to
     Arithmetic: simply put ‘tofile-AAA’ anywhere in the process to see
     what is happening behind the scenes without modifying the overall
     process.

‘tofilefree-AAA’
     Similar to the ‘tofile’ operator, with the only difference that the
     dataset that is written to a file is popped from the operand stack
     and freed from memory (cannot be used any more).

   ---------- Footnotes ----------

   (1) A dataset name like ‘a.fits’ (which can be set with ‘set-a.fits’)
will cause confusion in the initial parser of Arithmetic.  It will
assume this name is a FITS file, and if it is used multiple times,
Arithmetic will abort, complaining that you haven’t provided enough
HDUs.


File: gnuastro.info,  Node: Invoking astarithmetic,  Prev: Arithmetic operators,  Up: Arithmetic

6.2.3 Invoking Arithmetic
-------------------------

Arithmetic will do pixel to pixel arithmetic operations on the
individual pixels of input data and/or numbers.  For the full list of
operators with explanations, please see *note Arithmetic operators::.
Any operand that only has a single element (number, or single pixel FITS
image) will be read as a number, the rest of the inputs must have the
same dimensions.  The general template is:

     $ astarithmetic [OPTION...] ASTRdata1 [ASTRdata2] OPERATOR ...

One line examples:

     ## Calculate (10.32-3.84)^2.7 quietly (will just print 155.329):
     $ astarithmetic -q 10.32 3.84 - 2.7 pow

     ## Inverse the input image (1/pixel):
     $ astarithmetic 1 image.fits / --out=inverse.fits

     ## Multiply each pixel in image by -1:
     $ astarithmetic image.fits -1 x --out=negative.fits

     ## Subtract extension 4 from extension 1 (counting from zero):
     $ astarithmetic image.fits image.fits - --out=skysub.fits           \
                     --hdu=1 --hdu=4

     ## Add two images, then divide them by 2 (2 is read as floating point):
     ## Note that without the '.0', the '2' will be read/used as an integer.
     $ astarithmetic image1.fits image2.fits + 2.0 / --out=average.fits

     ## Use Arithmetic's average operator:
     $ astarithmetic image1.fits image2.fits average --out=average.fits

     ## Calculate the median of three images in three separate extensions:
     $ astarithmetic img1.fits img2.fits img3.fits median                \
                     -h0 -h1 -h2 --out=median.fits

   Arithmetic’s notation for giving operands to operators is fully
described in *note Reverse polish notation::.  The output dataset is
last remaining operand on the stack.  When the output dataset a single
number, it will be printed on the command-line.  When the output is an
array, it will be stored as a file.

   The name of the final file can be specified with the ‘--output’
option, but if its not given, Arithmetic will use “automatic output” on
the name of the first FITS image encountered to generate an output file
name, see *note Automatic output::.  By default, if the output file
already exists, it will be deleted before Arithmetic starts operation.
However, this can be disabled with the ‘--dontdelete’ option (see
below).  At any point during Arithmetic’s operation, you can also write
the top operand on the stack to a file, using the ‘tofile’ or
‘tofilefree’ operators, see *note Arithmetic operators::.

   By default, the world coordinate system (WCS) information of the
output dataset will be taken from the first input image (that contains a
WCS) on the command-line.  This can be modified with the ‘--wcsfile’ and
‘--wcshdu’ options described below.  When the ‘--quiet’ option isn’t
given, the name and extension of the dataset used for the output’s WCS
is printed on the command-line.

   Through operators like those starting with ‘collapse-’, the
dimensionality of the inputs may not be the same as the outputs.  By
default, when the output is 1D, Arithmetic will write it as a table, not
an image/array.  The format of the output table (plain text or FITS
ASCII or binary) can be set with the ‘--tableformat’ option, see *note
Input output options::).  You can disable this feature (write 1D arrays
as FITS images/arrays, or to the standard output) with the
‘--onedasimage’ or ‘--onedonstdout’ options.

   See *note Common options:: for a review of the options in all
Gnuastro programs.  Arithmetic just redefines the ‘--hdu’ and
‘--dontdelete’ options as explained below.

‘-h INT/STR’
‘--hdu INT/STR’
     The header data unit of the input FITS images, see *note Input
     output options::.  Unlike most options in Gnuastro (which will
     ultimately only have one value for this option), Arithmetic allows
     ‘--hdu’ to be called multiple times and the value of each
     invocation will be stored separately (for the unlimited number of
     input images you would like to use).  Recall that for other
     programs this (common) option only takes a single value.  So in
     other programs, if you specify it multiple times on the
     command-line, only the last value will be used and in the
     configuration files, it will be ignored if it already has a value.

     The order of the values to ‘--hdu’ has to be in the same order as
     input FITS images.  Options are first read from the command-line
     (from left to right), then top-down in each configuration file, see
     *note Configuration file precedence::.

     If the number of HDUs is less than the number of input images,
     Arithmetic will abort and notify you.  However, if there are more
     HDUs than FITS images, there is no problem: they will be used in
     the given order (every time a FITS image comes up on the stack) and
     the extra HDUs will be ignored in the end.  So there is no problem
     with having extra HDUs in the configuration files and by default
     several HDUs with a value of ‘0’ are kept in the system-wide
     configuration file when you install Gnuastro.

‘-g INT/STR’
‘--globalhdu INT/STR’
     Use the value to this option as the HDU of all input FITS files.
     This option is very convenient when you have many input files and
     the dataset of interest is in the same HDU of all the files.  When
     this option is called, any values given to the ‘--hdu’ option
     (explained above) are ignored and will not be used.

‘-w FITS’
‘--wcsfile FITS’
     FITS Filename containing the WCS structure that must be written to
     the output.  The HDU/extension should be specified with ‘--wcshdu’.

     When this option is used, the respective WCS will be read before
     any processing is done on the command-line and directly used in the
     final output.  If the given file doesn’t have any WCS, then the
     default WCS (first file on the command-line with WCS) will be used
     in the output.

     This option will mostly be used when the default file (first of the
     set of inputs) is not the one containing your desired WCS. But with
     this option, you can also use Arithmetic to rewrite/change the WCS
     of an existing FITS dataset from another file:

          $ astarithmetic data.fits --wcsfile=other.fits -ofinal.fits

‘-W STR’
‘--wcshdu STR’
     HDU/extension to read the WCS within the file given to ‘--wcsfile’.
     For more, see the description of ‘--wcsfile’.

‘--envseed’
     Use the environment for the random number generator settings in
     operators that need them (for example ‘mknoise-sigma’).  This is
     very important for obtaining reproducible results, for more see
     *note Generating random numbers::.

‘-O’
‘--onedasimage’
     When final dataset to write as output only has one dimension, write
     it as a FITS image/array.  By default, if the output is 1D, it will
     be written as a table, see above.

‘-s’
‘--onedonstdout’
     When final dataset to write as output only has one dimension, print
     it on the standard output, not in a file.  By default, if the
     output is 1D, it will be written as a table, see above.

‘-D’
‘--dontdelete’
     Don’t delete the output file, or files given to the ‘tofile’ or
     ‘tofilefree’ operators, if they already exist.  Instead append the
     desired datasets to the extensions that already exist in the
     respective file.  Note it doesn’t matter if the final output file
     name is given with the ‘--output’ option, or determined
     automatically.

     Arithmetic treats this option differently from its default
     operation in other Gnuastro programs (see *note Input output
     options::).  If the output file exists, when other Gnuastro
     programs are called with ‘--dontdelete’, they simply complain and
     abort.  But when Arithmetic is called with ‘--dontdelete’, it will
     appended the dataset(s) to the existing extension(s) in the file.

   Arithmetic accepts two kinds of input: images and numbers.  Images
are considered to be any of the inputs that is a file name of a
recognized type (see *note Arguments::) and has more than one
element/pixel.  Numbers on the command-line will be read into the
smallest type (see *note Numeric data types::) that can store them, so
‘-2’ will be read as a ‘char’ type (which is signed on most systems and
can thus keep negative values), ‘2500’ will be read as an ‘unsigned
short’ (all positive numbers will be read as unsigned), while
‘3.1415926535897’ will be read as a ‘double’ and ‘3.14’ will be read as
a ‘float’.  To force a number to be read as float, put a ‘.’ after it
(possibly followed by a zero for easier readability), or add an ‘f’
after it.  Hence while ‘5’ will be read as an integer, ‘5.’, ‘5.0’ or
‘5f’ will be added to the stack as ‘float’ (see *note Reverse polish
notation::).

   Unless otherwise stated (in *note Arithmetic operators::), the
operators can deal with numeric multiple data types (see *note Numeric
data types::).  For example in “‘a.fits b.fits +’”, the image types can
be ‘long’ and ‘float’.  In such cases, C’s internal type conversion will
be used.  The output type will be set to the higher-ranking type of the
two inputs.  Unsigned integer types have smaller ranking than their
signed counterparts and floating point types have higher ranking than
the integer types.  So the internal C type conversions done in the
example above are equivalent to this piece of C:

     size_t i;
     long a[100];
     float b[100], out[100];
     for(i=0;i<100;++i) out[i]=a[i]+b[i];

Relying on the default C type conversion significantly speeds up the
processing and also requires less RAM (when using very large images).

   Some operators can only work on integer types (of any length, for
example bitwise operators) while others only work on floating point
types, (currently only the ‘pow’ operator).  In such cases, if the
operand type(s) are different, an error will be printed.  Arithmetic
also comes with internal type conversion operators which you can use to
convert the data into the appropriate type, see *note Arithmetic
operators::.

   The hyphen (‘-’) can be used both to specify options (see *note
Options::) and also to specify a negative number which might be
necessary in your arithmetic.  In order to enable you to do this,
Arithmetic will first parse all the input strings and if the first
character after a hyphen is a digit, then that hyphen is temporarily
replaced by the vertical tab character which is not commonly used.  The
arguments are then parsed and these strings will not be specified as an
option.  Then the given arguments are parsed and any vertical tabs are
replaced back with a hyphen so they can be read as negative numbers.
Therefore, as long as the names of the files you want to work on, don’t
start with a vertical tab followed by a digit, there is no problem.  An
important consequence of this implementation is that you should not
write negative fractions like this: ‘-.3’, instead write them as ‘-0.3’.

   Without any images, Arithmetic will act like a simple calculator and
print the resulting output number on the standard output like the first
example above.  If you really want such calculator operations on the
command-line, AWK (GNU AWK is the most common implementation) is much
faster, easier and much more powerful.  For example, the numerical
one-line example above can be done with the following command.  In
general AWK is a fantastic tool and GNU AWK has a wonderful manual
(<https://www.gnu.org/software/gawk/manual/>).  So if you often confront
situations like this, or have to work with large text tables/catalogs,
be sure to checkout AWK and simplify your life.

     $ echo "" | awk '{print (10.32-3.84)^2.7}'
     155.329