ADASS 2003 Conference Proceedings

PyFITS is a Python module developed for FITS file I/O. We demonstrated its use for FITS images and tables, illustrating the PyFITS classes and methods as well as the array manipulation capabilities of numarray. PyFITS is convenient for interactive use, and we also showed two utility programs, fitsdiff and readgeis, as examples of its use in astronomical applications. The task fitsdiff compares two FITS files and reports their differences; readgeis reads STScI-style GEIS format files and converts them to FITS files or FITS objects. These two Python modules were showcased not only because they are useful astronomical tools but also to demonstrate the ease of writing such applications with PyFITS and numarray. PyFITS can also make use of memory mapping, which significantly enhances its performance on large FITS files, both images and tables. At STScI, we are also applying numarray and PyFITS to larger projects such as pydrizzle and the pipeline software for the new HST instrument COS (Cosmic Origins Spectrograph).

1. Introduction

The Python package numarray is an efficient array handling tool that replaces the Numeric Python extension. In addition to the existing Numeric functionality for numeric arrays, arrays of character strings can be created and manipulated, and tables can be represented using record arrays: 1-D arrays of structures in which data types may be mixed within a row but are homogeneous within each column. The contents of a record array can be accessed by row, by column, or both at the same time. A column is a numarray (or numarray.strings) object whose attributes include the offset from one element to the next, which allows the column data to be accessed without copying from the table to a temporary array.
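The row and column access described above can be sketched as follows. This is a minimal illustration that assumes NumPy, the successor to numarray, rather than numarray itself; the field names and values are made up.

```python
import numpy as np

# Each row mixes an integer, a string and a float; each column has one type.
# Field names and values are invented for illustration.
table = np.array(
    [(1, b"a", 10.5), (2, b"b", 11.5), (3, b"c", 12.5)],
    dtype=[("id", "i2"), ("name", "S4"), ("flux", "f4")],
).view(np.recarray)

second_row = table[1]        # access by row
flux = table.flux            # access by column (table["flux"] also works)
one_cell = table[1].flux     # access by row and column together

# The column is a view into the table, not a copy: stepping from one
# element to the next skips a whole row (2 + 4 + 4 = 10 bytes here).
row_bytes = table.dtype.itemsize
column_step = flux.strides[0]

# Writing through the column therefore changes the table itself.
flux[0] = 9.5
```

Because the column carries its own element-to-element offset, no temporary array is needed to read or update it.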

PyFITS is a Python module for working with FITS files. For an image, the data block is a numarray object. For a table, the data block is a record array. Images and tables may be read, updated in-place, or they may be created from scratch. Header keywords may be read, modified, deleted, or inserted at any location.

Memory mapping is supported in PyFITS to improve performance for large files. FITS data are stored in big-endian byte order; if the native byte order of the machine differs, byte swapping is done on the fly within numarray when the data are accessed. This avoids the need for temporary storage space, and it can save time if only a portion of the data will be read or modified. Verification that objects adhere to the FITS standard can also be performed automatically or manually.
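The combination of memory mapping and on-the-fly byte swapping can be sketched as below. This assumes NumPy's memmap in place of the numarray machinery that PyFITS uses, and a bare file of big-endian pixels rather than a real FITS file; the file name is hypothetical.

```python
import os
import tempfile

import numpy as np

# Write 3 x 4 big-endian 16-bit pixels to a scratch file (hypothetical name).
path = os.path.join(tempfile.mkdtemp(), "pixels.dat")
np.arange(12, dtype=">i2").reshape(3, 4).tofile(path)

# Map the file instead of reading it. The ">i2" dtype records that the bytes
# on disk are big-endian, so values are swapped only when they are accessed.
pixels = np.memmap(path, dtype=">i2", mode="r+", shape=(3, 4))

row_sum = int(pixels[1].sum())   # touches only the second row: 4 + 5 + 6 + 7

pixels[2, 0] = 100               # update a single pixel in place
pixels.flush()                   # push the change back to the file
del pixels                       # release the mapping

with open(path, "rb") as fh:
    raw = fh.read()

# The pixel at row 2, column 0 starts at byte (2 * 4 + 0) * 2 = 16.
changed_bytes = raw[16:18]
```

Only the pages that are actually touched are read or written, which is where the saving for partial access comes from.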

If the data in the FITS file are scaled, i.e. BSCALE $\neq$ 1 or BZERO $\neq$ 0, PyFITS handles this transparently, so the user only needs to interact with the scaled objects. However, this usually requires extra memory for the scaled arrays, which may be a consideration for very large data files.
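The scaling follows the FITS convention physical = BZERO + BSCALE × stored. The sketch below applies it by hand with NumPy to show where the extra memory goes; the choice of float32 for the scaled array is an assumption made for illustration.

```python
import numpy as np

# Stored values: signed 16-bit integers as they would sit in the file.
stored = np.array([-32768, 0, 32767], dtype=">i2")

# A common convention for holding unsigned 16-bit data in a signed field.
bscale, bzero = 1.0, 32768.0

# physical = BZERO + BSCALE * stored, computed in a new float32 array.
physical = stored.astype(np.float32) * bscale + bzero

# The scaled copy is separate from the stored data and, here, twice as large.
stored_bytes = stored.nbytes
physical_bytes = physical.nbytes
```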

2. PyFITS Examples

To access an existing FITS file, we use the PyFITS open function, which returns a Python list-like object (an HDUList object) whose elements can only be FITS HDU (header-data unit) objects.
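To show what such a list holds, the following standard-library sketch splits a FITS byte stream into header-data units. It does not use PyFITS; the helper names (make_card, make_hdu, read_hdus) are hypothetical, and only the simplest card formats and image-type HDUs are handled.

```python
import struct

BLOCK = 2880   # FITS files are organised in 2880-byte blocks
CARD = 80      # each header card is 80 ASCII characters

def pad_to_block(n):
    """Round n up to a multiple of the FITS block size."""
    return -(-n // BLOCK) * BLOCK

def make_card(key, value):
    """Format one fixed-format card (logical, integer or short string)."""
    if isinstance(value, bool):
        field = ("T" if value else "F").rjust(20)
    elif isinstance(value, int):
        field = str(value).rjust(20)
    else:
        field = ("'%-8s'" % value).ljust(20)
    return ("%-8s= %s" % (key, field)).ljust(CARD)

def make_hdu(cards, data=b""):
    """Build one HDU: padded header followed by padded data."""
    header = "".join(make_card(k, v) for k, v in cards) + "END".ljust(CARD)
    header = header.ljust(pad_to_block(len(header)))
    return header.encode("ascii") + data.ljust(pad_to_block(len(data)), b"\0")

def parse_value(text):
    """Decode the value field of a card written by make_card."""
    text = text.strip()
    if text.startswith("'"):
        return text[1:text.index("'", 1)].rstrip()
    text = text.split("/")[0].strip()
    return text == "T" if text in ("T", "F") else int(text)

def read_hdus(raw):
    """Return a list of (header dict, data bytes) pairs, one per HDU."""
    hdus, pos = [], 0
    while pos < len(raw):
        header = {}
        while True:
            card = raw[pos:pos + CARD].decode("ascii")
            pos += CARD
            key = card[:8].strip()
            if key == "END":
                break
            if card[8:10] == "= ":
                header[key] = parse_value(card[10:])
        pos = pad_to_block(pos)                 # skip header padding
        size = 0
        if header["NAXIS"] > 0:
            size = abs(header["BITPIX"]) // 8
            for axis in range(1, header["NAXIS"] + 1):
                size *= header["NAXIS%d" % axis]
        hdus.append((header, raw[pos:pos + size]))
        pos = pad_to_block(pos + size)          # skip data padding
    return hdus

# A primary HDU with a 3 x 2 image and one image extension with 4 bytes.
primary = make_hdu(
    [("SIMPLE", True), ("BITPIX", 16), ("NAXIS", 2),
     ("NAXIS1", 3), ("NAXIS2", 2), ("EXTEND", True)],
    struct.pack(">6h", 1, 2, 3, 4, 5, 6),
)
extension = make_hdu(
    [("XTENSION", "IMAGE"), ("BITPIX", 8), ("NAXIS", 1),
     ("NAXIS1", 4), ("PCOUNT", 0), ("GCOUNT", 1)],
    b"\x01\x02\x03\x04",
)
hdus = read_hdus(primary + extension)
```

Each element of the resulting list pairs one header with its data block, which is the structure the HDUList object exposes through indexing.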

3. Some applications which use PyFITS and/or numarray

The two applications, readgeis and fitsdiff, are examples of applying numarray and PyFITS to do useful work. Both are written entirely in Python and can be run as shell commands or loaded as modules inside Python. The task fitsdiff compares two FITS files and generates customizable reports. Users can, for example, specify the relative difference level at which floating point numbers are flagged, and exclude selected keywords or columns from the comparison.
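The two kinds of comparison mentioned above, a relative tolerance on floating point data and a keyword exclusion list, can be sketched as follows. This is not the fitsdiff implementation: the function names, the definition of relative difference, and the sample values are all assumptions made for illustration, and NumPy stands in for numarray.

```python
import numpy as np

def differing_pixels(a, b, rtol):
    """Indices where |a - b| / max(|a|, |b|) exceeds rtol (assumed metric)."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    scale = np.maximum(np.abs(a), np.abs(b))
    rel = np.zeros_like(scale)
    # Where both values are zero the relative difference is left at zero.
    np.divide(np.abs(a - b), scale, out=rel, where=scale > 0)
    return np.flatnonzero(rel > rtol)

def differing_keywords(header1, header2, ignore=()):
    """Keywords whose values differ, skipping those listed in ignore."""
    keys = (set(header1) | set(header2)) - set(ignore)
    return sorted(k for k in keys if header1.get(k) != header2.get(k))

# Made-up data: only the last pixel differs by more than 0.5 percent.
flagged = differing_pixels(
    [1.0, 2.0, 0.0, 100.0], [1.0, 2.002, 0.0, 101.0], rtol=0.005
)

# Made-up headers held as plain dictionaries; DATE is excluded on request.
changed = differing_keywords(
    {"DATE": "x", "EXPTIME": 10.0, "FILTER": "V"},
    {"DATE": "y", "EXPTIME": 10.0, "FILTER": "B"},
    ignore=["DATE"],
)
```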

The task readgeis reads the GEIS file format used by some older HST instruments. It reads the contents of a GEIS file into a PyFITS HDUList structure: the header keywords are placed in the primary HDU, and the group parameters and data of each group are placed in an extension HDU. Users can then either write the result out as a FITS file or perform mathematical operations on the image data or the headers.

Demo of numarray, PyFITS, and related software
