Exporting Single Image Numeric Data

In this section, you will learn:

How to export CellProfiler data from a set of single images that were not obtained from a 96 well plate in numeric form.

This section pertains to data from a set of single images that were not obtained from a 96 well plate such as images obtained from a slide.

In addition to extracting metadata from the image files and providing names for different channels (if applicable), the following CellProfiler analysis modules are required or must be amended in the pipeline order for export to FCS Express: Convert Objects to Images, Save Images, and Export to Spreadsheet. The completed pipeline for this example can be found in the Tutorial Sample Data archive and is named Section4pipeline.cpproj. The completed pipeline is meant to be used as a template for comparison to your pipeline and represents the finished product of this tutorial.

The sample images in the example data set we will be using can be found in the Tutorial Sample Data archive and are the 5 and 10 minute time point images folder. These images are based on the "Human cytoplasm-nucleus translocation assay (Vitra Image)" from the previous module.

 

The steps for exporting data from CellProfiler have been broken down by module. After loading the Section4pipeline.cpproj, follow the steps below to amend the pipeline to prepare for export to FCS Express.

Selecting Default Output Folder

To organize your data correctly for FCS Express, ensure the Default Output Folder is the same folder where your images are stored and used for Default Input Folder (Figure T26.14).  

Note: The DefaultOUT.mat file will also be exported to the Output Folder. This file is for use in MATLAB. If you do not wish to use this file, you can select Do not write MATLAB or HDF5 files from the Output file format drop-down menu.

 

Loading Images, Extracting Metadata, Assigning Names

In order for FCS Express to recognize your data is in 96 well format, metadata from the image file names must be extracted. A Regular Expression must be defined to find the metadata in the file name or path of the data by following these steps in CellProfiler:

1. Click Images category in left column of CellProfiler window.

2. Drag and drop 5 and 10 minute time point images folder from archive folder into designated area in CellProfiler.

3. Click Metadata category directly below Images.

4. Click Yes radio button in the Extact metadata? box.

5. Confirm Metadata extraction drop-down list is set to Extract from file/folders.

6. Confirm Metadata source drop-down list is set to File name.

7. Click the magnifying glass icon to the right of Regular Expression to field.

8. Erase all text from Regex: field.

9. Copy and paste following text into Regex: field, which will define the Channel Number and time with a number:

 Channel(?P<ChannelNumber>[0-9]{1,2})-(?P<time>[0-9]{2})-min.tif

10. Click Submit in Regular expression editor window.

11. Click Update button in lower portion to see information extracted for each of the four images.

12. Click NamesandTypes directly below Metadata.

13. Change Assign a name to drop-down list to Image matching rules.

14. Enter Channel2- in blank field to right of Contain drop-down list.

15. Confirm DNA is entered in Name to assign these field.

16. Click Add another image button.

17. For second image set, enter Channel1- in blank field to right of Contain drop-down list.

18. Replace GFP in Name to assign these field with Cytoplasm.

19. Click Update button in lower portion.

Notes:

Follow these links for a tutorial on "Regular Expressions" and how to translate regular expressions.

Setting up metadata and regular expressions only has to be done for one image in the set. For this example, it has been set up for channel 2.

The regular expression output names: Well, Row, Column, and Col may only be used with plate based or montage imaging experiments. Using any of these names for a single image experiment will result in a non-working export.

 

ExportToSpreadsheet Module

Now that all of the images and object image masks have been defined and saved for analysis, you must now ask CellProfiler to export the measurements you wish to view in FCS Express. For this example, we will export all measurements.

(Note: If you would like to only export certain parameters please see the Selecting Individual Parameters for CellProfiler Export section)

1. Select the ExportToSpreadsheet module.

2. Confirm the Select the column delimiter drop-down list is set to Comma (",").

3. Choose Default Output Folder from the Output file location drop-down list.

4. Click No radio button in the Add a prefix to file names? box.

5. Click No radio button in the Export all measurement types? box.

6. Choose Image from the first Data to export drop-down list.

7. Click No radio button in the Use the object name for the file name? box.

8. Replace DATA.csv with Image_.cptoc in File name field.

9. Right-click between the underscore and period; click time.

10. Click Add another data set button.

11. Choose Nuclei from the second Data to export drop-down list.

12. Click No radio button in the Use the object name for the file name? box.

13. Replace DATA.csv with nuclei_.cpout in File name field.

14. Right-click between the underscore and period; click time.

15. Click Add another data set button.

16. Repeat steps 10-12 for "Cells" and set the file name as cells_.cpout.

17. Right-click between the underscore and period; click time.

The ExportToSpreadsheet module options should look like Figure T26.17 when you are done.

Note 1: on file extensions: ".cptoc" stands for CellProfiler Table of Contents and ".cpout" stands for CellProfiler Output. The "Image.cptoc" file stores information about the location and number of images for all objects processed in the pipeline, while the separate "Nuclei.cpout" and "Cells.cpout" store the actual listmode data associated with individual objects and analysis. There should be a separate ".cpout" file defined for each object mask defined in CellProfiler. Only one "Image.cptoc" file is needed.

Note 2:The name of your .cpout file must begin with the name of the object data you are exporting and may only be include the object name and regular expressions. For example: the file names nuclei.cpout and nuclei_<regular express>.cpout will result in a proper export while dataset1nuclei.cpout or <regularexpression>_nuclei.cpout will resulting in a non-working export.

18. Click Analyze images to run the pipeline. The folder where your images are stored will now contain the cells.cpout, nuclei.cpout and Image.cptoc files for each time point. There will also be a CellLabel and NucleiLabel image associated with every original image from the experiment (Figure T26.34).

Note: The DefaultOUT.mat file will also be exported to the Output Folder. This file is for use in MATLAB, if you do not wish to use this file we recommend deleting it to conserve disk space.

Figure T26.34  Example of CellProfiler Output Folder When Analysis is Complete

Figure T26.34  Example of CellProfiler Output Folder When Analysis is Complete

In the next exercise, we will import and analyze single image numeric data.