Segmentation Methodology

Relevant section from the [CODEX preprint](https://www.biorxiv.org/content/early/2018/02/05/203166):

> A 3D segmentation algorithm was therefore created to combine information from the nuclear staining and a ubiquitous membrane marker (in this case CD45) to define single-cell boundaries in crowded images such as lymphoid tissues. For each segmented object (i.e., cell) a marker expression profile, as well as the identities of the nearby neighbors were recorded (using Delaunay triangulation)


### Software

- [CellProfiler](http://cellprofiler.org/): uses [ilastik](http://ilastik.org/) for segmentation, cf. [blog post](https://blog.cellprofiler.org/2017/01/19/cellprofiler-ilastik-superpowered-segmentation/)
- scikit-image: [paper](https://peerj.com/articles/453/), [blog post](https://efferencecopy.net/allen-brain-image-segmentation-to-extract-neuron-cell-bodies/)
- CellSegm: [paper](https://scfbm.biomedcentral.com/articles/10.1186/1751-0473-8-16), [code](https://github.com/ehodneland/cellsegm/)
- DALMATIAN: [paper](https://www.ncbi.nlm.nih.gov/pubmed/29311849), [code](https://github.com/koulakovlab/dalmatian)
- TissueMiner: [paper](https://elifesciences.org/articles/14334), [code](https://github.com/mpicbg-scicomp/tissue_miner)
- FogBank: [paper](https://www.ncbi.nlm.nih.gov/pubmed/25547324), [code](https://github.com/usnistgov/FogBank)
- BioVoxxel toolbox: [paper](https://www.ncbi.nlm.nih.gov/pubmed/29221640), [code](https://github.com/biovoxxel/BioVoxxel_Toolbox)
- [Detectron](https://github.com/facebookresearch/Detectron)


Expanding on that list a bit:

- CellProfiler: 
    - There is another [blog post](https://blog.cellprofiler.org/2017/10/16/cellprofiler-3-0-release-faster-better-and-3d/) that mentions volumetric segmentation (as opposed to ilastik which afaik is only 2D).  It's a little unclear exactly what capabilities they are referring to within CellProfiler but it may simply be this 3D watershed implementation: [watershed.py](https://github.com/CellProfiler/CellProfiler/blob/master/cellprofiler/modules/watershed.py).  I don't see any modules in CellProfiler for segmentation that are also designed to work in 3D.
    - This post also mentions the [Allen Cell Explorer](http://www.allencell.org/3d-cell-viewer.html) which seems like a great way to curate 3D volumes
    - In the [forum post associated with that blog post above](http://forum.cellprofiler.org/t/cellprofiler-3-0-release-faster-better-and-3d/5198), they also mention the [Google Accelerated Sciences plugin for Scoring Image Focus](https://github.com/CellProfiler/CellProfiler-plugins/wiki/Measure-Image-Focus) and a pre-trained deep learning model for cell segmentation based on the paper "[Automated Training of Deep Convolutional Neural Networks for Cell Segmentation](https://www.nature.com/articles/s41598-017-07599-6)"
- [DeepCell](https://covertlab.github.io/DeepCell/) ([site](https://covertlab.github.io/DeepCell/)) ([repo](https://github.com/CovertLab/DeepCell))
    - [Paper](http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005177)
    - Has [docker containers](https://covertlab.github.io/DeepCell/starting/) 
- [DeepFlow](https://www.biorxiv.org/content/early/2017/06/05/081364) ([original version](https://www.biorxiv.org/content/early/2016/10/17/081364)):
    - Reconstructs cell cycle for T-cells and identifies 7 different phases as well as identifies dead vs alive cells
    - [mxnet implementation on github](https://github.com/theislab/deepflow) (from scanpy creators)
    - This is a bit off topic for segmentation but the T-cell imaging data used for it could be useful

### Models Specific to Medical Imaging

- [U-Net](https://arxiv.org/abs/1505.04597) ([Example TF-based implementation](http://tf-unet.readthedocs.io/en/latest/usage.html)) - This appears to be a real workhorse architecture in medical image segmentation (there are dozens of implementations in TensorFlow and Caffe)
- [V-Net](https://github.com/jackyko1991/vnet-tensorflow) - A TensorFlow implementation of 3d extensions to the U-Net
- [NiftyNet](https://github.com/NifTK/NiftyNet) ([Site](http://www.niftynet.io/)) - "NiftyNet is a TensorFlow-based open-source convolutional neural networks (CNN) platform for research in medical image analysis and image-guided therapy." 
    - If we have to retrain an architecture for segmentation I have to imagine this would be a top choice.
    - Supports 2-D, 2.5-D, 3-D, 4-D inputs
    - It has a [Model Zoo](https://cmiclab.cs.ucl.ac.uk/CMIC/NiftyNetExampleServer/blob/master/model_zoo.md) but nothing in there for our modality yet, or anything even close
    - (Original Publication](https://arxiv.org/abs/1709.03485)  

### Generic Architectures

- [DeepLab](https://github.com/tensorflow/models/tree/master/research/deeplab) ([Google Research Post](https://research.googleblog.com/2018/03/semantic-image-segmentation-with.html)) - Google research project in the vein of Detectron
    -  My gut says we'd never have enough data to train these big general kinds of models but who knows
- [SegNet](http://mi.eng.cam.ac.uk/projects/segnet/) - Another generic architecture for semantic segmentation which I only mention because it was brought up along with U-Nets in this [webinar](http://info.nvidia.com/dl-pipelines-for-disease-detection-in-medical-images-reg-page) on advances in medical image analysis
    - [Publication](https://arxiv.org/abs/1511.00561)


*Comments from @nsamusik on some things to keep in mind*:

My main thought at this point is that the segmentation itself is just the first step, there also has to be a second step, where cell boundaries are optimized concomitantly with estimating the single-cell expression vectors. This way both the optimized cell boundaries and the expression data will likely look more accurate. 

As for the benchmarking, I am happy to share a hand-labelled dataset that I have generated for the CODEX paper revisions. Here, each TIFF is matched with a TXT file that contains the coordinates of hand-labeled cell centers (X, Y, Z). There are no cell outlines labelled here, just the centers. In order to assess the segmentation quality, I computed several measures: R = Recall (% of hand-labelled centeres that ended up within a segmented cell region), S= Singlets (of those, what % how many ended up in a cell region with exactly 1 hand-labelled center), FPR = False positive rate (% cell regions without a hand-labelled center). Then I combined the three in a harmonic mean 3/(1/R + 1/S + 1/(1-FPR)) 

here's the link
https://drive.google.com/open?id=1wUNaZ5dv2mDn_wwcSXlnfof6SwoQmlsq


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Segmentation Methodology #5

Software

Models Specific to Medical Imaging

Generic Architectures

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Segmentation Methodology #5

Description

Software

Models Specific to Medical Imaging

Generic Architectures

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions