More complete dataset documentation #3051

flying-sheep · 2024-05-13T10:42:56Z

Each dataset’s documentation should contain

what it contains (listing obs, …)
what steps have been run on it
better links (e.g. is pbmc68k_reduced this one? the docstring isn’t clear. It was added by @fidelram in new ranked genes plotting functions #228 …)

Especially important is if its .X is logarithmized, normalized, and/or filtered

The text was updated successfully, but these errors were encountered:

flying-sheep · 2024-05-13T10:44:36Z

idea for semi-automating 1.: we could have a representation (ideally the new fancy HTML one) created and attached by CI

flying-sheep · 2024-05-13T12:03:32Z

I think pbmc68k_reduced was processed something like

sc.pp.normalize_total(adata, target_sum=1e6)
sc.pp.log1p(adata)
sc.pp.scale(adata)

still no idea what’s in “raw” as it’s clearly not counts …

flying-sheep added Enhancement ✨ Area - Documentation 📒 labels May 13, 2024

flying-sheep mentioned this issue May 13, 2024

Extend benchmarks from basic tutorial #3031

Merged

3 tasks

flying-sheep mentioned this issue May 14, 2024

Document datasets #3060

Merged

6 tasks

flying-sheep closed this as completed in #3060 Jun 4, 2024

Provide feedback