bioSyntax TODO #2

@ababaian

Open TODO list

A collection of tasks to improve bioSyntax. Feel free to tackle any of these if you'd like to pitch in. If there's no name/ID beside a task, it's open 👍. If there's a name, find that person and ask how to help.

Also, any issue labeled with a green help tag is a great place to start collaborating.

Features

  • 1) Add atomic coloring: apply JMol / CPK colors to atoms/elements when they appear in a file format (e.g. PDB).

  • 2) Set up Vim / Sublime / Gedit to only use the bioSyntax theme when a bioSyntax format is being used; otherwise use the default or preset theme. Sublime Example.
    See: Issue 7: Sublime Auto-detect [Artem / Alyssa]

  • 4) Optimize the VCF regexes for gedit, Sublime, and Vim(?) to avoid catastrophic backtracking (see vcf-less for a fixed example).
    See: Optimize Regex Engine for VCF #36

  • 5) Secondary Color Gradient: In BED/WIG files, where there can be a score, keep the color scheme we have now for the 0-1000 range. Add a second color gradient (orange?) that recognizes the 0.0-1.0 decimal scale. This will support the two widely used score ranges: 0-1 and 0-1000.

  • 6) Make 'Infographics' for complex file types (SAM, VCF, GTF) to help users learn and interpret the file formats. Include things such as the PHRED numeric scale, FLAG bit conversion, what each field is, etc. Use bioSyntax theme colours as a teaching tool here.
    See: Develop built-in help manuals for existing packages #25

  • 7) Website / Science art. A fun part of this project has been exploring the design and art side of computational biology. If you've got some artistic talent, there's a bunch of small and large art projects to spruce the place up. Just hit us up.

  • 8) For Clustal format (.clustal / .aln), create secondary language definition files for amino-acid based alignments. Currently only nt-sequences are supported.

  • 10) Develop either a Rainbow Parentheses or Rainbow Levels script to work with Newick tree format. This will also be used for Nexus files.
    See: Develop Rainbow Parentheses/Rainbow Levels Script for Newick Trees #37

  • 11) Recruit / find interested non-biology scientists to compile / spearhead what file formats should be developed for the other computational sciences (physics / chemistry / psychology etc...)

  • 13) Use less / source-highlight local preferences. When /usr/bin/source-highlight is run, it checks the folder it is run from for compatible .lang files. To avoid having to sudo cp the bioSyntax files, we can initialize a local folder ~/.source-highlight/, store the bioSyntax files there, and point the alias scripts to it. Installing source-highlight will still require sudo, but installing bioSyntax won't. Check if this is feasible and implement on a branch. Perhaps switch to highlight? Note: This is done for Linux installations but not yet for Mac installs.

  • 14) Port for Nano / Pico. [Anicet]

  • 15) Found a cool little script on Wikipedia for converting PHRED scores to bars. It can be incorporated into the fastq-less script. Sadly, it's not really applicable to other formats at the moment. In Vim, we could exploit the conceal<-->cchar relationship to redraw PHRED scores in the syntax file.

sed -e 'n;n;n;y/!"#$%&'\''()*+,-.\/0123456789:;<=>?@ABCDEFGHIJKL/▁▁▁▁▁▁▁▁▂▂▂▂▂▃▃▃▃▃▄▄▄▄▄▅▅▅▅▅▆▆▆▆▆▇▇▇▇▇██████/' myfile.fastq 
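For task 5, one open question is how a highlighter would tell the two score scales apart, since 0 and 1 are valid on both. Below is a minimal sketch of one possible rule, assuming that a decimal point marks the 0-1 scale and a bare integer marks the 0-1000 scale. The function name `score_scale` and the rule itself are illustrative assumptions, not existing bioSyntax behavior.

```shell
# Hypothetical helper: classify a BED/WIG score string by scale.
# Assumption: values written with a decimal point belong to the 0-1 scale,
# bare integers up to 1000 belong to the 0-1000 scale.
score_scale() {
  awk -v s="$1" 'BEGIN {
    if (s ~ /^(0\.[0-9]+|1\.0+)$/)
      print "decimal"            # 0.0 ... 1.0, written with a decimal point
    else if (s ~ /^[0-9]+$/ && s + 0 <= 1000)
      print "integer"            # 0 ... 1000, written without a decimal point
    else
      print "none"               # outside both ranges: leave uncolored
  }'
}

score_scale 0.25
score_scale 750
```

The same two patterns could, in principle, be translated into the regex dialect of each editor's syntax definition, with each class mapped to its own gradient.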
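For task 6, the FLAG bits and the PHRED scale are the kind of thing an infographic or help page would spell out. Here is a rough sketch of both conversions, using the bit meanings from the SAM specification and assuming Phred+33 encoded qualities. The function names `decode_sam_flag` and `phred_score` and the short bit labels are made up for illustration.

```shell
# Hypothetical helper: list the names of the bits set in a SAM FLAG value.
decode_sam_flag() {
  flag=$1
  out=""
  # bit value : short label (labels are illustrative, meanings follow the SAM spec)
  for entry in \
    1:paired 2:proper_pair 4:unmapped 8:mate_unmapped \
    16:reverse 32:mate_reverse 64:first_in_pair 128:second_in_pair \
    256:secondary 512:qc_fail 1024:duplicate 2048:supplementary
  do
    bit=${entry%%:*}
    name=${entry#*:}
    if [ $(( flag & bit )) -ne 0 ]; then
      out="${out:+$out,}$name"   # append with a comma separator
    fi
  done
  printf '%s\n' "$out"
}

# Hypothetical helper: convert one quality character to its PHRED score.
# Assumption: Phred+33 encoding, so Q = ASCII code - 33.
phred_score() {
  code=$(printf '%d' "'$1")      # leading quote makes printf emit the character code
  echo $(( code - 33 ))
}

decode_sam_flag 99
phred_score I
```

With these definitions, `decode_sam_flag 99` prints `paired,proper_pair,mate_reverse,first_in_pair` and `phred_score I` prints `40`.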
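For task 10, a Rainbow Levels approach could be prototyped on the less side before writing editor-specific versions. The sketch below colors each parenthesis by its nesting depth using ANSI escape codes. It assumes one tree per line and does not handle parentheses inside quoted labels. The function name `rainbow_newick` and the six-color palette are arbitrary choices for illustration.

```shell
# Hypothetical helper: color Newick parentheses by nesting depth (reads stdin).
rainbow_newick() {
  awk '
    BEGIN { n = split("31 33 32 36 34 35", color, " ") }  # ANSI color codes, cycled by depth
    {
      depth = 0; out = ""
      for (i = 1; i <= length($0); i++) {
        ch = substr($0, i, 1)
        if (ch == "(") {
          depth++
          out = out "\033[" color[(depth - 1) % n + 1] "m(\033[0m"
        } else if (ch == ")" && depth > 0) {
          # a closing parenthesis gets the same color as its opening partner
          out = out "\033[" color[(depth - 1) % n + 1] "m)\033[0m"
          depth--
        } else {
          out = out ch             # labels, commas, branch lengths pass through
        }
      }
      print out
    }'
}

printf '%s\n' '((A:0.1,B:0.2):0.3,C:0.4);' | rainbow_newick
```

Matching pairs share a color, so the depth of any clade can be read off from the color of its enclosing parentheses.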

New File Formats Definitions for bioSyntax
