Starting from a data source, there are six steps:
- Data source => string … For .fa files this is just stripping out the non-bp characters
- String => segment … Here it’s just fixed-size segments of 100,000 bp
- Segment => pattern … A pattern is the collection of fixed-size 8 bp units plus how many times each one occurs in the segment
- Pattern => pattern … Transformations; not doing that yet
- Grid of pattern combinations … Each pair of patterns is scored by intersecting their common units and summing the minimum counts
- Grid of results to colors … This maps the numbers onto a set of colors, which is actually not as simple as it seems because the distribution is kind of weird
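
The steps above can be sketched roughly as follows. This is only an illustration, not the actual code: every function name is made up, and several details are assumptions, namely that header lines starting with ">" are dropped, that only A, C, G and T count as bp characters, that the 8 bp units are read as overlapping windows (the `step` parameter), and that colors are assigned by rank as one possible way to cope with a skewed distribution. The transformation step is left out since it isn't done yet.

```python
from collections import Counter

BASES = set("ACGT")  # assumption: only these count as bp characters


def fasta_to_string(text):
    """Data source => string: keep only base characters."""
    kept = []
    for line in text.splitlines():
        if line.startswith(">"):  # assumption: skip FASTA header lines
            continue
        kept.extend(ch for ch in line.upper() if ch in BASES)
    return "".join(kept)


def split_segments(seq, size=100_000):
    """String => segment: fixed-size pieces (the last one may be shorter)."""
    return [seq[i:i + size] for i in range(0, len(seq), size)]


def segment_pattern(segment, k=8, step=1):
    """Segment => pattern: count each k-bp unit in the segment.

    step=1 reads overlapping windows (an assumption); step=k would
    read the segment as non-overlapping units instead.
    """
    return Counter(
        segment[i:i + k] for i in range(0, len(segment) - k + 1, step)
    )


def similarity(a, b):
    """Intersect the common units and sum the minimum counts."""
    # Counter & Counter keeps shared keys with the smaller count.
    return sum((a & b).values())


def similarity_grid(patterns):
    """Grid of pattern combinations: one score per pair of patterns."""
    return [[similarity(p, q) for q in patterns] for p in patterns]


def to_color_indices(grid, n_colors):
    """Grid of results to colors: rank-based mapping (hypothetical choice)."""
    values = sorted({v for row in grid for v in row})
    rank = {v: i for i, v in enumerate(values)}
    top = max(len(values) - 1, 1)  # avoid dividing by zero
    return [[rank[v] * (n_colors - 1) // top for v in row] for row in grid]
```

Typical use would be `similarity_grid([segment_pattern(s) for s in split_segments(fasta_to_string(text))])`, with the resulting indices looked up in whatever palette is being used. Ranking the distinct values spreads the colors evenly even when a few scores are far larger than the rest, at the cost of hiding how big the gaps between scores really are.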