Heterozygosity, read counts and GBS: PART 2

Posted on July 29, 2013 by Greg Owens

Subtitle: This time… its correlated.

Previously I showed that with the default ML snp calling on GBS data, heterozygosity was higher with high and low amounts of data. I then took my data, fed it through a snp-recaller which looks for sites that were called as homozygous but had at least 5 reads that matched another possible base at that position (i.e. a base that had been called there in another sample). I pulled all that data together, and put it into a single table with all samples where I filtered by:

Continue reading →

GBS, coverage and heterozygosity

Posted on June 27, 2013 by Greg Owens

I’m running some tests on my GBS data to look for population expansion. I know from looking at GBS data from an F1 genetic mapping population that for GBS data heterozygotes can be under called due to variation in amplification and digestions. Also, for my data observed heterozygosity is almost always under expected. Heterozygotes can also be overcalled when duplicated loci are aligned together. The tests I’m going to use explicitly use observed heterozygosity so this is worrying.

Continue reading →

GBS protocol Version 2.0

Posted on January 29, 2013 by Greg Owens

Hi all,

Here is the long awaited updated GBS protocol.

PROTOCOL ->>>> GBSv2.0

There are three main changes from the previous protocol.

-After digestion and ligation, all the product is kept so more attempts can be made at the PCR.

-The PCR uses Phusion Taq, has longer extension times and one additional cycle with more primers/Taq.

-Size selection is done using AMPure beads instead of gel extraction.

Continue reading →

GBS multiplexing

Posted on November 28, 2012 by Greg B.

GBS_mutliplexing contains 2 scripts that may be useful to you if you are using GBS data. One essentially formats GBS reads for Tassel. The other demultiplexes the reads. A readme file in there explains things in more detail.

Edited: edited the readme and added a script to convert qseq to fastq

Genotype By Sequencing (GBS) Barcodes

Posted on November 23, 2012 by Rose

Here are GBS_Barcodes and adapters that we currently have in the lab for GBS, for sequencing on an Illumina machine. They were designed using the site Deena Bioinformatics.

This information came from Greg Baute’s blog and I’ve just converted the file to .xls.

GBS phylogenetics

Posted on November 9, 2012 by Greg B.

Here is a phylogenetic network of wild annuals. This is made from GBS data, aligned against the celera assembly. This is ~40,000 sites which have 2 alleles and <20% missing data.

Here is a excel file that describes the samples: GB_GBSdesign

The JPEG looks horrible, download it as a pdf: 213i_80m

Tab delimited under the fold:

Continue reading →

GBS missing data

Posted on November 6, 2012 by Greg B.

I’ve done a small analysis on my GBS data and posted it on my blog: http://www.proseedwithscience.com/?p=816

Edit: This is mostly just a quick look at the amount of missing data in the data and some potential explanations of where it might be from.

GBS Protocol (GregO)

Posted on June 14, 2012 by Greg Owens

Kristin and I have been working on GBS for a long time and since it now seems to be working, we wrote up a protocol. It is mostly the same from Greg Baute’s previous protocol, but with a few key changes (More DNA, more PCR). I’ve made it look nice and included a diagram for ease of thought.

Also, the official pronunciation of GBS is ‘jibs’

Continue reading →

STACKS installation (Rose)

Posted on December 12, 2011 by Rose

Installing stacks on Ubuntu Natty Narwhal or Oneiric Ocelot

STACKS is a piece of software produced by Julian Catchen in the Cresko lab. It’s designed to identify loci and alleles from RAD (or GBS) reads either de novo or after alignment to a reference. It consists of several modules that can be run separately, but to completely install it as a pipeline, it relies on a web server, unfortunately. Many of the required instructions are given in the README file, but because nobody in our lab is an expert on this, we had to fiddle around to get the program running on our Ubuntu machines.

Continue reading →

Rieseberg Lab Resources

RLR: Technical resources for Rieseberglers

Tag Archives: GBS

Heterozygosity, read counts and GBS: PART 2

GBS, coverage and heterozygosity

GBS protocol Version 2.0

GBS multiplexing

Genotype By Sequencing (GBS) Barcodes

GBS phylogenetics

GBS missing data

GBS Protocol (GregO)

STACKS installation (Rose)