Darius Bost

Jump to: ncat_marc, search

Contact Information
4509 Munsee St. Charlotte NC, 28213

Email: bdarius85@gmail.com

North Carolina A&T State University, Greensboro, North Carolina 27411
Senior Undergraduate Student
Major: Applied Mathematics Concentration in Biology

Academic Honors and Awards

  • Golden Key International Honour Society, 2012
  • Arts and Sciences Dean’s List, 2010-2012
  • North Carolina A & T State University Honors’ Program, 2010 - 2012
  • Alpha Kappa Mu Inductee 2013
  • Alpha Lambda Delta Inductee 2011

Presentations and Publications

  • Presented poster and Oral
North Carolina LSAMP/RISE joint research symposium
    • Conference Sept 20-21, 2012
      • Poster
Nashville Tennessee ABRCMS
    • Conference
      • Oral
ERN Conference
    • Conference Feb 20-23
      • Oral


Pooled Sequencing Power Analysis
Darius M. Bost1; Takahiro Maruki, Ph.D.2; Scott Harrison, Ph.D.1; and Michael Lynch, Ph.D.2
North Carolina Agricultural and Technical State University, Greensboro, NC1
Indiana University, Bloomington, IN2
Pooled population sequencing using high-throughput sequencing technologies is increasingly becoming a widely used method of characterizing genetic variation within a given population. It is argued that pooled sequencing can be more cost effective because of the reduced up-front investment prior to sequencing, but this procedure can have high error rates that misidentify variation. The purpose of the project is to assist in the development of an appropriate maximum-likelihood estimator for the frequency of an allele in a pooled sample, taking into account the contribution from sequencing errors. Our hypothesis was that thresholds of sample size and coverage can be identified to ensure accuracy with polymorphism detection. Our methodology involved a maximum likelihood estimator algorithm processing 18 different simulated data sets (populations) that varied across factors of three sample sizes, three depths of sequencing coverage, and two error rates. Subsequently, we employed a statistical power analysis on the data received from each of the iterations. False-positive rates were an outcome of false estimations of polymorphisms, and false-negative rates were an outcome of false estimations of monomorphisms. Our results showed improved power for detecting true polymorphisms with increased coverage, but we found this to be dependent upon the underlying true minor allele frequencies. We examined minor-allele frequency thresholds above which there is a 95% probability of polymorphism detection with statistical significance levels set at 0.01, 0.001, and 0.0001. Overall, for the populations being analyzed, our results indicated that coverage has more effect on detecting polymorphisms than sample size. We further examined the effect that coverage and error rates have on the estimation of minor allele frequencies. As coverage increased, estimations of actual allele frequencies improved. Estimations of allele frequencies also improved with a lower error rate. Our conclusion is that the application of this work will improve detection of genetic variation in multiple populations and help control for the necessary sampling and coverage thresholds needed for examining pooled sequencing data with varying levels of error.
Mendelian Genotyping of Mutant Arabidopsis Thaliana

Darius M. Bost1, April Roberts2, Ivory Clabaugh2, Ann Loraine2

1 North Carolina Agriculture and Technical State University,
Greensboro, NC 27411

2 University of North Carolina at Charlotte
Kannapolis, NC 28081
The Thale Cress (Arabidopsis Thaliana) is commonly used in studying plant genes because of its short maturation period. A. Thaliana mutant line CS25(AGAMOUS/AT4G18960.1, available from TAIR www.arabidopsis.org)carries a recessive allele of AG-1. This allele causes a phenotype in which stamens are transformed into petals and carpels are transformed into new flowers. The result in infertile flowers looks like double flowers. Seeds ordered from the stock center have presumably come from heterozygous plants but there is no way of knowing the genotype of an individual seed until it has been grown to the flowering stage. The goal of this project was to use Mendelian genotyping in combination with probability statistics to identify parent plants that carry the recessive allele. Seeds were collected from thirteen plants of seed stock CS25, identified as wild or heterozygous. Fourteen seeds were then collected from each stock of the thirteen, and allowed to grow to flowering stage. The phenotypes of the flowers were then determined and the ratio of progeny for each group of seeds calculated. This ratio was used to establish the likely genotype of the plant from which the seeds were harvested, based on the monohybrid cross Punnett square. The results will help establish a stock of seeds with a heterozygous genotype for future studies.

My current research interests include:
  • Mathematical Biology
  • Partial Differential Equations
  • Genetic Modeling
  • Population Dynamics
  • Finite Element Methods
  • Wiki design/Upkeep
  • Autonomous Robotics Programming and Circuit Building
  • Integrated Genome Browser
  • PCR process
  • Matlab
  • Java
  • Genotyping plants through growing progeny


  1. Lynch, Michael, et al. "Population-Genetic Inference from Pooled-Sequencing Data." Genome biology and evolution 6.5 (2014): 1210-1218.

Research Interest

  1. Computational Biology
  2. Genetics
  3. Evolution
  4. Epidemiology

Research Advisors

My advisor is Dr. Mary Smith.
Dr. Ann Loraine advised me at the North Carolina Research Center (NCRC)
I'm currently working with Dr.Scott Harrison in collaboration with Dr. Michael Lynch.

Current Developments

  • Conducting research at Washington University in the OGR Scholars Program