1. Mar 2026
    1. On 2019-06-06 18:27:31, user Amrit Singh wrote:

      Hi Y-h. Taguchi, this is interesting. The datasets in the mixOmics R-library are a small subset of the original data used in the manuscript (https://www-ncbi-nlm-nih-go... "https://www-ncbi-nlm-nih-gov.ezproxy.library.ubc.ca/pubmed/30657866)"). Please see https://github.com/singha53... for the entire breast cancer data (train and test). I wonder if there is a sparse version of HOSVD, in order to perform variable selection which is already implemented in DIABLO using soft-thresholding.<br /> Best,<br /> Amrit

    1. On 2019-06-28 18:15:59, user Fraser Lab wrote:

      This manuscript by Rocchio and co-workers investigates the structural basis for the interaction between the molecular chaperone Spy and the client protein Im7. Although the Spy-Im7 complex crystallizes, only the Spy portion is readily modelable and there is only residual electron density for the Im7 molecule. This attribute of the cyrstal system means that investigating the structural basis for the interaction is challenging for traditional X-ray crystallography methods. In a previous publication (Horowitz et al., 2016), an approach was designed to overcome this challenge; 4-iodophenylalanine (pI-Phe) was introduced into the Im7 peptide at one of eight positions, and anomalous data was collected for the Spy-Im7 complexes. The Spy-Im7 interaction was mapped using a combination of the anomalous signals from pI-Phe, the residual electron density from Im7, with plausible Im7 conformations generated using molecular dynamics simulations. The original work was the subject of criticism (Wang, 2018), which focused mainly on whether or not the interpretation of the weak anomalous signal was valid. The authors have acknowledged aspects of the criticism (Horowitz et al., 2018), while emphasizing the cross-validation build into their approach, and that other biophysical methods (e.g. NMR) corroborate their proposed model for the Spy-Im7 interaction. The major successes of the present manuscript are that: i) it increases the sensitivity of pI-Phe assignment, validating parts of the previous paper, and ii) it contributes to the development of anomalous scattering as a tool for interrogating protein structure and function.

      In the present work, Rocchio and co-workers focus on optimizing data collection and analysis for anomalous signals from pI-Phe. They placed pI-Phe at one of three positions (only one of which cleanly overlaps with the previous publication) in the Im7 peptide and collected diffraction data around the iodine L absorption edge (? = 2.38 Å and 2.76 Å). Because the iodine scattering factor is 3.9-fold lower at the longer wavelength, peaks in the anomalous density map that are present at 3.28 Å but absent at 2.76 Å, were assigned to iodine. To test signal reproducibility, they adjusted the goniometer ? angle, and collected additional datasets from the same crystals, and also collected repeat datasets from different crystals. The reproducibility was moderate; three signals were observed in all the datasets, one signal was observed only in one of the datasets, and two signals were observed in three out of four datasets. The authors used a cut-off of 6 ? for assigning peaks. Why was this cut-off chosen? Were peaks present at site I3 (e.g. Fig. 1A/C) at lower electron density thresholds? We suggest that showing the anomalous difference maps at various thresholds is a good idea for a supplemental figure to appreciate the sparsity of the signal at high sigma values and the choice of the cut-off at 6 ? .

      The manuscript could be strengthened if the authors addressed the lack of reproducibility for some of the iodine signals in more detail. They state that it is “…clear that the ability to detect low intensity signals is highly dependent on the crystal and the resolution.” Could it be that different crystallization conditions have shifted the binding mode of Im7? I noticed that the concentration of zinc acetate was listed as between 70-270 mM – could this have contributed to the difference? The authors also mention that three out of the four binding sites were identified in the previous work. To make a fair comparison, it might be worth showing all of the proposed binding sites from the previous paper (Horowitz et al., 2016) (e.g. on Fig. 5). On a related note, how confident are the authors in the refined occupancies for the iodine atoms? Is it possible to perform refinement with different initial occupancies to see if they converge on a particular value?

      The major weakness of the manuscript lies in its failure to extend the increased sensitivity of iodine assignment to an improved understanding of the Spy-Im7 interaction. The authors begin the manuscript by introducing the READ algorithm – why hasn’t the more sensitive assignment of pI-Phe been fed back into READ? It would be interesting to see if whether the ensembles generated by the two anomalous data collection strategies agree with each other.

      Overall this is a well written article that will be of interest to a wide section of the structural biology community. The improved capability of modern beamlines to collect anomalous data at long wavelengths, as described in this paper by Rocchio et al., may help to interrogate the structure and function of macromolecules that were previously intractable to traditional approaches.

      Minor points

      1. Consider revising the manuscript title – it is possible for a residue to be partially occupied and not dynamic - “conformationally heterogeneous” is probably more accurate. Or leave it as is, but clearly define the differences and the ability of crystallography to inform between dynamics and heterogeneity somewhere in the manuscript

      2. It might be helpful to add a couple of sentences in the introduction to describe what is known about the Spy-Im7 interaction from orthogonal methods (see point 6).

      3. Please include a section in the methods describing how the ITC experiment was performed.

      4. Page 4 line 31: Typo - “we reasoned that we should be able to specifically distinguish iodine anomalous signals from other with elements”

      5. Page 2 line 14: Shouldn’t it be Residual Electron and Anomalous Density not Residual Anomalous and Electron Density?

      6. Please either label Fig 1D with the location of the “crook of Spy’s cradle” or include a new figure. It might be helpful to have a “cartoon” type schematic in the introduction to illustrate what is known about the Spy-Im7 interaction.

      7. Fig S4-6: Please label the iodine peaks with the labels used in Fig. 1 (e.g. I1, I2, I3, I4).

      8. PDB 6OWX: Is imidazole 214 in chain A modelled correctly?

      9. It’s notable that they have co-crystalized Spy with 10 peptides (8 in the previous publication, 1 overlapping in both, and 2 novel ones in this publication) - have soaks been attempted? This could also add an interesting experimental control where the anomalous signal should be displaced by excess unlabeled client peptide but maintained (or enhanced) by labeled peptide.

      10. Peak height is a good measure and relatively unbiased, but the anomalous maps have the potential to inform (in a relatively model unbiased manner) on the occupancy and B-factor directly. Showing multiple concentric contours or plotting density as a distance from peak center along a pseudo atomic radius will help to clarify the differences in the profiles. For example, in table 4, the peak heights for the 4th and 5th rows are the same, but the occupancy is 2-fold different.

      11 - Figure 4 is pretty difficult to follow, even for us. We could do with a bit more annotation of where the disordered peptide backbone is predicted to trace. It also seems like the aromatic group attached to the I is a fairly strong modeling constraint that could help guide the eye in this figure.

      12 - These data are especially valuable for methods developers and given the issues raised by Wang on the previous paper, it is especially important to put as raw data as possible out into the public. Papers like this should be held to a higher data deposition standard (mtzs are not enough!): the authors should deposit their raw diffraction data at SBGrid Data Bank (https://data.sbgrid.org/) "https://data.sbgrid.org/)") or an equivalent resource to enable future re-use and validation.

      We review non-anonymously, James Fraser and Galen Correy (UCSF), and will have posted this review as a public comment on the preprint: https://www.biorxiv.org/con...

      References<br /> Horowitz, S., Salmon, L., Koldewey, P., Ahlstrom, L., Martin, R., Quan, S., Afoine, P., van den Bedem, H., Wang, L., Xu, Q., Trievel, R., Brooks, C. & Bardwell, J. (2018). Nat. Struct. Mol. Biol. 25, 989–991.<br /> Horowitz, S., Salmon, L., Koldewey, P., Ahlstrom, L. S., Martin, R., Quan, S., Afonine, P. V., Van Den Bedem, H., Wang, L., Xu, Q., Trievel, R. C., Brooks, C. L. & Bardwell, J. C. A. (2016). Nat. Struct. Mol. Biol. 23, 691–697.<br /> Wang, J. (2018). Nat. Struct. Mol. Biol. 25, 989–991.

    1. On 2021-06-23 07:38:24, user Alex Crits-Christoph wrote:

      This work by Professor Jesse Bloom details a phylogenetic analysis of a set of SARS-CoV-2 genomic sequences originally submitted to the Sequence Read Archive, likely by the authors of Wang et al. 2020, in February - March, 2020. Both a preprint, and a later paper (by Wang et al.), were publicly available describing these sequences, including key aspects of their genomic features. However, independently of the preprint or paper, which make no direct reference to an NCBI submission, the sequences been marked for deletion on NCBI, but were re-obtained by Bloom via internet backups of the data. While sharing more genomic data from the early epidemic can be valuable, I believe the current version of this work makes several errors that are important to address in both scientific content and critically, in scientific communication.

      In general, the work is vague or remiss about extremely important context and details about the sequences in question. To begin, the genomes obtained are consistently referred throughout the title and abstract (and indeed most of the text) as being from the "early" epidemic. This terminology is too vague, as it may invite the reader to assume that the genomes include the earliest cases known - in fact, they are not, and were originally reported by Wang et al. 2020 to be from January 2020. Over 25 genomes had previously been obtained during 2019 for SARS-CoV-2. To properly interpret the context of this work, an emphasis on the timing is critical and should not be avoided in the abstract or title.

      Secondly, the abstract and title both strongly give the inaccurate impression that the re-obtained sequences alone newly demonstrate that the known and reported genetic diversity of viruses in the Huanan market was not representative of all circulating SARS-CoV-2 variants at the time. This is alluded to in the title, which says they "shed more light" on the early epidemic. Yet, this is not supported by the data in the manuscript: it was already known that the genetic diversity of viruses obtained from Huanan market cases was not representative of all genetic variants circulating at that time. As can be seen most clearly in Figs 3 and 5, the new sequences obtained were not completely unique in sequence or unusual in case timing. There were already several reported publicly available similar genomes with for the most part identical sequences from the early epidemic. The author is certainly aware of this, as it is evident from the data presented and from the works cited (both Garry 2021 and Kumar et al. 2021 discuss this at length), but this key point is not emphasized in the title or abstract, and indeed is a weakness of the importance of the work itself.

      Thirdly, in the introduction the work makes several key omissions about the timing of cases. The introduction directly suggests that the Huanan market is unlikely to be a site of zoonosis because a number of early cases could not be confirmed to be connected to the market; However, this was true of SARS-CoV-1 in Fushan as well. The author also neglects to mention that a significant fraction of cases were connected to *other* markets in the city, including the earliest cases (as described in the WHO report). And indeed, if there were multiple spillover events, e.g. from a shared wildlife supply chain that connected multiple markets throughout the city, even with perfect contact tracing data one would not expect all cases to be connected to one market.

      Fourthly, I believe the inclusion of Figure 2 in this paper is highly unprofessional and misleading. This is a screenshot of an email from a scientist who is not associated with the papers or data described in the data; indeed, the only relevance between them seems to be a shared nationality of the author(s). Additionally, as of June 2021 the data described as 'deleted' in this screenshot is publicly available again on NCBI. While I understand that Bloom likely did not realize this mistake, it is difficult to justify the inclusion of this figure in a scientific work.

      Fifthly, and perhaps most importantly, I think the previous point happens to provide a fortuitous lesson about a key error that Bloom makes in assuming misconduct on the part of the Wang et al. 2020 authors. He writes "It therefore seems the sequences were deleted to obscure their existence". It is implicitly assumed that Bloom believed that to also be true of the data described in Figure 2 - hence its inclusion - but its recent re-publication on NCBI makes it abundantly clear that was not the case. To be honest, this is not surprising. Claims of scientific misconduct made from a distance, without knowledge of the details and circumstance surrounding the issue, have the potential to appear convincing despite having perfectly ordinary (or perhaps slightly less than ideal, but not nefarious) scientific rationale.

      Similarly, there is also evidence that scientific misconduct is also highly unlikely to be the case with the dataset that is described in this paper. Primarily, this is because the paper describing these sequences was published in a journal several months later. If the authors were indeed trying to obscure their data, they would simply also not publish their paper in a public journal, or request it to not be published after submitting for review. If these sequences were removed for the purpose of obscurity, it is also worth noting that such an effort clearly flopped - because as described above, these sequences do not immediately provide any completely new knowledge about the genetic diversity of SARS-CoV-2 in the early pandemic. In both the preprint and the paper (the first submitted before data removal; the second published likely after), no mention of a BioProject accession is made, and instead it is said that the data is available upon reasonable request from the authors. If the writers initially intended for the data to be made public, standard practice would have included the BioProject reference (or even a placeholder) in the text. The fact that there was never such a reference makes it quite evident that there likely was a miscommunication between co-authors about whether the data was intended to be released to NCBI.

      The reality is that minor scientific missteps and less-than-ideal circumstances surround the sharing of scientific data all of the time. The process of publishing on scientific data is fraught with tiny details and hard work, the trainees responsible for it are often overworked, and there are often unfortunate but relatively ordinary scientific incentives to avoid making as much data public as should be. The rationale behind what data is and is not shared for scientific works are often nuanced, and sometimes quite personal. The scientific process would be better if it were not the case, but these forces are almost universal, and everyone encounters them in scientific work. In this circumstance, like in the circumstance outlined in Figure 2 of this work, these ordinary forces of scientific circumstance are almost certainly more likely the case than direct censorship or misconduct. It is unfortunate and alarming that this work as it currently stands baselessly contributes to an environment where many readers assume the latter depending on the country of origin of the original authors.

      Minor point:

      At the bottom of page 5: "But although there are unusual aspects of RatG13's primary sequencing data". This statement is vague, the language is unscientific, and many interpretations of it would not be supported by the citations provided. I would recommend that Bloom change this sentence to present a coherent scientific statement if indeed there is a point about genome quality to be made there that is relevant to the text.

    1. On 2020-04-13 21:24:43, user Mutasem Taha wrote:

      The supplementary table of this preprint raises doubts about the findings, for example, Enoxacin is given a high index of 1.18 while its very close analogue Norfloxacin is considered inactive and given index of 0.07. Similarly, Prothionamide is considered potent with an index of 1.14 while its close analogue ethionamide is totally inactive according to the study with activity index of 0.12. <br /> The preprint considered Nitazoxanide to be inactive (index 0.6) although it is currently in clinical trials as treatment for COVID-19 (https://clinicaltrials.gov/... "https://clinicaltrials.gov/ct2/show/NCT04341493)") and was reported to have anti SARS-Cov-2 IC50 of 2.1 microM in a peer reviewed paper (Cell Research (2020) 30:269–271; https://doi.org/10.1038/s41... "https://doi.org/10.1038/s41422-020-0282-0)").

    1. On 2024-12-06 17:54:14, user Malte Elson wrote:

      The remarks below are a summary of the points discussed during the Cake Club of the Psychology of Digitalisation lab at University of Bern ( https://www.dig.psy.unibe.ch/studies/cake_club_/index_eng.html ). They do not reflect the opinions of each individual journal club participant. Any responses to these points should be addressed to Malte Elson.

      In their preprint, Spiess et al. (2024) illustrate the impact of influential data points on statistical significance in linear regression analyses. The authors reanalyzed data from three high-impact journals by searching for the term "linear regression” and digitizing graphs of the included papers (due to the absence of raw data). Their findings revealed that excluding influential data points often rendered previously significant results non-significant. The simulations included in the study largely confirmed expected outcomes, supporting the overall argument for incorporating leave-one-out analyses in data analyses practices. The authors ultimately advocate for broader adoption of such methods to enhance the robustness of statistical conclusions.

      We found the paper to be interesting and an illustrative contribution to statistical education, both in terms of the potential fragility of published claims and as an illustration of an intuitive but underused outlier detection method. We identified points that might allow the authors to strengthen future versions of the manuscript, including some critical points about potential weaknesses or absences in the current version of the manuscript.

      1) TERMINOLOGY CONFUSION AND REPORTING ISSUES<br /> * Graphs vs. Papers: There is some confusion regarding the unit of analyses, and probably some reporting errors: On p. 4, l. 115, the paper states that the sample was 24 + 30 + 46 = 100 graphs, whereas on p. 6, l. 170 the authors state they examined 100 publications (going by Table 1, this is a simple clerical error, and should say graphs).

      * Similarly, the description of the columns in Table 1 (p. 11) is confusing, and we think has at least one reporting error:

      * It is unclear what “Hits” represent: Are these unique papers, or do the search engines of Science/Nature/PNAS return the same paper multiple times for each instance of the search term (“linear regression”)?

      * What does "number of graphs that were not shown" mean? We think these are instances of linear regressions that simply were not reported with a corresponding graph in the original publication, but they could also be graphs missing, inaccessible, or excluded <br /> * The “Articles” column is described as “number of Articles in which the analyzable graphs were found” (p. 11, l. 314), but we think these are the 21 articles in which the 29 “influential variables” were found. The number of articles with analyzable graphs is not reported. It thus remains unclear how many papers were included, and how many graphs were analyzed from each paper.

      * On p. 6, the authors report having identified 29 graphs in 21 papers in which the removal of one datapoint changes the result of a linear regression (see also Figure 1). On p. 6, l. 179 the “incidence” (should be prevalence instead) of changes in papers is reported as ~20%. However, this puts papers (21) in the numerator and graphs in the denominator (100), which underestimates the prevalence. On the graph-level, it should be 29/100 = 29%. The paper-level prevalence cannot be calculated because the authors do not report the number of papers with analyzable graphs (see above).

      * We strongly recommend reporting a Prisma flowchart to clarify the inclusion/exclusion of graphs and papers. In the same vein, the paper lacks basic information about the included studies, such as sample sizes or the distribution of p-values. Other information would also help emphasizing the importance of the present study, e.g. citation metrics.

      * The authors refer to “Supplementary Data 1” (p. 4, l. 121) but provide no link.

      2) SAMPLING STRATEGY <br /> * The study focuses on digitizable graphs without overlapping data points, inherently excluding studies with (1) larger samples and (2) homogeneous effects, where overlapping data points should be more frequent. This selection skews the included papers towards studies with smaller samples and p-values near 0.05 (due to lower power and publication bias / p-hacking), which are more susceptible to the illustrated effects. This is not a problem per se, but means the findings (including the prevalence rate) are about a narrower population of studies. Either way, the selection effects should be discussed in the paper.

      * It is not fully clear how it was decided which graphs are analyzable and which are not. Moreover, on p. 4, l. 127-130 the authors state that the obtained regression parameters match those reported in the paper closely, but they do not further explain what exactly this means, or what happened when they did not match

      3) ANALYSES AND CONCLUSIONS <br /> * The analysis does not account for dependencies when multiple graphs from the same paper, which will likely be based on the same data (which are then susceptible to the exclusion effects), are included.

      * In a way, the susceptibility of findings to the removal of a single data point is a restatement of issues related to small samples. Small samples are inherently more fragile, and larger sample sizes are more robust to the influence of removing (or adding) single data points and render p-values (and other estimates) more stable. This is not to say that the findings reported are not interesting; however, we were wondering whether a table of all included studies sorted by observed p-value and sample size would have flagged the same fragile papers. This is also not to say that dfstat is redundant, and we absolutely see the pedagogical value in being able to point at individual data points that “cause” a finding to be significant. Rather, we would be interested to what extent dfstat converges with common heuristics.

      * Relatedly, the authors decry that influence measures such as dfstat are largely ignored, even by statisticians (p. 4, l. 139). This may well be, but of course, statisticians (and non-statisticians) are obviously aware of issues related to low power and small samples, and one of these issues is the problem of spurious findings (e.g. due to few, extreme data points).

      * The authors largely blame frequentist statistics, particularly on p. 10, where e.g. they state that “[a]s long as stating significance or not is still based on the ubiquitous ? = 0.05 threshold, these statements can be sensitive to the presence of a single data point.” (l. 282-284). However, it is unclear how this follows from their findings. Any inference (not just ? = 0.05) could be susceptible to the influence of single data points when the estimate is close to the criterion. Moreover, particularly when the sample size is low, any metric’s value (e.g. point estimates) will vary as a function of the removal of individual data points, regardless of whether the inference is threshold-based or not. This is simply a property of statistical models fit to a limited amount of data. So again, the issue seems to be with small sample sizes.

      4) RECOMMENDATIONS AND FUTURE DIRECTIONS<br /> Things we would have liked to see:

      * Additional analyses, such as leave-two-out or leave-k-out methods. The leave-one-out analyses are providing a good intuition of how fragile some small-sample study results are. Additional leave-k-out analyses would provide further information about the fragility of the entire sample.

      * So far, the authors are concerned with the fragility of results as an outcome of removing data points. An additional study exploring the reverse scenario would be valuable. Specifically, it could investigate how extreme an additional data point would need to be to alter results, and how adding non-extreme data points could mitigate the relative weight of extreme data points.

      * Discussing dfstat as a robustness metric (“How many individual data points would have to be removed/added to render a significant result nonsignificant or vice versa”)

      * A discussion of how dfstat could be used for p-hacking by showing researchers which data points they would have to remove to turn a nonsignificant study result into a significant one.

      * The authors graciously and immediately shared data and code with one of us who requested it, and we thank them for this. We would like to see this data and code provided in a public repository and linked to in a future version of the manuscript.

      * We note that the authors chose to anonymise their data so that the reader cannot tell which original study’s results are robust or not. Personally, we think that meta-scientific interests are best served by making this information public; that is, we would like this data to not merely be used to illustrate the method but also inform the reader about the fragility or robustness of those publications’ results. Of course, not everyone agrees with this practice - perhaps the authors could comment on their perspective on this issue in a future version of the manuscript.

    1. On 2022-03-15 10:42:31, user sagar khare wrote:

      Thank you, Roberto and James for taking the time to engage with our work and for these helpful comments! We are revising the manuscript in the light of these comments (among others). We will post a detailed response to your comments in time.

      Re. why we used AF2 when exptl structures are available: we generated these models in early December and posted this preprint on Dec 13 before any experimental data were available. When the first structures came out in early Jan, we performed a comparison with our models and submitted to journal in mid-January. (what can I say, that's the pace of scientific publishing :))

    1. On 2016-08-20 15:44:28, user Raquel Tobes wrote:

      It is good to know new things about bacterial intergenic regions!<br /> Escherichia coli, Salmonella enterica and Klebsiella pneumoniae are plenty of REP (Repetitive Extragenic Palindromes) in their intergenic regions. Staphylococcus aureus and Streptococcus pneumoniae have many intergenic tandem repeats and Mycobacterium tuberculosis has REP-like intergenic sequences and some tandem repeats. <br /> Even considering that probably the short reads assemblies in many cases collapse the regions with repeats, it would be interesting to find some analysis about repeats in your study.

    1. On 2018-06-06 10:27:19, user Balaji wrote:

      I am posting the following jointly with Jaydeepsinh Rathod.

      It is clear that the people of the Turan were a mix of people from Iran and India. From Iran came Anatolian agriculturalist ancestry and from India AASI as well as ANE. You have found that BMAC people had 60% Iranian agriculturalist-related ancestry, 20% Anatolian<br /> agriculturalist-related ancestry and 13% ANE-related ancestry and 6% AASI. Suppose from Iran came people with 60% Iranian agriculturalist-related ancestry and 40% Anatolian agriculturalist-related ancestry and from India came people with 62% Iranian agriculturalist-related ancestry, 26% ANE-related ancestry and 12% AASI and suppose they mixed in equal proportions, the resulting population would have the ancestry that was found in BMAC.

      There is additional evidence that ANE found in Turan may have come from India and not from Kelteminar hunter-gathers or from West Siberian hunter-gathers. Figure S3.10 shows that other than the three Indus_Periphery outliers (Gonur2_BA, Shahr_I_Sokhta_BA2 and Shahr_I_Sokhta_BA3), it is Sarazm_EN and Gonur1_BA_o which share most alleles with Birhor. The high allele sharing of Indus_Periphery with Birhor is explainable by their high AASI. But Sarazm and Gonur1_BA_o have no AASI. Their high allele sharing with Birhor must be due to the West Eurasian alleles of Birhor being similar to those in Sarazm_EN and Gonur1_BA_o. Sarazm_EN has been modeled as 75% Ganj_Dareh_N, 2% Anatolia_N and 23% West_Siberia_N and Gonur1_BA_o as 61% Ganj_Dareh_N and 39% West_Siberia_N. In spite of the mainstream BMAC people having about 6% AASI and Sarazm_EN and Gonur1_BA_o having 0% AASI, the latter share more alleles with Birhor. This must be because the mainstream BMAC people have half their Iranian agriculturalist-related ancestry from Iran and only half their Iranian agriculturalist-related ancestry from India. The “Iranian agriculturalist-related” ancestry in Iran and India must be different as suggested by the Metspalu et al. 2004 finding that “mtDNA haplogroups which are shared between Indian and Iranian populations exhibit coalescence ages corresponding to the early upper paleolithic” and that they are present in India "largely as Indian-specific sub-lineages". Therefore Sarazm_EN and Gonur1_BA_o are likely to have been emigrants from the Indus civilization – they could not have come from Iran since they have almost no Anatolian agriculturalist-related ancestry.

      If the Indus population already had 26% or more ANE instead of the 10% ANE based on your assumption that Indus_Periphery was typical of the Indus population, then there is no<br /> room for Steppe_MLBA to bring in additional ANE and WHG ancestry.

    2. On 2018-04-02 05:54:08, user Arun wrote:

      "...Using admixture linkage disequilibrium, we estimate a date of 107 ± 11 generations ago for Iranian agriculturalist and AASI-related admixture in the Palliyar, corresponding to a 95% confidence interval of 1700-400 BCE assuming 28 years per generation."

      118 generations at 28 years per generation gets us to 3300 years before present, i.e., 1300 BCE, not 1700 BCE. This would kill the idea that the Palliyar admixture with Iranian agriculturalists originated with the collapse of the Indus Valley Civilization.

      Moreover, I would take P. Moorjani's establishment of 26-30 years per human generation since the Neanderthals with a big pinch of salt. The !Kung are measured to have a generation time of 25.5 years. This would further kill this idea about Iranian agriculturalist - AASI admixture. That extra 2.5 years is significant.

    1. On 2025-02-19 09:08:52, user Gennady Gorin wrote:

      This is a novel and interesting consideration, though I believe less fundamental than 1. the definition of "neighbor" is somewhat arbitrary because the metric is arbitrary and 2. the definition of "neighbor" is ill-specified because the uncertainty in (imperfect) data is not properly propagated.

    1. On 2018-11-29 09:06:32, user Conrad Mullineaux wrote:

      Speculative hypothesis papers can be fun and good for stimulating debate. But, to be useful, I think they need to present a plausible and coherent scenario (something that at least has a chance of being true) and they need to pay reasonable attention to the facts. I’m not sure that’s the case here. My main concerns are:<br /> 1. Fig. 3. The feedback loop looks neat, but it ignores the fact that the local [O2] around the nitrogenase need not correlate to any significant extent with the global atmospheric [O2]. Huge discrepancies could occur, due to local environmental conditions, and also due to the metabolic activity of the cell itself. Considering only the latter factor, the intracellular [O2] could be much higher than ambient (due to PSII activity) or much lower than ambient (due to respiration). If the nitrogenase doesn’t actually see the global atmospheric [O2], such a feedback loop could not clamp global [O2] at any particular level as proposed.<br /> 2. P.5 “If diazotrophic cyanobacteria are grown under conditions where they have sufficient CO2 and light, and with N2 as the sole N source, then they grow and accumulate no more than 2% oxygen in their culture atmosphere (16). The 2% O2 remains constant during prolonged culture growth because this is the O2 partial pressure beyond which nitrogenase activity becomes inhibited. With greater O2, nitrogenase is inactivated and there is no fixed N to support further biomass accumulation. With less O2, nitrogenase outpaces CO2 fixation until the latter catches up, returning O2 to 2% in the culture.” The outcome of this experiment will come as a surprise to anyone who has observed diazotrophic cyanobacteria happily growing without a combined nitrogen source at 21% ambient O2 (it depends on the cyanobacterium, of course). The result is a key plank of the authors’ argument, but it’s not clear if, when or how the experiment has been carried out. It’s not as straightforward as it seems, and nothing like that is described in the cited reference (16: Berman-Frank et al 2003). The nearest thing in that paper is a statement that a specific cyanobacterium, Plectonema boryanum, is unable to fix nitrogen above certain ambient [O2] levels. The limits are actually rather lower that the 2% quoted: 0.5% in the light and 1.5% in the dark (16). Plectonema is a specialist for microaerobic environments, and most other diazotrophic cyanobacteria are not so susceptible to O2 inhibition. <br /> 3. P.5 “Cyanobacteria have evolved mechanisms to avoid nitrogenase inhibition by oxygen, including N2 fixation in the dark, heterocysts or filament bundles as in Trichodesmium. Critics might counter that any one of those mechanisms could have bypassed O2 feedback inhibition.” Indeed they might. The authors go on to brush aside their imaginary critic on 3 grounds, none of which seem valid. “First, evolution operates without foresight”. Foresight isn’t needed: there would have been an immediate selective advantage to acquiring an O2 protection mechanism. “Second, the mechanisms that cyanobacteria use to deal with modern O2 levels appear to have arisen independently in diverse phylogenetic lineages, not at the base of cyanobacterial evolution when water oxidation had just been discovered”. Very likely so, but what about the next 2 billion years? “Third, the oldest uncontroversial fossil heterocysts trace to land ecosystems of the Rhynie chert”. It may or may not be the case that heterocysts evolved late, but, in any case, heterocysts are not significant contributors to marine nitrogen fixation: in extant cyanobacteria it’s the other protection mechanisms that allow cyanobacteria to make a huge contribution to oceanic nitrogen fixation even in the presence of 21% atmospheric O2. What about those other mechanisms? The fact that different lineages of cyanobacteria have independently come up with at least 3 different ways to protect their nitrogenase from O2 indicates that evolving such mechanisms is not really such a big deal. The authors’ scenario suggests that for a period approaching 2 billion years there was a nitrogen-limited biosphere with cyanobacterial nitrogenase operating right up against an inhibitory concentration of O2. There would have been intensive selective pressure for adaptations to protect the nitrogenase from oxygen. The scenario depends on the assumption that no cyanobacterium was able to develop a protection mechanism that would allow nitrogen fixation at >2% O2, despite selective pressure operating over a period of about 2 billion years and the availability of multiple solutions to the problem, as seen in extant cyanobacteria. I’m afraid that’s implausible, and I suggest that we need to look elsewhere for an explanation of the low O2 level through the Proterozoic.

    1. On 2020-07-08 13:37:25, user Anders Vahlne wrote:

      Some comments about the reactivity of the blood donors from 2019 (BD2019) and 2020 (BD2020):

      The results are shown in Figures 3A and 4B, F and G showing interferon-gamma-producing cells after stimulation with SARS-CoV-2 peptides.

      In Figur 3A one can see that six BD2019 react with S (spike) and four with M (membrane) but none with N (nukleocapsid). Tthe cut-off may have been chosen high enough to get a desired outcome. In the histogram to the right the authors decide that in order to be T-cell reactive the cells have to react with N plus S or M. There is no explanation why. One might suspect that the reason is to have all BD2019 negative and still have BD2020 positive.

      In Figur 4B they depict those that are CD4+ and CD8+. Now two CD8+ BD2019 are positive also with N.

      In Figur 4F they depict those those that are antobody positive and antibody negative. The BD2019 were not tested and thus omitted in Fig 4F.

      In Figur 4G there are two histograms. The one to the right summarizes T-cell positives in respective patient group. Again, the authors decide that in order to be T-cell reactive the cells have to react with N plus S or M so they get zero reactivity with the BD2019s.

    1. On 2021-03-26 09:24:57, user Tijawi wrote:

      Fascinating stuff, need to read through again. Note that UBP9 is now class Xenobia. Also, I hate to be pedantic, but be sure to check that the names of your nereid-based taxa are correctly combined: Eudoromicrobium, Autonoimicrobium, and Amphithoimicrobium.

    1. On 2019-07-09 09:29:52, user Martin Steppan wrote:

      A very fascinating study and topic with interesting implications! If I understand the results and design correctly, the genetic instruments quantifying the risk for early menarche also grasp a considerable amount of socioeconomic information (by being associated with educational attainment). Would it not be straight forward to control outcome variables (like age at first birth) for socioeconomic differences on the observational level first, before conducting Mendelian Randomization? MR is powerful, but if you have real confounding variables, is there a rationale not use them?

    1. On 2020-12-02 15:40:43, user Ryan wrote:

      NE 598 Group 2<br /> Social isolation impairs the prefrontal-nucleus accumbens circuit subserving social recognition in mice. https://doi.org/10.1101/202...<br /> Ryan Senne, Evan Mackie, Patlapa Sompolpong, Anthony Khoudary

      Introduction

      We are a group of Boston University students enrolled in a course focused on understanding neural circuits, including cortical information processing, guided behavior and cognition. To further engage with current research in the field and to gain experience in the process of peer-review, we present the following critique of the currently unpublished manuscript from Park et al. posted on biorxiv.org on November 12, 2020.

      Summary <br /> The medial prefrontal cortex (mPFC) has been shown to activate in response to social behaviors in both humans and rodents. Recent studies have revealed a corticothalamic circuit affected by social isolation; however, whether social isolation affects mPFC projections to other subcortical regions involved in social behaviors remains unclear. To this end, Park et al. investigate the role of projections from the mPFC to the nucleus accumbens shell (NAcSh) in the social recognition deficit observed in mice following social isolation. Through retrograde viral tracing, electrophysiological, chemogenetic and behavioral experiments they identified a novel circuit projecting from the prefrontal infralimbic cortex (IL) to the NAcSh affected by early social isolation. They found IL neurons to have decreased excitability in single housed (SH) mice compared to normally group housed (GH) mice. NAcSh-projecting IL neurons were activated when the GH mice interacted with a familiar mouse, but this activation was not observed in SH mice. Furthermore, inhibition of IL neurons in GH mice impaired social recognition without affecting social interaction in GH mice. Similarly, activation of IL neurons rescued social recognition in SH mice. These findings corroborate the social recognition defects observed in models of ASD and schizophrenia, which may reflect problems in human patients. Overall we recommend comparison of results to data collected before the re-socialization period, non-parametric data analysis and improved IHC imaging. Additionally, we recommend consistency between figures in the manuscript and the extended data, alternative anxiety measurements and in vivo electrophysiology recordings. We believe these recommendations will strengthen the argument for the role of this novel circuit subserving social recognition.

      Figure 1 serves to establish the experimental timeline and demonstrate the social recognition deficit induced by social isolation. Mice were housed either singly or in groups for 8 weeks post weaning. SH mice were then regrouped for 4 weeks with their littermates. At the end of this 12-week period, experiments were conducted. Mice from both cohorts were subjected to three chamber tests assessing social preference and social recognition. Both GH and SH mice spent significantly more time with a novel mouse than an inanimate plastic mouse; indicating no change in social preference due to isolation (Fig. 1c). GH mice spent significantly more time with a novel mouse compared to a familiar one in the social recognition test. Constratingly, SH mice spent comparable time with both the novel and familiar mouse suggesting a deficit in social recognition (Fig. 1d). Both cohorts showed no significant deficits in general recognition memory or hippocampal dependent memory (Fig. 1e, f). SH and GH mice also showed similar body mass changes, basal locomotor activity and anxiety levels (Extended Data Fig. 1).

      The authors hypothesized that projections from mPFC to NAcSh may be involved in social recognition. To test this the authors injected a retrograde enhanced green fluorescent protein (eGFP) virus into the NAcSh. Neurons in the deep layer of the IL were heavily labeled with eGFP. There was a significant difference in the number of eGFP+ cells in the IL compared to the PL (Fig. 2b). This observation led the authors to focus their study on mPFC-IL projections. Ex vivo brain slice whole cell patch clamp recordings revealed a significant decrease in excitability of NAcSh-projecting mPFC IL neurons in SH mice compared to GH mice (Fig. 2c). This decrease in excitability was not observed in mPFC PL projections to NAcSh, suggesting cell specific modulation of this circuit by social isolation (Fig. 2d). Other electrophysiological properties of NAcSh projecting IL/PL neurons were similar in both GH and SH mice (Extended Data Fig. 4).

      The goal of the next experiment was to determine if IL-NAcSh projections were activated by familiar mice in a different behavioral paradigm. Mice from both cohorts were habituated to a target mouse (Fig 3a). Interestingly, both GH and SH mice spent significantly less time interacting with the target mouse on consecutive social habituation trials (Fig. 3b). In the social recognition test SH mice again spent comparable time interacting with both novel and familiar mice, indicating the social recognition deficit (Fig 3c). Post mortem slice histology was used to quantify the activity of IL-NAcSh projections in response to a familiar or novel mouse. A retrograde eGFP virus was injected into NAcSh in both GH and SH mice; eGFP+ cells co-labeled with c-Fos staining were used as a proxy for activation of this circuit (Fig. 3d, e). Quantification of this labeling revealed that GH mice had a significant increase in the ratio of c-Fos+/eGPF+ cells after interacting with a familiar mouse compared to a novel mouse (Fig. 3f). This increase in activity was not observed in SH mice, supporting the claim that this circuit is activated by a familiar conspecific.

      To confirm the findings in Fig. 3, the authors conducted chemogenetic experiments in normal GH mice. A retrograde eGFP-Cre virus was injected into the NAcSh and a Cre dependent hM4Di receptor virus or mCherry control vector was injected into the IL (Fig. 4a, b). Intraperitoneal injection with CNO confirmed the inhibitory effect of hM4Di (Fig, 4d). Mice were then subjected to the social preference and social recognition tests following CNO injection. Inhibition of IL-NAcSh projections did not affect social preference, but did result in a significant decrease in social recognition (Fig. 4e, f). To further investigate this effect, mice were subjected to the social recognition test with the choice between a cage mate (in place of a target mouse) and a novel mouse. When IL neurons were inhibited, mice were unable to distinguish their cage mate (Extended Data Fig. 5). These findings support the claim that activation of this IL-NAcSh circuit is necessary for social recognition.

      In an attempt to solidify this claim, further chemogenetic experiments were conducted in SH mice. The previously mentioned experimental approach was used; however, a Cre dependent hM3Dq or mCherry control vector injected into the IL (Fig. 4a, b, c). CNO injections confirmed the activation of IL neurons (Fig 4d). Activation of IL-NAcSh projections did not affect social preference but did rescue social recognition (Fig. 4e, f). These findings demonstrated that activation of this IL-NAcSh circuit is both necessary and sufficient for social recognition.

      Major Criticisms

      The authors claim that regrouping SH mice in the model is insufficient to rescue social recognition. White the first experiment showed that SH mice spent relatively similar time with both the novel and familiar mouse, suggesting a social recognition deficit, all behavior tests were done following resocialization of SH mice (Fig. 1d). Adding another SH cohort without resocialization prior to administering behavioral tests would be beneficial to determine whether there is a significant change between the performance of regrouped SH mice and non-regrouped SH mice.

      In the second experiment, the authors found the projections from the prelimbic cortex (PL) to the NAcSh to have less neuronal density when compared to IL-NAcSh projections, therefore decided to conduct subsequent experiments only looking at the IL (Fig. 2b). Relatively less dense neuronal density in the PL does not equate to low activity in the PL and is not sufficient to rule out the role of the PL in social behavior, especially because previous papers have found projections from the PL to contribute to social behavior. There was no information on how eGFP-positive cells in the IL and PL were quantified. The cell numbers in the IL and PL were compared using an unpaired t-test, however, the IL cells appear to have a normal distribution while the PL cells do not. Using a parametric test may therefore be inappropriate for comparing the two populations. In Figure 2, there was also minimal physiological data to confidently conclude that excitability in the IL of SH mice is significantly reduced (Fig. 2c). Incorporating in vivo data would be beneficial.

      In the third experiment, c-Fos immunohistochemistry was performed as a proxy of recent synaptic activity. The ratio of quantified c-Fos+ cells in the IL to GFP+ cells was used to prove that GH mice show a significant increase in c-Fos positive NAcSh-projecting IL neurons while encountering familiar conspecifics. The method behind quantifying the overlaps are unclear in the paper. The major issue with this approach is that separately quantifying c-Fos+ cells and comparing it to the quantified number of GFP+ cells is that there is a possibility that there are quantified neurons that are not co-labeled with c-Fos and GFP. A one-way ANOVA and Tukey's multiple comparisons test was used to analyze the data, however, all of the data does not appear to follow a normal distribution (Fig. 3f).

      Apart from data in Figures 2 and 3 that are not appropriate for parametric statistical tests, data from other figures such as Figure 1 exhibit a binomial distribution and also do not fit the criteria for parametric tests (Fig. 1c). The distribution of the data in all experiments should be taken into consideration when running analyses <br /> While the viral stain in Figure 2 appears to be non-nuclear, the stains in Figures 4 and 5 appear to be nuclear (Fig. 2a; Fig. 4b; Fig.5b). It would be more standard to use the same virus for labeling throughout the experiments. The figures state that a retrograde adeno-associated virus (AAVrg) expressing eGFP was used, but the expression patterns are not consistent with this.

      Minor Criticisms <br /> Many of the summary bar graphs in the figures have error bars that are obscured by the individual data points, specifically figures 3b and 3f, 4e and 4f, and 5e and 5f. Changing the color of the error bars would help with better visualization of the data and its distribution. Additionally, in figure 1b and all three chamber tests, it would be worth noting whether or not the tests were counterbalanced with the stimulus mice in different chambers. This would control for the SH mouse simply memorizing the location of a preferred stimulus rather than true social recognition or preference.

      In the first experiment, it would have been worth titrating the length of juvenile isolation in order to find the critical period where its effect is strongest. The referenced paper determined 8 weeks to be effective, but an experiment to prove that 8 weeks is ideal would have been beneficial to the study as a whole. Another useful tool would have been in-vivo electrophysiology to selectively measure activity in IL-NAcSh projecting neurons during socialization and confirm the results shown by the c-Fos immunohistochemistry. Optogenetics also could have been used to measure social preference or recognition during the inhibition of these IL-NAcSh projections.

      Merits <br /> The panels in Figure 1 are incredibly well made and very easily communicate the experiments and data. The heatplots used throughout the paper are incredibly parsimonious in their representation.The novel object and object place controls on the three-chamber test often get ignored so this experiment was very well controlled. <br /> The behavioral schedule in Figure 3 is incredibly erudite and can be recycled by other researchers for these types of experiments. <br /> One of the biggest mishaps in chemogenetic experiments is a lack of proper controls. The researchers were incredibly thorough in their DREADD’s experiemnts and included all the necessary control groups including CNO in WT mice and using a saline vehicle in a DREADD injected animal. This type of comprehensive experimental schedule ensures that the data has a considerable level of confidence attached to it.

      In the supplemental figures the authors chose to include several controls which are necessary for the confidence of their results. Their inclusions of anxiety controls, often overlooked electrophysiology metrics, object controls, and cagemate controls inspires confidence in the results. Overall, a very well controlled paper.

      Future Directions <br /> One of the most important future experiments could involve dissecting between cell subtypes within the IL. A recent paper has shown that somatostatin interneurons house social memory within the PL and such cells could be necessary and sufficient for proper memory expression. The authors coils also determine the receptor subtypes the pyramidal neurons they focused on contained. For example, a recent article showed that Pl neurons which projected to the NAc shell were D1R+ and it would be interesting to see if similar neurotransmitter systems were prevalent in both mPFC areas.

      With respect to Figure 2, outside of the IL, the PL, and vCA1 have also been shown to be necessary for the expression of proper social cognition and behavior. These other areas have been shown to project tohe NAc shell. A follow up study that highlighted the unique contributions of these distinct areas and how possible neural circuitry links them together would be a valuable funding for the social neuroscience community. The electrophysiology in this figure is solid from a technical standpoint but whether this difference in excitability translates to meaningful behavioral phenotypes isn’t characterized. To this end, in vivo physiology during epochs of social interaction may more aptly furnish the narrative that Il cells are preferentially affected by social isolation.

      With respect to Figure 3, one of the most crucial aspects of this paper is that socially isolated mice have functioning social recognition on a short time scale as shown in 3A and 3B. The authors supply two reasonable hypotheses that this then could be a deficit in consolidation or retrieval memory mechanisms. This would be a crucial discovery for the field of social and memory neuroscience. One possible set of experiments the authors could pursue in a future paper would be to use the TRAP2 or tet-tag viral system to tag cells active at the encoding of a social epoch with ChR2 and eYFP within vCA1 or the PL, two areas shown to be important for the social engram. The next day the researchers could perform a 90-minute transcardial perfusion and quantify overlap. If there is an above chance overlap between the “tagged” cells and endogenous c-Fos this would rule out consolidation as the faulty mechanism. In this hypothetical scenario the researchers could then use a subsequent cohort to see if chronic activation of this memory ensemble could be enough to rescue the behavior if it were a failure of retrieval.

      Works Cited

      1.)Yamamuro K, et al. A prefrontal–paraventricular thalamus circuit requires juvenile social experience to regulate adult sociability in mice. Nature Neuroscience, (2020).<br /> 2.)Murugan M, et al. (2017) Combined Social and Spatial Coding in a Descending Projection from the Prefrontal Cortex. Cell 171(7), 1663-1677.<br /> 3.)Cummings K. and Clem R. (2020) Prefrontal somatostatin interneurons encode fear memory. Nature Neuroscience 23(1):61-74<br /> 4.) Xing B. et al (2020) A subpopulation of Prefrontal Cortical Neurons Is Required for Social Memory. Biological Psychiatry in press.<br /> 5.) Okuyama T et al. (2016) Ventral CA1 neurons store social memory. Science. 129:17-23.

    1. On 2018-11-30 13:12:24, user theempiricalmage wrote:

      And of course, lack of Iranian samples from the paleolithic. "Caucasus", nowadays, has become the choice euphemism for Iranian. Note the lack of 25kybp R1a diversification in the Caucasus compared to Iran (likely origin). Difficult to reconcile that with this study. The Iranian anthropological record is impeccable, and it has long been established that the plateau was well populated by that time.

    1. On 2016-11-09 02:34:34, user James Hane wrote:

      An interesting read, but excluding scaffold N50s and using only contig N50 in assembly comparison tables versus the other paired-end/mate-paired assemblies is misleading. Adding scaffold N50 and total unknown(N) bp to those tables would address this.

      Fleshing out a discussion on how the nanopore platform resolves heterozygous regions would also be appropriate.

    1. On 2019-02-05 10:21:11, user Sascha Rösner wrote:

      https://uploads.disquscdn.c...

      The results of this study are very interesting. Especially as we are dealing here with a long lived habitat specialist with very limited dispersal abilities. The ecological or conservation consequences might be crucial in the case of e.g. increased habitat fragmentation. In a study, where similar questions were asked, we found somewhat similar result in terms of relationship with significant closer relationship of males up to 5km I guess.

      https://www.researchgate.ne...

    1. On 2020-02-07 11:42:41, user mlhnrca wrote:

      "Among acylcarnitines, acetylcarnitine (C2) and the hexanoylcarnitine (C6-DC/C8-OH) showed the greatest absolute differences in baseline values between cases and controls (7.54 µM versus 9.92 µM (p=0.54) and 0.12 µM versus 0.08 218 µM, respectively)."

      Those p-values are not significantly different, so it's misleading to state that there were concentration differences.

    1. On 2021-06-01 06:06:21, user Prof. T. K. Wood wrote:

      Unfortunately, much literature is missing from this manuscript:

      Manuscript should cite the first persister proteome study; one that even used a similar technique, overproducing toxin YafQ (doi:10.1111/1462-2920.12567).

      Manuscript should cite the first single cell work showing the importance of ribosomes and heterogeneity in persistence resuscitation (doi:10.1111/1462-2920.14093).

      Manuscript should cite the first resuscitation study with single cells which showed the importance of HflX (https://doi.org/10.1016/j.i... "https://doi.org/10.1016/j.isci.2019.100792)").

      Manuscript should cite the persister formation work with single cells showing the importance of RaiA (https://doi.org/10.1016/j.b... "https://doi.org/10.1016/j.bbrc.2020.01.102)").

    1. On 2020-10-02 18:57:09, user ravi chandra wrote:

      Is there a control for this experiment? tried to look for SARS COV2 sequences in a healthy and COVID negative individuals? What is the confirmation that the amplified rna sequences originated from SARScov2?<br /> Which was the first SARScov2 virus sample or gold standard that was considered in this study ? Any reference cited as such ?

    1. On 2018-05-14 14:23:32, user Eduardo J. Villablanca wrote:

      Intriguing! Interesting Figure 4F, in which ileum and C1 draining lymph nodes are removed and 7B8 cells proliferate in all LN (e.g. D1, D2, etc). Wondering if the SFB is spread through out the GI tract or ileal DC now can migrate to D1?

    1. On 2018-02-26 16:01:24, user Helder Maiato wrote:

      I must apologize to Dr. Paola Vagnarelli for not mentioning in my previous discussion her recent paper (De Castro et al., Oncotarget, 2018) where, in addition to data supporting a role for Aurora B in the regulation of proper nuclear envelope reformation on lagging chromosomes in human cells, they show that “core” nuclear envelope components (Lamins A/C) are uniformly recruited to lagging chromosomes.

    1. On 2019-10-22 22:09:58, user DKF wrote:

      Great to see anything re genetics in relation to France - considering the present official attitude towards DNA testing ("recreational" or otherwise). None the less, Y chromosome male line and mtDNA female line uniparental markers are the most informative for understanding the origins of regional groups - when combined with data from history and archaeology. Apparently none presented here. Perhaps in a subsequent publication? In addition, until there are ancient DNA studies of key French sites (e.g., LaTene Celtic) we are flying blind in many respects since migration for example during the Industrial Age will have had a strong impact on the population of today. We need to know "what is under our feet". Why is it that neighboring countries are flooding the literature with immensely informative ancient DNA studies? We need to integrate this data with similar work from France before we can make conclusions about how history and prehistory have affected the population of France today. More broadly, there is an expanding body of knowledge from Spain, Italy and Germany concerning for example Bronze Age Bell Beaker sites. Those from France would help to tie things together coherently so that we can provide an accurate story of Europe through the ages.

    1. On 2018-02-23 10:15:21, user Ferran Aragon wrote:

      Dear authors,<br /> Congratulations for these very interesting results. We are struggling to get good yields for long ssDNA (3-8Kb). I don`t understand very well how the yielding result indicated in the text when doing aPCR with AccuStart HiFI (695+/-35 ng in 50 ul, thus 14ng/ul aprox) fits with the purification results shown in Fig.S7 where the most efficient purification method gave a yield of about 250ng/ul. Could you clarify this? Thanks a lot!

    1. On 2018-02-28 18:19:51, user Leslie Vosshall wrote:

      The Vosshall Lab discussed this pre-print at our journal club on 2/28/2018. We agreed that it was a very exciting series of observations.

      The following discussion points came up:<br /> 1. The experiments refer to control as 100% humidity, but was ambient relative humidity measured directly? It is very difficult to get a room to 100% humidity!<br /> 2. Figure 1A, how does the time on the X axis relate to the entrained circadian cycle of these animals? Is the big peak at 40 hr modulated by circadian time or is it an absolute peak dependent only on dehydration?<br /> 3. Figure 1A, what are the dehydration levels of the animals in this experiment? How tight is the correlation of dehydration to that activity peak? We were interested in comparing the curve in Figure 1C to the activity data in Figure 1A.<br /> 4. Figure 1C, is this behavior specific to blood-feeding? Or if you gave the animals the option of drinking water or sucrose would you see the same increase in feeding correlated to dehydration?<br /> 5. Figure 1C, what happens if you repeat this experiment with gravid females who previously blood-fed, who have a lower drive to blood-feed? i.e. can you disentangle the drive for blood from the drive to rehydrate from any source, including blood?<br /> 6. Figure 3D-E: it would have been good to include the GFP dsRNA control that was used in the sugar measurements in Figure B-C.<br /> 7. Figure 4A refers to n=8, but there are many more than 8 data points. Is this a typo in the legend?<br /> 8. Figure 4A, can the authors clarify what “wet conditions” and “dry conditions” refer to? Did they measure relative humidity?<br /> 9. Figure 4B, the X-axis is very confusing. Do you mean 1.94 cases/10,000? Or do you mean 19,400 cases? Or is 10[4] a typo given that there are closer to 2000 WNV cases per year – did you mean 10[3] as a multiplier? Also starting the axis at a non-zero value is misleading.<br /> 10. Do male mosquitoes have some of these phenotypes? It would be good to see how much of this is a general effect of dehydration vs a specific drive of females to seek blood sources<br /> 11. Given how low the blood-feeding rate on the Hemotek devices was, we wondered why the investigators did not use live hosts, especially in Figure 4.

    1. On 2023-05-12 17:40:58, user Maurice Franssen wrote:

      Fascinating research and results. One comment: the moth shown in Supplementary Figure 1i is not Noctua pronuba but its sister species Noctua fimbriata. I do not think that species will behave differently but the authors better show a specimen of N. pronuba.<br /> Maurice Franssen, amateur entomologist, Wageningen, the Netherlands

    1. On 2020-02-05 11:14:40, user Pei-Hui Wang wrote:

      Forward to you friends who needs. Prof. Wang Pei-Hui Lab in Shandong University, China, all orf clones of 2019-nCoV / SARS in pcDNA6B-FLAG can be freely requested, please contact him. Email: wphlab@163.com covering the following orfs: nsp1-16, S, 3, E, M, orf6, orf7a/b, orf8, orf9, orf14, N; and ACE2

    1. On 2016-06-24 16:37:11, user Peter wrote:

      We appreciate all the comments and will address them individually in due time.

      One comment was that we are measuring stable gene transcripts rather than the up regulation of genes. As stated in lines 905 to 925, one would not expect a gene transcript to be stable at one time, unstable at another time and then stable later on (see Figure S3). We see this pattern in many of the transcriptional profiles. This is difficult to explain by the 'stable gene transcript' idea. Moreover, many of the gene transcriptional profiles show putative feedback loops. This is also difficult to explain by the 'stable gene transcript' idea. It cannot be attributed to 'noise' because the behavior of each probe is calibrated and 'noisy probes' are not used.

      We are not the first to report the postmortem up regulation of genes as pointed out on lines 927 to 947 (using a different technology) and the other paper mentioned by mit_opinion below. The difference in our study from the others is that we measured the precise abundances of hundreds of genes through post-mortem time.

      Lastly, one comment stated that we were measuring absolute transcript abundances. We never said anything about absolute concentrations. Our measurements are always relative to the live control.

    1. On 2022-03-21 23:11:00, user Soso wrote:

      This paper has so much information and all the data looks great. However, I do have a few questions and suggestions on the methods and materials section of your paper.

      First, fecal sample collection was mentioned but not the process of how the samples were collected. Was the feces collected after it passed from the monkey or was there some process to collect upper GI samples? The difference in collection methods may affect the presence of certain bacterial species, such as anaerobic species.

      Second, I did not see internal controls for DNA extraction, PCR, or the sequencing process. Were there any internal controls used to verify the results of each step? So, DNA extraction for example, known bacterial species grown in the lab can be included at specific concentrations and used to calculate how much DNA should be extracted from this known species, this way we can check to make sure the DNA extraction process worked as expected.

      Third, were there any steps taken to minimize PCR errors (such as ambiguous bases, etc.) and chimeras? It would be beneficial to mention these steps and what programs were used to minimize them in the materials and methods section, for example ChimeraSlayer is often used for removing chimeras. Also, it was mentioned that analysis was performed on samples after rarefaction to 10000 sequences/sample, but how many sequences/sample were there before this rarefaction?

      Last, in the Microbiota analysis section for 16S amplification, the link for the earth microbiome project does not seem to work for me. I apologize if this is an error on my end, but it says the page cannot be reached. Is there another way I can find this information? It may be beneficial to also include PCR steps and cycles you followed in the methods and materials section so that if anything ever happens to the link and it no longer works, you still have the steps you followed in the paper.

      Overall, I really enjoyed your paper. I hope you find these questions and comments helpful!

      SHSU5394

    1. On 2019-12-30 06:58:01, user Koki Tsuyuzaki wrote:

      I'm a developer of scTensor.<br /> Thanks for adding scTensor in your experiment,<br /> but there are many misleading or wrong parts as below,

      so please consider the modification.

      1. LRBase

      In Figure 1C and the caption, you use "the database of scTensor", but please use "LRBase" (the name of our L-R database) instead.

      Please note that the scTensor is the name of algorithm or the software package, and not L-R database.

      Actually, scTensor can be used with any L-R database.<br /> https://rdrr.io/bioc/LRBase...

      I think you should separately consider the effect of selection of L-R database and CCI detection algorithm based on the database.

      2. Which is the largest LR database?

      In the INTRODUCTION, you said

      to the best of our knowledge, it is the largest database of this kind.

      but in the Figure 1C, LRdb seems not so large.

      3. Blank elements in Table 1

      There are many blank elements in Table 1 scTensor column.

      If you will add our method in your experiment, you should investigate it more.<br /> I still do not understand your intention of

      complete pipeline<br /> the other items are as follow:

      Accept preprocessed data Y

      Export types<br /> tables Y<br /> circualar plots N<br /> graphML or equiv. N

      Perhaps the complete pipeline means the built-in call type calling using t-SNE, k-means, or SIMLR but I don't think such name is appropriate, any tool can perform such task by combinined with the other tools.

      Besides, if the user want to use other dimensional reduction, clustering, and cell type identification methods not included in your tool, your tool can be used with them?

      4. UniProtKB/Swissprot

      In the Comparison with other tools, you said

      Swissprot annotations (secreted/membrane) automatically

      but UniProt/Swissprot is based on the manual annotation.<br /> https://www.uniprot.org

      We also used UniProt/TrEMBL and this database is based on the prediction by cellular localization algorithms.<br /> So please explain more accurately.

      5. No registraction of LRdb in Bioconductor

      In the AVAILABILITY, you said

      the LRdb package is submitted to Bioconductor

      but I couldn't confirm the submitted R/Bioconductor package,

      although I could confirm that a TSV file is put on the GitHub.

      https://github.com/Biocondu...<br /> https://www.bioconductor.or...<br /> https://github.com/SCA-IRCM...

      Where was the package published?

      6. Benchmark design

      In the manuscript, you compared your method with other methods,

      but the benchmark is not well designed; each method use different L-R databases and different algorithms, so even if your tool showed the good performance, the reader cannot understand why the tool was good.

      Again, you should separately consider the effect of selection of L-R database and CCI detection algorithm based on the database.

      7. Criticism against scTensor

      As the criticism against scTensor, you said

      We found 2 reliable LR pairs whereas scTensor returned 14 pairs, none in common<br /> or<br /> we found significant discrepancies with PyMINEr and scTensor

      but there are no detail explanation or no quantitative evaluation,<br /> and these parts are just your impression.

      Acutually, some L-R pairs detected by scTensor (Supplementary Table 3) are still curated and not "none in common".<br /> https://string-db.org/cgi/n...

      Besides, as you said, LRBase includes many purative (not known) L-R pairs,<br /> you cannot simply say which L-R pair is correct or not.

    1. On 2019-01-11 10:59:25, user PTRRupprecht wrote:

      This is a really useful resource!

      Something I did not fully understand: "To allow easier and faster access to the exposed brain, all pipettes are positioned on one side of the preparation." Also from Fig. 1B, I cannot see immediately why one should not position the pipettes one all sides of the craniotomy. What am I overlooking?

      And it would be really cool to have (in the supplementary material) a video which shows both the 2P image and the oscilloscope (or something similar) during the procedure.

      Kind regards,<br /> Peter

    1. On 2016-08-31 11:54:15, user Aleksey Belikov wrote:

      All attempts of field normalization for citation indices are basically useless, because this is a non-issue. Nobody in his right mind would compare a biologist with a mathematician based on a citation index, and then give preference to the one who has a higher index. Indices are used for hiring and promotion for a particular open position. If this position is for a mathematician, would anybody hire a biologist, even if his citation indices are 20 times higher?

    1. On 2023-07-14 23:30:00, user Zach Hensel wrote:

      The revised manuscript overlooks the dispositive analysis first suggested to the authors, to my knowledge, in the first week of September 2022. The manuscript’s hypothesis of an endonuclease “fingerprint” of a synthetic origin in the SARS2 genome makes a testable claim: if regions around the sites composing the “fingerprint” are sampled in nature, engineered nucleotides will stick out like a sore thumb.

      Authors were told about this test in the first week of September 2022 when people independently noted the recombinant evolutionary history and that almost all elements in the “fingerprint” are sampled in a handful of the most closely related genomes. Others rephrased essentially the same test, with Francois Balloux commenting to Alex Washburne on September 5, 2022:

      Assuming we wished to follow up on this, the next step would be to test if high homology can be found to different Sarbecoviruses for (some of) the 6 fragments defined by the restrictions site (ie. there's no reason to expect natural breakpoints to match restriction sites).

      This step was not taken. And it was not a difficult step. Shortly after the manuscript’s publication, Crits-Cristoph and colleagues rigorously showed that the hypothesis fails this test: https://github.com/alexcritschristoph/ancestral_reconstruction_endonucleases – the conclusion is noteworthy considering the public record, which demonstrates bias in site selection and post hoc selection of statistical tests. In fact, this manuscript’s hypothesis gained attention only after Justin Kinney, who is acknowledged for his assistance on the manuscripted, prompted the discussion by suggesting a different hypothesis about a different restriction endonuclease, BsaXI.

      In the comments section of V1 of this manuscript, Alex Washburne proposed a second test of his hypothesis, claiming that “the rapid loss of this pattern is indicative of its evolutionary instability, suggesting what we observe in the SARS-CoV-2 ancestral state is not a stable pattern resulting from recombination, but a transient, unstable pattern that perhaps went against selection and reverted back once the infectious clone was subjected to selection from considerable onward transmission.” While this statement makes some dubious claims and another test is not needed, this comment shows that Washburne considers fitness changes in mutations at these sites to be another test of his hypothesis. This is a test that Washburne can conduct based upon published analysis of the fitness impacts of mutations: https://github.com/jbloomlab/SARS2-mut-fitness – as Washburne and co-authors have not published the results of this test, I will briefly do so here.

      The mean, median, maximum, and minimum fitness change estimated for point mutations in the “fingerprint” of the 5 BsmBI or BsaI sites in SARS2 are -1.7, -1.4, 2.2, and -6.5. The same calculations for 1000 random samples of 30 nucleotides give -1.7, -1.5, 1.8, and -6.4 (see link above on interpreting these numbers, or simply note their similarity). A search on https://cov-spectrum.org/ shows that point mutations or deletions for one or more of these 30 nucleotides have been reported in 0.75% of sequences sampled in the most recent 3 months. Point mutations or deletions for one or more of 30 random nucleotides (a single random sample; results will vary) have been reported in 0.96% of sequences in the same period. All in all, the main point of interest in these 30 nucleotides is the attention given a hypothesis of a “fingerprint” of synthetic origin that was effectively disproven before this manuscript was published.

      Finally, considering the countless number of equivalent hypotheses, I suggest that a better effort would be immune to these tests (and I can think of at least one example myself). It is critical that a manuscript of this type demonstrate that there is an unbiased rationale behind the hypotheses tested and that is plainly not the case here. One simply needs to observe that “longest fragment” is referred to 20 times in the manuscript, while “shortest fragment” goes unmentioned.

    1. On 2018-12-12 21:01:26, user Michael Neel wrote:

      Thanks for the really interesting paper! I recently reviewed this paper for a class assignment and decided to share my comments with you.

      Paper Summary:<br /> This paper investigates mechanisms that help to establish centriole number in multi-ciliated cells (MCCs). The authors investigate this using an ex vivo airway culture model that produces mouse tracheal epithelial cells which are MCCs. They investigate whether the parental centrioles (PCs) are involved in regulation of centriole abundance. Using centrinone, the authors ablate PCs from their cell cultures and present data they claim shows PCs loss does not inhibit centriole amplification, deuterosome biogenesis, or affect amplification dynamics. The authors also presented data they claim shows that PIk4 levels do not affect centriole abundance, although it may delay amplification. Lastly, the authors investigated the relationship between cell surface area and centriole abundance. They present data that suggests centriole abundance correlates with cell surface area and that they were able to affect centriole abundance by manipulating cell size. Overall, the authors propose that cell surface area is a determining factor of cilia abundance in MCCs.<br /> Overall, I liked the authors experimental approach and the amount of quantification attempted. In particular, I like how they not only investigated PCs, but also PIk4 and surface area as possible regulators of centriole abundance. I also deeply appreciate their attempts to quantify many of their immunofluorescent images. However, the paper contains a number of issues predominantly including insufficient sample sizes, and graph choices which I address in more detail below.

      Comments and suggestions <br /> 1. In many of the graphs (2b-g, 3b, 5c,d,f,g, SF3b) the authors present data that include error bars and statistical tests based on averages of 2 independent experiments. This means that most of the data have an n of 2. While an n of 2 is not technically insufficient for statistical testing, data with n=2 lack statistical power and presenting SEMs with n=2 can be misleading. I would advise the authors to perform addition independent experiments to increase their n and possibly a power analysis to determine a sufficient sample size. <br /> 2. For fig 2d-g, authors show bar graphs depicting percentages of cells with 0, 1, 2, 3, 4, >4 centrioles from cultures stained for markers of various centriole assembly stages. They claim these graphs show that loss of PCs did not affect the overall timing of centriole amplification stages, but the graphs shown do not appear to be appropriate for this type of analysis. I would suggest the authors instead include graphs quantifying the % of cells positive for the various markers in graphs similar to fig 2 b and c.<br /> 3. In their results section on page 6, authors say that deuterosomes are lost by ALI8 in control cells. However, fig 3a clearly shows some Deup1 immunostaining at ALI8 and fig 3b shows 20-30% of control cells are positive for Deup1 at ALI8. Authors should amend this statement.<br /> 4. On page 8, authors claim that manipulating PIk4 protein levels does not alter deuterosome number, which contradicts data in fig 5e that shows increased deuterosome number when PIk4 is knocked down. <br /> 5. Authors mention in results that the centrinone concentration used is roughly 3-8 times higher than needed in most cells, but do not provide their rationalization for using such a concentration. This can be addressed by including a sentence or two explaining why such a concentration was used<br /> 6. On page 7, authors state that fraction of MCCs at ALI12 show no overall difference between control and PIk4-depleted cells, but do not reference any data. Authors can address this by including a bar graph displaying this data.

    1. On 2020-08-13 12:25:42, user kdrl nakle wrote:

      That will too complex and too complicated to treat people that are days away from dying. It is OK for cancer but I bet it is not going to work effectively for acute infections. It is interesting so try to prove me wrong.

    1. On 2023-11-12 00:05:45, user Elizabeth Duncan wrote:

      Recently, a group of trainees read and discussed this preprint as part of a journal club at the Markey Cancer Center at the University of Kentucky. We thought the findings suggesting that SETD1A may be driving the increase in H3K4me3 in MLL1 mutated cells (and possibly leukemic cells with MLL1 translocations) were very intriguing. However, we have several questions and suggestions:

      In figure 2B (metagene analysis) and C (pie charts), you plot the mean read counts from H3K4me3 ChIP-seq. We interpret the unexpected lack of enrichment of H3K4me3 at gene TSSs in the WT sample as a reflection of the relatively significant increase of H3K4me3 at new gene loci in the MT1 and MT2 cell lines. Is this correct?<br /> If so, we believe this point could be made stronger by adding, for example, a Venn diagram of the genes with MAC2 peaks in the WT cells and those with peaks in the MT1 and MT2 cells. You could also create two separate metagene plots based on the data in Figure 2B: one looking at H3K4me3 in all three cells lines at genes with MACS2 peaks in WT, and one looking at H3K4me3 at genes with MACS2 peaks in MT1+MT2.<br /> Given that there is likely variability in the chromatin state in different iPSC lines, we also wonder if you performed these experiments and/or analyses using a separate iPSC line?<br /> It is unclear how you performed the differential expression analyses in figures 3, 4, and 5. The heatmaps show changes in both the WT and the mutated cell lines, even though we assume the differential expression is in relation to the WT cells? We appreciate there are many ways to perform these analyses, however we would like to understand the details of how they were done here to better understand their implications.<br /> What happens if you knock down SETD1 expression in the MLL1-R3765A cells?<br /> Do you see the same effects if you KO or KD MLL1? Versus this mutation that prevent association with WRAD?

      We look forward to seeing your paper in publication.

    1. On 2018-02-14 10:38:49, user Benoît Girard wrote:

      A purely formal comment: on fig 3A, it is quite difficult to spot the lonely additional blue spike (I had to read the legend to learn about its existence, and to subsequently search the figure to find it).

    1. On 2022-10-22 00:15:10, user CDSL JHSPH wrote:

      This is interesting research, not only because it corroborates past findings, but also because it confirms and arouses mixed reactions concerning microbial diversity in equal measure. It is my pleasure to make these remarks. The work is well structured, well researched, and properly presented, easy to read even for non-scientific audiences. When going through the details though, I could not keep the concepts of hospital-acquired infections, antibiotic resistance, and the emergence of novel diseases out of my mind, particularly because of how they are linked to the overall concept of microbial diversity and adaptability. For antimicrobial resistance, for instance, the underlying factor has everything to do with the transfer of mobile genetic elements (MGEs) between two genomes. When MGEs access the chromosomes of new bacterial hosts, the outcome is phenotypical alteration. If the MGEs contained antibiotic resistance then novel or ongoing pathogenesis may result. Nevertheless, your study has demonstrated that bacterial species rarely cross environmental barriers. However, it is interesting to note that this is not the entirety of the results because there are distinct transitions between aquatic biomes, which, noteworthy, are ancient, rare, and often directed towards the brackish biome. At the same time, there are frequent transitions into brackish sites, which are harder to explain. I am just concerned, are there tests that can ascertain these claims? Previous studies have identified that bacteria are opportunistic and may manipulate any loophole to establish supremacy. The concern is further aggravated by your additional findings, that brackish bacteria often exhibit enriched gene functions for various physiological responses, including transcriptional regulation, which is integral in the re-writing genetic information, further begging the question should there be a cause for worry.

    1. On 2018-06-22 16:01:11, user Jun wrote:

      Dear authors

      Very outstanding works. But I have some concerned about the article.<br /> 1) You mentioned "Despite many attempts, and using several different versions of Cas9 under control of different promoters, we were never able to generate mutants showing altered pigmentation, among the few transformants which resulted from transformations with either vector." <br /> Any detailed information about this statement? Which promoter you guys already tried? Which version of Cas9 you guys already tried? It will be a very useful information for whom wanna work on CRISPR in Magnaporthe oryzae.

      2) You mentioned Cas9 is toxic to Mo, but do the transformants without pigment change has Cas9 expression? Did you find any difference in the growth rate, sporation and virulence between the transformants even without pigement and WT?

      Thanks

    1. On 2021-10-08 11:57:25, user Eric Fauman wrote:

      There are many p-values listed as 0 in the supplementary tables. You need to either report the -log10(p), or include the standard errors and subject counts for each variant so researchers can calculate the p-values for themselves.

    1. On 2017-04-05 02:08:41, user Zhiyong Shen wrote:

      hi everyone,<br /> I try to install the panX on my desktop, the process of installing is well.<br /> However, i always get the error report as follows when i run the example data!<br /> Did anybody run the demo data success?

      warning: ./data/TestSet/geneCluster/GC_00001981_na.aln is not a core gene<br /> core_list============== []<br /> step07-call SNPs from core genes:<br /> 0 minutes 0 seconds (0 s)<br /> fasttree time-cost: 0 minutes 0 seconds (0 s)<br /> Traceback (most recent call last):<br /> File "./scripts/run-pipeline.py", line 128, in <module><br /> aln_to_Newick(path, params.raxml_max_time, params.threads)<br /> File "/home/shenzy/soft/pan-genome-analysis-master/scripts/SF08_core_tree_build.py", line 39, in aln_to_Newick<br /> resolve_polytomies('initial_tree.newick0','initial_tree.newick')<br /> File "/home/shenzy/soft/pan-genome-analysis-master/scripts/SF08_core_tree_build.py", line 6, in resolve_polytomies<br /> tree = Tree(newickString);<br /> File "/usr/lib/python2.7/site-packages/ete2-2.3.10-py2.7.egg/ete2/coretype/tree.py", line 218, in __init__<br /> read_newick(newick, root_node = self, format=format)<br /> File "/usr/lib/python2.7/site-packages/ete2-2.3.10-py2.7.egg/ete2/parser/newick.py", line 231, in read_newick<br /> raise NewickError('Unexisting tree file or Malformed newick tree structure.')<br /> ete2.parser.newick.NewickError: Unexisting tree file or Malformed newick tree structure.

      i past part of the results from the directory of geneCluster as follows:<br /> -rw-r--r-- 1 root root 73369 Apr 5 01:51 GC_unclust006_1.fna<br /> -rw-r--r-- 1 root root 0 Apr 5 01:51 SNP_whole_matrix.aln<br /> -rw-r--r-- 1 root root 6 Apr 5 01:51 core_geneList.cpk<br /> -rw-r--r-- 1 root root 0 Apr 5 01:51 core_geneList.txt<br /> -rw-r--r-- 1 root root 18658 Apr 5 01:51 gene_diversity.cpk<br /> -rw-r--r-- 1 root root 16578 Apr 5 01:51 gene_diversity.txt<br /> -rw-r--r-- 1 root root 54 Apr 5 01:51 snp_pos.cpk

      Did any one can help me, many thanks in advance!

    1. On 2023-01-03 08:55:04, user Pustelny Katarzyna wrote:

      Thank you. Currently, we have MS data confirming Tyr273 phosphorylation in the activation loop and it is also clearly visible on the electron density map. Detailed analysis of pTyr273 is on-going.

    1. On 2018-02-15 17:45:44, user Gaurav wrote:

      Hi

      A quick comment about using symbols for isoforms.<br /> In the Methods section, reference isoform refers to A1 (Line 4 of Overall design and problem formulation) and alternatively spliced to A2, however Figure 1 represents A as reference and A1 as alternatively spliced.<br /> The entire text uses A1 for reference and A2 for alternatively spliced.

      This is confusing.

    1. On 2019-07-30 22:49:30, user Charles Warden wrote:

      Thank you for putting together this paper.

      I was a little concerned when I saw "We estimate that a sample sequenced to the depth of 70 million total reads will typically have sufficient data for accurate gene expression<br /> analysis." for a couple reasons:

      1) For most gene expression projects, I think 10 million aligned reads is OK and 20-30 million total reads is often pretty safe. While the exonic percentage varies for library protocol, and I'm not sure about the unique read conversion (or if that conversation also varies between library protocols and sample types).

      2) I think the specifics have to be figured out for specific protocols (and raw data can be used for research purposes in different applications, or to check the validity of processed data).

      For 1), I think that was justified from both my own experience (with 50 bp single-end reads), as well as Liu et al. 2014 / Wang et al. 2011 / Tarazona et al. 2011. I noticed those papers while responding to this discussion.

      For 2), I don't exactly have a paper to show this, but I would say differential expression between groups requires testing / optimization per-project. So, you couldn't really define criteria that will work in all possible gene expression projects. While kind of messy, I have some notes from a Twitter discussion this past weekend.

      However, I think part of the discrepancy for b) is different interpretations for "differential expression," "over-/under-expression," and "outlier expression". I am mostly thinking of the 10-20 total million polyA reads for differential expression and genes with clear expression / over-expression. If you talking about a pattern that would more more likely to be a technical artifact, I can see how extra effort would be needed for gene expression analysis. For example, if you could have 2-3 biological replicates from slightly different sections of a sample (each with 10-20 million reads), that starts getting close to a total of 70 million total reads for that sample.

      I think your Figure 1A and Figure 4C (and possibly Figure 3C) makes me think there is more agreement than I originally expected from the abstract (since that emphasizes something with a threshold of 10-20 million MEND reads). However, I would say 90% specificity may be more reasonable for sensitivity (instead of 95%), for whatever metric is captured by that test. In general, I think 80% accuracy for a genomic signature is pretty good, and I think you need to be careful about over-fitting. That was part of the Twitter discussion that I linked above, but that is also described in my genomics for "hypothesis-generation" blog post.

    1. On 2017-12-21 22:01:25, user awcm0n wrote:

      Mollie, great work, but I think there's a problem with your code for simulating from a fitted model in Appendix B:

      simdatsums=lapply(simdatlist, function(x){ <br /> + ddply(x, ~spp+mined, summarize, <br /> + absence=mean(count==0), <br /> + mu=mean(count))<br /> + })

      Error in .fun(piece, ...) : argument "by" is missing, with no default

    1. On 2023-10-02 00:06:33, user LEVIn - Unicamp wrote:

      We discussed this preprint in our journal club and enjoyed reading it. We collectively agreed it's an exciting contribution to understanding how biological invasions shape invasive species' trophic niche and functional morphology in new environmental contexts. We all think the manuscript is well written and the figures are excellent! During our discussion, a major point that came up deals with how the hypothesis (lines 88-90) is motivated and then connected with the results. A more conceptual contextualization of the hypothesis in the introduction (e.g., explaining the ecological release hypothesis in the 3rd paragraph) could help readers to generalize the results beyond the study system and attract a more diverse readership interested in niche variation and biological invasions. Also, as the results combine a substantial body of statistical analyses aiming to understand variation in functional morphology and trophic niches across species, ontogenetic stages, sexes, and invaded vs. native ranges, presenting predictions after the hypotheses could help readers to navigate the results. For example, in light of the ecological release hypothesis, what is expected regarding morphological and body size variation across native and invaded areas? Our final point of discussion is related to the interpretation of the observed niche contraction in the invaded range. As replicates representing invaded vs. native ranges are sampling sites in space (Fig. 1), clarifying whether observed niche contraction emerges via lower variability in resource use across sites and/or within sites would be interesting. This is a key point to connect the results with the ecological release hypothesis. I hope you find these comments constructive; discussing this manuscript in our journal club was great. Congrats on your work, and good luck with the following steps!

      Laboratório de Ecologia das Variações Individuais (LEVIn), State University of Campinas (Unicamp), Brazil

    1. On 2016-05-14 10:53:56, user Arjun Raj wrote:

      Interesting paper! Perhaps I missed it, but I'm wondering if there's any way to know whether the putative trans-splicing or circRNA events are not due to template jumping, which I believe reverse transcriptase is (or at least was) prone to.

    1. On 2020-06-10 08:25:42, user Renzo Huber wrote:

      This is a timely manuscript of exceptionally high quality. <br /> I believe that the flexibility of this sequence approach is a quantum leap forward for the emerging field of high-resolution fMRI at high magnetic field strengths. <br /> The combination of multi-shot readouts and CAIPI-based undersampling will allow future neuroscientists to choose the matix-size and spatial resolution as desired. Despite T2*-constraints of conventional fMRI approaches.<br /> I am looking forward to using skipped-CAIPI in many of my future experiments.

    1. On 2019-04-18 18:06:07, user Paul Schanda wrote:

      This is a really useful development. We have used FLYA ourselves and managed to assign a 12 x 39 kDa protein from solid-state NMR data in a fully automatic manner - the largest protein so far assigned in solids. About a year of manual work -- or a few hours with FLYA, leading to the identical result. (https://www.biorxiv.org/con... "https://www.biorxiv.org/content/early/2018/12/16/498287)")

      Having this tool now - finally - for methyl assignments is very welcome and I am sure it will boost solution-NMR of large proteins further.

    1. On 2023-10-03 21:24:35, user Herwig Walter Lange wrote:

      It is not correct to state that the large interneurons are not lost in striatum. As i could show in 1981 (Verh. Anat. Ges. 75, S. 923-925), 52 % of the macroneurons are lost in HD, compared to a loss of 86 % of the microneurons.

    1. On 2016-04-18 17:37:35, user Julie Dunning Hotopp wrote:

      1. You should read and cite this paper by Aravind: http://biologydirect.biomed...<br /> His group already demonstrated some of the things you describe about the latrotoxin domain proteins. They talk about some other protein that might relate to these as well. We talk about it in the BMSB transcriptome paper at<br /> http://download.springer.co...

      2. I think you need to address convergent evolution in this case. It might not be convergent evolution (I can't tell from what you present). But if you don’t bring up and address the topic head on, you could set yourself up for criticism.

      3. The literature is becoming riddled with LGTs from X to Y, where X was found first and Y was found second (or vice versa). But really there are probably intermediates in between. For example, you could imagine a gene moving from a eukaryote to a Rickettsia endosymbiont to Wolbachia, or something like that. So I would look at your text to see if you can clean-up any references to that line of thinking. One of the benefits to working on recent transfers is that it isn’t an issue! ;)

    1. On 2016-06-22 21:47:00, user PornHelps wrote:

      In addition, sexual images are not considered sexual rewards by anyone. You do realize they are jacking it to these images every single time they see them, don't you? Yet you pretend like sexual stimulation doesn't exist. You are comparing apples (money reward) and oranges (sexual images). Your use of the term "porn" also is very telling. No scientist uses that term. Sexually explicit media (SEM) is an actually non-based term. You show your true inability to think or write objectively about this media.

    1. On 2019-08-23 18:40:05, user IJ wrote:

      Interesting manuscript!

      However, I would like to point out a minor error. The manuscript incorrectly states that Li and Zhang have disputed the results of Jungreis et al. "Drosophila melanogaster has been shown to have significant functional (“programmed”) readthrough (Jungreis, et al. 2011). While this is disputed (Li and Zhang 2019)..." Li and Zhang found evidence that the readthrough extensions of most of the 307 Drosophila genes found to undergo readthrough via ribosome profiling by Dunn et al. are non-adaptive. However, this set has very little overlap (only 43 genes) with the 283 genes found by Jungreis et al. to show evolutionary signatures of adaptive readthrough. While Li and Zhang have found evidence that most stop codon leakage is non-adaptive, they do not dispute that it can be adaptive for some genes. They state, "That most read-through events are nonadaptive does not preclude the possibility that a small proportion of such events have been co-opted in evolution for certain functions." Thus, Li and Zhang 2019 did not dispute the results of Jungreis et al.

      On a related note, the Dunn et al. 2013 ribosome profiling experiments on readthrough in Drosophila and yeast seem very relevant to your manuscript, and you might considering discussing, or at least citing, their work.

    1. On 2015-03-31 14:05:45, user Leslie Vosshall wrote:

      Thanks, glad you found it useful! I also hope that our experience will encourage people to post pre-prints on bioRxiv without fear that the manuscript won't be accepted subsequently by a peer-reviewed journal (disclaimer: i am on the bioRxiv board)

    1. On 2018-01-15 02:11:20, user anonymous wrote:

      Is the definition of an "interference domain" in this paper, having an area of ~1/rho around it's focal point, equivalent in any direct way, with the definition of a "linkage block" from Good et al. 2014**? i.e., sites separated by a map length of r < 1/TMRCA?

      Also, with respect to: "The transition [between interference regimes] occurs when the interference probability e^–? becomes of order 1; the transition point marks the onset of nonlinearity in the fitness variance."

      How was this determined from the simulated data? For example, in figure 2a, it isn't completely clear from those data points that there is a "transition to nonlinearity" at ?=1. Kind of looks like it, but it's not absolutely clear. How did you determine that?

      enjoyed this paper a lot, thanks!

      **http://journals.plos.org/pl...

    1. On 2020-01-27 15:38:01, user Tom Featherstone wrote:

      I'm very intrigued by this concept. It has been a while since there has been any update on this project and I'm wondering whether the company has started considering looking at alternative methods for the study of neural transmissions. Rather than just using electrodes to measure the impulses from neurons , having just read a paper on scientists using trans genetic flies to study neural path ways it would be interesting to see if they start utilizing this technique to further understand the neural pathways. It may have already started taking place however there is no mention of such at the moment neither the use of viral vectors to transmit markers round neural pathways. Really looking forward to see developments of this project and the future to come. Good luck guys! :)

    1. On 2020-09-16 15:30:36, user Raghu Parthasarathy wrote:

      The observation of cool fungi is fascinating. However, I don't understand many aspects of the proposed "mushroom-based cooling device." Since the mechanism is evaporative cooling, how is putting mushrooms in a box any better than putting the equivalent amount of water in the box? Perhaps the argument is that the mushrooms have greater surface area, but this requires that the cooling be surface-limited rather than flow-limited, and this isn't discussed; moreover, if how is mushrooms-in-a-box better than water-soaked sponges in a box (or something else with a large surface area)? Clarification would be welcome!

    1. On 2021-02-17 17:45:07, user chikheang Soeng wrote:

      Hello,<br /> My colleagues and I recently chose to present your paper in a journal club. We think that Artemisinin-resistant malaria is a major health threat and that repurposing Alisporivir as an anti-malaria drug, as demonstrated in this article, is a promising solution. <br /> I would like to share some of the comments that were brought up during our discussion. In Figure 1 and Figure 2, the interaction between Alisporivir and Cyclosporin A was demonstrated using computer simulations. However, we believe that this conclusion could be better supported by conducting an in-vitro protein binding assay. Also, in Figure 2C, the colors of the graph and the figure legends do not match, making it difficult to interpret the results. Since the main focus of this article concerns the effectiveness of Alisporivir against Artemisinin-resistant malaria, it might be a good idea to move them to the supplemental figure section. In Figure 3C, we think that the third column was mislabeled; it should be DAPI + Cyp. Otherwise, the quality of the microscopy images was excellent. When reading the methods section, the sample size could not be found and we hope to see it included there.<br /> Overall, I think that this paper is very interesting to read. I like the fact that the result section was broken down into smaller sections which makes it easy to follow. I am looking forward to reading more in the future.

    1. On 2022-01-20 16:38:23, user Mathurin Dorel wrote:

      Just a few remarks that would improve an overall rather good paper:

      • A multiplicity of infection of 0.3 is not "extraordinary" low.
      • 25 guides per cell on average is really high, either your multiplicity of infection is miscalculated or those are sequencing artefacts (a substitution makes a spacer sequence look like another). You should check that. This is probably a signal picked up by tour neural network. Another reason could be your fixation and rehydration protocol that increases the ambient noise.
      • with a multiplicity of infection of 0.3, you do expect ~25% of the cells to have >2 guides. If you find less there is a problem. However your expression vs guide assignment argument is convincing for the accuracy of scAR so it might be worth checking the expression of the candidate second guide targets.
    1. On 2019-08-03 00:24:50, user Heteromeles wrote:

      Anthropocene refugia is not a novel term or concept, as it was proposed it in 2015 in the book Hot Earth Dreams by Dr. Frank Landis, and others are pursuing the same concept with plants in California. The analysis is quite welcome.

    1. On 2022-10-31 05:04:22, user Ashraya Ravikumar wrote:

      Summary:

      In this work, the author asks how protein structures change based on analyzing the torsion angles. Through examples they show that the distribution of points in this representation correlates with resolution and data collection temperature of the structures. They also construct the RoPE space of a protein using time-resolved experiment datasets and show that minor changes in the linear coordinate space are clearly observed in the RoPE space. This work demonstrates the utility of a non-linear representation of the conformational space in visualizing changes throughout the structure which are originally considered subtle. This work is very interesting and can have significant impact on ensemble studies on protein structures and in crystallization/cryo-EM and fragment screening efforts by showing the impact of temperature and resolution. The manuscript is very concise (perhaps too concise?) and well written.

      Major points:

      1. In Page 3, para 2, the author states differences associated with data collection temperature is preserved across space groups for trypsin and lysozyme but Figure 1(a) and 1(b) marks different space groups only for lysozyme and not for trypsin<br /> 2.The section on carboxymyoglobin has some unclear statements:<br /> (a) “The RoPE space of these structures showed that, over the first three picoseconds, two torsion angle modes are sufficient to represent a clear trajectory during release of carbon monoxide”. Fig 1(e) does show a trajectory from -0.1ps to 3.0 ps but it is not clear how two torsion modes are sufficient to build the trajectory.<br /> (b)“The last three timepoints, 10 ps, 50 ps and 150 ps, are therefore beyond the biologically relevant timescales for CO dissociation in myoglobin and in-line with this, they did not strongly correlate with any other timepoints in RoPE space”. We are confused about which figure/data supports this non-correlation. Is it to be interpreted from Fig 1(e)? If yes, then the author should describe what is correlation and non-correlation in the context of this figure.<br /> (c) The section on “mapping motion back onto structure” in the methods makes it unclear why the scaling is normalized to 1degree and how that might bias the magnitude of motion observed in Figure 2a (+/- 0.3 A)
      2. We tried running some analysis on the RoPE website but it was either unclear how to go about submitting a job or the website became unresponsive after clicking on “view conformational space”. The author can provide a run-through of the website usage with some examples.
      3. It is unclear how important the vagabond refinement performed here is in the clustering. How would figure 1a, b look, for example, if the PDB or PDB-REDO models were subjected to ROPE without further refinement?
      4. At the end of the SVD, it should be possible to project the contributions for each SV back onto the torsion angles most responsible for the differences. It would be interesting to plot that for BPTI and lysozyme to identify the torsions/areas leading to the greatest differences across temperatures.

      Minor points:

      1. There are some gray colored points in Figure 1(a) and 1(b) which are not accompanied by a legend and their significance not explained.
      2. To highlight the advantage of RoPE space, the author can show clustering of the same protein chains when clustered based on RMSD. The crowding of points when using RMSD vs. the separation of points when using torsion angles can make the utility of RoPE space obvious to the reader.

      3. Ashraya Ravikumar and James Fraser, UCSF

    1. On 2019-09-10 00:30:16, user Holly Beale wrote:

      Congratulations on your paper. I really enjoyed it.

      A couple of notes: <br /> I'm working on something related in bulk RNA-Seq, and I also did subsetting of fastq with seqtk. The behavior wasn't exactly what I expected. If I used the same random seeds to take two subsets, one with one million reads and the other with two million reads, the second set included all the reads from the first set. I ended up using different random seeds for each subset.

      I think I eventually got it, but I had trouble parsing figure 3. It might be easier to understand if you omitted 2/3 of the groups from each plot. You could include the full figures in the supplement.

    1. On 2016-05-28 15:43:57, user Davidski wrote:

      Aren't the stats f4 (Steppe, Neolithic Farmer; Pop1, Pop2) potentially confounded by geography?

      Steppe has more hunter-gatherer ancestry than Farmer, so if Pop1 has more hunter-gatherer ancestry than Pop2, and hunter-gatherer ancestry is usually positively associated with higher latitude in Europe, then the stat might be significantly positive as a result of Pop1 living at a higher latitude.

      No wonder then that more northerly UK populations score more Steppe affinity in this test. That's not to say that they don't have more steppe ancestry than the southeast English. But the question is whether these particular stats can pick that up specifically, as opposed to just picking up extra hunter-gatherer ancestry in more northerly populations.

    1. On 2021-03-18 06:53:46, user Michele Nunes wrote:

      Hello,<br /> A group of undergraduate students at UCLA had the pleasure of discussing this BioRxiv paper during one of our journal clubs. Many of us were fascinated by the background information on phosphodiesterase inhibitors in relation to lipid metabolism. However, since the background consists of all text, it was a bit difficult for some of us to truly understand the signaling pathway in regards to how natriuretic peptides, PPAR?, and PDE9 were all related. We thought that including a visual aid such as a signaling map in the first figure or a visual summarizing the entire introduction would be helpful to engage readers who are not as familiar with the field.

      Specifically, in the liver photos (Figure 1f), some of us found it difficult to distinguish a change in size between the placebo and PDE9-I group solely based on the images. We thought that including a line with a known measurement or showing the livers in cylinders with weights attached would be a more helpful metric to justify the results.

      In addition, you state that these experiments were done in both OVX-female and male mice, but the only figure that includes both data is Figure 2g. The rest of the male mice data is pushed to supplementary. We were curious if there was a reason for only including a portion of male mice data? The ability to easily compare the data to OVX-female mice could bring to light important differences.

      Finally, one of my colleagues was quite interested in the notion of estrogen having a protective effect against cardiometabolic syndrome. They suggested a future rescuing experiment where if OVX-female mice with induced obesity-cardiometabolic syndrome were injected with estrogen, could estrogen reverse the effects? I thought this was a great suggestion for possible future research in this area.

      Thank you for your time!

    1. On 2024-06-06 17:27:56, user Prof. T. K. Wood wrote:

      The first TA system found to inhibit phage was Hok/Sok in 1996 (that makes it seminal). So 26 years before retrons (your ref 47) and 25 years before ToxIN (your ref 48), Hok/Sok set the precedent of stopping phage by interpreting a phage process (transcription shutoff), rather than reacting to a specific phage protein. Curious as to why this discovery does not merit citation.

    1. On 2016-07-11 18:45:07, user Ludo Waltman wrote:

      I would like to announce that I have written a blog post commenting on this paper: https://www.cwts.nl/blog?ar.... The blog post discusses the difficulty of distinguishing between the use of impact factors at the level of journals and at the level of individual papers.

      In addition to the comments made in the blog post, I also would like to raise the following issue.

      In my view, the skewness of citation distributions can be interpreted in different ways, with different implications for the use of impact factors. Let me give two interpretations:

      (1) This interpretation starts from the idea that citations provide a reasonable reflection of the quality of papers. Therefore the fact that within a single journal there are large differences in the number of citations received by papers indicates that there are large differences in the quality of papers. Consequently, the impact factor of a journal doesn’t properly reflect the quality of individual papers in the journal.

      (2) This interpretation combines two ideas. The first idea is that citations are weak indicators of the quality of papers. Papers of similar quality on average have a similar number of citations, but there is a large standard deviation. Due to all kinds of ‘distorting factors’, papers of similar quality may differ a lot in the number of citations they receive. The second idea is that journals manage reasonably well to carry out quality control. Therefore the papers published in a journal are of more or less similar quality, so the standard deviation of the quality of the papers in a journal is relatively small. It follows from these two ideas that the impact factor, which is the average number of citations of the papers in a journal, provides a reasonable reflection the quality of individual papers in the journal (especially if the journal is sufficiently large, so that the above-mentioned ‘distorting factors’ in the citations received by individual papers cancel out). The fact that some papers in a journal receive many more citations than others is not the result of quality differences but instead it results from citations being weak indicators of quality, so it results from the above-mentioned ‘distorting factors’. In this interpretation, impact factors are a stronger rather than a weaker indicator of the quality of individual papers than citation counts.

      The interpretation that the authors seem to follow in their paper, and that for instance also seems to be followed in the DORA declaration, is the first one. However, the empirical results presented by the authors, showing that citation distributions are highly skewed, are compatible with both interpretations provided above. In the second interpretation, there is no reason to reject the use of IFs to assess individual papers in a journal. Therefore, if the authors want to reject the use of IFs for this purpose, I believe they need to provide an additional argument to make clear why the first interpretation is more reasonable than the second one. I do think that the first interpretation is indeed more reasonable than the second one, but a careful argument is needed to make clear why this is the case and on which assumptions this is based.

    1. On 2020-04-16 10:49:14, user Darren Martin wrote:

      I think that we maybe need to find more viruses that connect to the tree in the branch that separates SARS-CoV2 from the MRCA node of SARS-CoV2 and RATG. Without the genome sequences of these missing relatives we're not going to get very far wrt figuring out what actually happened.

    1. On 2019-10-29 03:19:33, user mismatch_repair wrote:

      I had a number of questions/concerns about this manuscript and its co-submitted counterpart on which I would appreciate feedback from the authors:

      Some of my concerns are the following:

      1) The manuscript states that I-PpoI "recognizes ~20 sites in the genome." However, in addition to a number of unique sites in genes and noncoding regions, which comprise the 20 sites you refer to, I-PpoI cuts within every 28s rDNA repeat (which you mention as a target, but which seems to be counted only once). Mammalian genomes contain many identical rDNA repeats spanning multiple chromosomes, and copy number can vary by an order of magnitude between individuals in a species because these repetitive sequences are highly prone to recombination. These repeats are difficult to sequence and not annotated on the Mus musculus reference genome. Per the NCBI entry on the murine 28s gene: "The sequences coding for ribosomal RNAs are present as rDNA repeating units distributed on chromosomes 12, 15, 16, 18 and 19. The number of rDNA repeating units varies between individuals and from chromosome to chromosome, although usually 30 to 40 repeats are found on each chromosome. These rDNA repeats are not currently annotated on the reference genome." Several publications even report an ability of 28s rDNA units to undergo coordinated copy number expansion in response to deletion events.

      2) The claim that it is possible to generate "non-mutagenic" DSBs by simultaneously creating hundreds of compatible sticky-end cuts throughout the genome (primarily in highly repetitive sequences) is quite unprecedented. I am not aware of any prior publications on DNA repair claiming the existence of a 100% non-mutagenic DSB. The burden of proof for this should be high. However, the evidence provided here is insufficient to support this claim. There are numerous types of mutations: point mutations, minor indels, insertion and deletion of larger chromosomal regions, duplications, inversions, and chromosomal translocations. All of the larger chromosomal rearrangements are anticipated outcomes of simultaneously freeing compatible sticky ends throughout the genome. Point mutations/minor indels may occur but at lower rates. However, these minor mutations are the only ones directly assessed, by sequencing the genome and checking mapped reads. Detecting these larger genomic rearrangements is a challenging task even for experts in the field, and it seems the sequencing efforts did not extend beyond this. The genome reads are based on 500-bp fragments, which would make detection of most of these events impossible, even if you were looking for them. In the rarer case of a chimeric 500-bp read resulting from fusion of compatible but non-homologous sequences, the read would not map to the genome and have been discarded by your analysis. In the more likely case of a fusion between 28s cuts on different regions of a chromosome or on different chromosomes, the read would merely show a normal sequence in the 500 bp surrounding the cut but it would be impossible to discern where or on which chromosome the sequence is located among the numerous repetitive tracts throughout the genome.

      3) You use a few additional methods like the Surveyor assay to assess 28s mutation, but this again can only detect point mutations. Furthermore, it relies on PCR amplification so if 2 different sequences are fused, the 28s primers would no longer amplify this. And the small size to which the DNA analyzed by Southern blot was fragmented render it similarly unable to detect rearrangements. While you prepared metaphase spreads, you did not do any banding analysis which drastically limits the ability to detect chromosomal rearrangements that do not lead to obvious changes in the shape of the whole chromosome. I do not know why your ligation-based method did not detect 28s cuts, but my guess would be failed PCR. I took a look at your target amplicon and the region between the primers is 64% GC, and immediately adjacent to one of the primers is a stretch of ~70 bp with 90% GC content. This would likely make for an extremely difficult PCR- one publication describing special conditions for amplifying 28s DNA reports that "the cloning of the rDNA gene family is very difficult" and "Sequencing primers should be far from the sequences with stretches of G or C repeats." (DOI:10.17221/3960-cjas). Your baseline "aberrant metaphase" level also seems very high. Mladenov et al. (Chromosome Translocation 2018) reported that 15 aberrant chromosomes/100 metaphases as a result of 8 single I-SceI cut sites and transient transfection leads to 30% lethality in CHO cells. 12 clusters of 4 cut sites lead to 90% cell lethality.

      4) Your findings of a lack of mutagenicity from I-PpoI cutting contradict a substantial number of publications using this system. Ray Monnat, whom you cite in these manuscripts, reported that "These endonuclease-induced breaks can be repaired in vivo, although break repair is mutagenic with the frequent generation of short deletions or insertions," and also mentioned that the human genome contains ~300 28s cut sites for I-PpoI across chromosomes (1999). Other publications report the same phenomenon in yeast. Your own paper from 2015 found large deletions in mice. I-PpoI has been incorporated into a "gene drive" in mosquitos to "shred" the X chromosome and prevent the birth of female offspring. I-PpoI is derived from a mobile genetic element and its evolved purpose is to catalyze the insertion of a new sequence into the genome. Yet, you claim that in your system, it causes no genetic changes. If 100% re-ligation of cut sticky ends in the presence of many other compatible sticky ends was as likely as you suggest here, restriction enzyme-based cloning would not work, nor would numerous DNA repair reporter constructs based on this principle.

      I do not see in this manuscript data that is sufficient to support the claims being made. One manuscript claims that mutations do not accumulate substantially with age and the other cites multiple sources showing they do. It appears you have generated an artificial progeroid model with genomic instability due to DSBs and that is why you see the same phenotypes as human progeroid syndromes and mouse models based on DNA repair deficiencies. It is impossible to claim epigenetic changes are responsible for the observed phenotypes when you are in all likelihood causing extensive genetic changes to these mice.

      I also wonder how the following statements from a 2015 paper on the same mouse model (the only difference being cell type specific rather than ubiquitous expression) can be reconciled with the current manuscripts:

      2015: "To induce nuclear translocation of ERT2-I-PpoI, PpoSTOP/+; lck-Cre mice were subjected to 2–4 intraperitoneal injections of 1 mg TAM (Sigma, resuspended in corn oil) at 24 h intervals. Animals were analyzed 4 h after the final TAM injection."

      2019: "ICE mice were generated by crossing I-PpoI STOP/+ mice to CreERT2/+ mice harboring a single ERT2 fused to Cre recombinase that is induced whole body (Ruzankina et al., 2007). 4-6 month-old Cre and ICE mice were fed a modified AIN-93G purified rodent diet with 360 mg/kg Tamoxifen citrate for 3 weeks to carry out I-PpoI induction."

      2015: "33% of break-spanning DNA segments yielded a chimeric DNA sequence, in which one end of the I-PpoI-flanking DNA was joined to that of a second, polymorphic I-PpoI site located ~1 Mb downstream. No evidence for aberrant junctions was observed in break-flanking DNA from lck-Cre controls, demonstrating I-PpoI-dependent formation of these distal fusions."

      2019: "Unlike other methods of creating DSBs, such as CRISPR, chemicals and radiation, I-PpoI creates "sticky DNA ends" that are repaired without inducing a strong DNA damage response or a mutation (Yang et al., co-submitted manuscript)."

      2015: "Although moderate transcriptional changes can be detected in DSB-bearing genes, persistent DSB formation and repair is associated with a surprisingly stable transcriptome in vivo." "Our findings further suggest that DSB repair is necessary and sufficient to ensure the maintenance and/or restoration of break-proximal gene expression profiles and, by extension, epigenetic integrity in vivo." "Consistent with cell-intrinsic epigenetic deregulation being a minor consequence of continued DSB exposure in vivo, a recent study shows that DNA damage-induced, age-associated functional decline can be attributed in large part to systemic consequences of DSBs, including cell death, tissue atrophy and the ensuing, non-cell-autonomous inflammatory response."

      2019: "We present evidence that the response to DSBs changes the compartmentalization of chromatin and introduces transcriptional and epigenetic noise that closely mimics what happens during normal aging, including hallmark changes to the histone modifications, gene expression, and DNA methylation patterns." "After repair, the epigenome is reset but not completely, leading to progressive changes to the epigenetic marks and chromatin compartmentalization of the genome." "In the parlance of Waddington, the youthful epigenetic landscape is eroded to the point where cells head towards other valleys, losing their identity in a process we have termed "exdifferentiation.""

    1. On 2019-11-08 16:26:18, user V Blaine wrote:

      THCA might be the diet pill that could revolutionize the obesity industry. I suppose it is related to cannabinoid hyperemesis syndrome. And it is based on the theory that reducing a drug causes the opposite of what the drug causes. THC increases appetite and THCA causes limited appetite. I am only giving my two cents based on my experiences so that researchers can perhaps test some of these hypotheses as cannabis becomes more accepted.

    1. On 2019-02-05 09:10:55, user Ian Collinson wrote:

      Congratulations on your study. We’d like to draw your attention to one of ours. <br /> Allen et al eLife 2016;5:e15598<br /> The similarities are indeed very interesting for such a diverse system. <br /> Best wishes<br /> Ian Collinson (ian.collinson@bristol.ac.uk)

    1. On 2019-05-14 05:17:04, user Preeti Garai wrote:

      This manuscript from Garai et al. has been recently accepted for <br /> publication in PLOS Pathogens. Significant changes have been made to the <br /> preprint during the revision process and a link to the published article <br /> is forthcoming.

    1. On 2020-09-01 22:09:23, user Alexander Novokhodko wrote:

      Dear Authors,

      I believe figure 3 has two residues labeled 490 in the RBD. It looks like it should just be the phenylalanine and the leucine should be labeled differently. Please correct this typo, or let me know if I am misunderstanding something.

      Thank You,<br /> Sincerely,<br /> Alexander Novokhodko

    1. On 2023-01-31 22:59:50, user Bruce Kirkpatrick wrote:

      The data presented in Figures 1C and 1D seems to internally conflict — it would be unusual for the mesh size to increase past the size of the soft, non-degradable condition without the modulus decreasing correspondingly (i.e., it is odd that a mesh size 50% greater in the stiff vs. soft condition could be achieved in the context of a G' that is 3-fold greater in the stiff than soft condition).

    1. On 2023-02-13 00:52:12, user John Barry Gallagher wrote:

      The article as it stands makes it not possible to verify their results or conclusions: 1) there is no data or presentation of the 210Pb geochronology or independent validation, especially important in these not ideal dynamic depositional environments; 2) no disentanglement between seaweed and their epiphyte remaining deposits and allochthonous deposits, importantly that have been consumed before deposition; 3) While the title focusses on deposits under the farm, there should thus be a discussion or quasi estimation of the amount of export that survives consumption and the role of calcareous epibionts and benthic fauna on the sequestration rate the article implies from organic carbon soil accumulation; 4) how does sequestrtaion of biomass related to atmospheric flux, driven but not equivalent.

    1. On 2025-10-03 08:55:58, user Tomáš Strecanský wrote:

      Dear authors,

      Great work on this study, thank you for sharing it as a pre-print.

      I have a few quick questions about the methods that would be helpful for clarification:

      What was the centrifugation speed and time used to obtain the blood plasma?

      Could you specify the brand and material of the filter used for the plasma filtration?

      What were the sample plasma volumes used for the cfRNA extractions?

      For the sequencing, could you clarify the number of samples pooled per PromethION flowcell and the resulting sequencing depth and per sample number of reads (for both long and short reads) for each sample?

      Thanks in advance for the details. I'm looking forward to the next version of the paper!

      All the best,<br /> Tomas Strecansky<br /> PhD student<br /> Institute of Molecular Biomedicine<br /> Comenius University in Bratislava

    1. On 2024-07-11 13:25:32, user Pookey532 wrote:

      A small correction in Table 1.<br /> CRISPR gRNA vector wrongly including PAM sequence, the consequence should say "gRNA plasmid becomes target of CRISPR cleavage" with the caveat that this would only be the case if the wrongly included PAM is followed by another PAM, which is not the case in many CRISPR plasmids such as the pX330 derived ones. This would obviously affect cleaving at the target if its PAM is not followed by a second PAM.

      While some errors in the table are almost certainly errors in design (ex stop codons before a 2A sequence, mutations in ITRs, etc...) I'm curious why some of the other design "errors" are deemed errors. For example, using CMV in AAV vectors can be a perfectly acceptable choice depending on the use of the virus, especially if it isn't intended for long term expression. Likewise, use of "unstable" sequences in high copy plasmids can be a problem, however if those plasmids are maintained in bacteria that maintain plasmids at a low copy (Epi400, Stbl2, etc...), the replication origin of the plasmid becomes less relevant as the copy number becomes more dependent on the host strain. Similar to this, "Vectors containing toxic genes to E. coli host" is not necessarily a design error. Sometimes this simply the only option.

    1. On 2021-03-04 22:45:49, user Dawson White wrote:

      Thanks for your hard work elucidating these processes. I am very curious, what happens to within vs among group turnover when the numerous singleton clades are removed? I am also keen to understand the elevational distribution of your clades with >2 samples. Good luck moving forward!

    1. On 2016-08-30 09:33:53, user Matilda Katan wrote:

      For quite some time we and others working on FICD were concerned about its true enzymatic function. In particular, having to use an E234G variant to show AMPylation raised many doubts. Now, Preissler et al. not only solve this puzzle but also place FICD in a physiological context i.e. protein folding homeostasis in the ER. Great manuscript!

      Matilda Katan

    1. On 2020-12-14 18:30:37, user Rachael Tarlinton wrote:

      As others with more experience in Bio-informatics than me have pointed out the chimeric reads reported here are likely an artifact of the sequencing method. The authors have also used a very artificial cell culture system to specifically drive the phenomenon they were seeking and even then have not actually demonstrated integration of virus into the genome (this would as others have pointed out require sequencing of the DNA of the cells rather than the RNA to capture the integration sites between cellular and viral DNA). <br /> There does seem to be a case (in general) that viral infections in cells lead to increased expression of retroelements (we have reported on this ourselves) but in no case that I am aware of has anyone demonstrated that this then leads to integration of the virus (or the retroelement) into the genome. In people the accumulation of new retroelement integrations is a very rare occurrence indeed (these types of evolutionary events are measured in millions of years, not an individuals life span) . This is not the case in species with more recent and active retroviruses (such as pigs, sheep, koalas, mice, chickens) but even in those species they do not typically pick up or insert sequences from other virus classes (these types of events are even rarer than new retroelement insertions). The mechanisms speculated here have also never been known to occur with HIV infections in people (an incredibly well studied retroviral infection). <br /> This paper certainly does not demonstrate that SARs-Cov-2 is or is likely to become integrated in a human genome.

    1. On 2023-03-28 20:25:39, user Alexander Nikitin wrote:

      The authors would like to add Blaine A. Harlan and Minseok Kim as co-authors of this manuscript. The list of authors in this preprint should read as Dah-Jiun Fu, Andrea J. De Micheli, Blaine A. Harlan, Mallikarjun Bidarimath, Minseok Kim, Lora H. Ellenson, Benjamin D. Cosgrove, Andrea Flesken-Nikitin and Alexander Yu. Nikitin, with DJF, AJDM and BAH indicated as equal contributors.

    1. On 2021-03-22 22:22:39, user Anuradha Wickramarachchi wrote:

      Congratulations on your work.

      We would like to know if there is an implementation made available for this tool. Furthermore, could you clarify a bit more on the logistic regression classifier trained? Is it trained on k=3-7'mers or something else.

      Thanks in advance!

    1. On 2020-12-22 13:08:44, user ?? ?? wrote:

      This manuscript has published online in the Rhinology journal website.<br /> https://www.rhinologyjourna...

      After reviewing, we made some important changes to the content of the paper that reflected the results. Unfortunately, bioRxiv said they was not able to link it to the website, because the manuscript was published as "letter", not "original article".<br /> Please see the following link. https://www.rhinologyjourna...<br /> Thank you.

    1. On 2021-06-18 18:06:01, user Briana Rivera wrote:

      Hello.<br /> I really liked the paper, it was enjoyable and I learned a lot about the Ash1L gene. Overall, I thought the figures were beautifully put together. I appreciated how Figure 3 was organized and how the abnormal behaviors of adult mice were observed and measured. I understand autistic-like behaviors were the focus of the research but I wondered if other specific neurodevelopmental disorders were considered, since the clinical manifestations listed might also overlap with other disorders. I also appreciated seeing the differences between the global and conditional knockouts respectively. In regards to the conditional knockout, I wondered if perhaps a different promoter, like the neuron specific enolase promoter, was ever considered and if it would yield similar results. I also wondered if a conditional knockout after maturity, such as one conducted through a tamoxifen inducible cre system, would be of interest to then compare subsequent effects on brain morphology and mice behavior alike. I also appreciated Figure 4. I thought maybe as a separate or supplemental figure it would be of interest to do a gene expression comparison of cell lines derived from humans without an ASD diagnosis and those with an ASD diagnosis with an Ash1L mutation to then see if the pattern of gene expression might be similar to the results in Figure 4.<br /> Thank you for sharing all of your hard work.

    1. On 2015-05-13 16:35:53, user Josiah Zayner wrote:

      A protein with a single mutation can become sub.neofunctionalized. Some may even argue that mutations to a coding region in a gene that don't change the protein sequence could change the translation rate, which changes the folding, which changes the function. It appears you are looking at genes that are not identical so how do you know they aren't sub.neofunctionalized?

    1. On 2020-05-12 09:48:20, user Gilthorpe Lab wrote:

      'As of April 29, 2020, COVID-19 has claimed more than 200,000 lives, with a global mortality rate of ~7% and recovery rate of ~30%' - where is the citation for this? It is simply unjustified to state figures such as this.

    1. On 2018-11-20 13:13:47, user Ingmar Claes wrote:

      Very proud of being part of the bacterial revolution! At YUN we are convinced that the live biotherapeutic products (LBP) field can jointly reduce the (over/mis)use of antibiotics!

      Many thanks to the University of Antwerp and the University Hospital of Antwerp! This wouldn't have been possible without this collaboration. <br /> @SarahLebeer @Eline_Oerlemans @Filip_Kiekens @Tim_Henkens @Julien_Lambert

    1. On 2020-02-14 15:52:22, user Sebastian Dresbach wrote:

      Dear Johanna Bergmann, Andrew Morgan, & Lars Muckli,

      Thank you for providing your interesting manuscript as a preprint. Recently, we discussed this preprint in a journal club concerning layer (f)MRI at Maastricht University and would like to provide some comments.

      In general, we enjoyed the idea of applying depth dependent imaging to investigate the complex architecture underlying mental imagery and visual illusions, rather than probing basic mechanisms. Furthermore, we liked the way in which you use the stimuli to distinguish between the short-range and long-distance connections between proximal and distant cortical regions and V1. However, we were wondering about the use of distinct ROIs representing foveal and peripheral space for the two types of feedback. Specifically, we reasoned that using identical shapes and ROIs would have rendered the comparison between the depth-dependent profiles of mental imagery and illusory percept more straightforward.

      We liked that you first delineate the ROIs in surface space and subsequently project the selected vertices back to volume space, as this seems less susceptible to cortical depth artefacts. On the other hand, several issues can be raised concerning the (presentation of the) data quality and results. For example, providing tSNR maps and/or a few slices of the functional images would give a better feel for the data quality and provide a reference for the future studies. Furthermore, the r²-threshold for voxel selection appeared lower than what we are used to seeing in other similar studies (e.g. minimum around 0.3 to 0.5 range). We would be curious to hear your thoughts on this.

      Your main results are based on the group-level averages of decoding accuracy for different conditions. We highly appreciate that you also report the single subject data in the supplementary materials. As it is often the case, the single subject plots seem to be quite different than the group-level results in multiple subjects. We think that a discussion on how this influences the overall conclusions would be worthwhile.

      Minor points concern the reporting of (f)MRI parameters: Some units (e.g. for TE) are missing and we believe there is a typo in the voxel size reporting throughout. Instead of “0.8mm³”, we guess that you have meant 0.8 mm isotropic or 0.8³ mm³ or 0.8×0.8×0.8 mm³. Finally, reporting the partial Fourier-type you employ might also be important to report as this choice highly influences the resulting image quality and might provide some insight with regards to the effective image resolution.

      We hope that our input is valuable to you in some form for the next iteration of this article

      With kind regards,

      Sebastian Dresbach, Lonike Faes, Johannes Franz, Faruk Gulban, Renzo Huber, Amanda Kaas, Till Steinbach, & Yawen Wang

    1. On 2022-01-11 20:42:36, user Mina Bizic wrote:

      I would like to congratulate Rachel Szabo and colleagues on their great work and effort put into this manuscript. The goal of analyzing such a high number of particles has been something I have been calling for ever-since my work cited in the comment by Dr. Jacob Cram (Bizic-Ionescu et al., 2018). It’s exciting to see the efforts you have made in this direction.

      It’s equally exciting to see that my conclusion from 2018 that the initial colonization of particles is stochastic, is strongly featured in your paper title and well supported by your results.

      As Dr. Cram has mentioned in his comment, we discussed your study and have come up with several aspects that we feel deserve some attention and most likely to be better addressed in the manuscript. Some of these aspects were raised by Dr. Cram in his comment. However, we felt that our opinions on this manuscript were dissimilar enough to warrant separate comments, with some observations that overlap and some that differ.

      My general query goes to the applicability of the results to the natural environment, given several biases introduced by the chosen experimental system. I will list here my opinion on the source of these biases.

      1) The concentration of seawater is likely to have generated an unrealistic microbial community. This is for three reasons (A) concentration of particle-attached microbes, (B) concentration of large bacteria, and (C) non-concentration of DOM: <br /> (A) Filtering the water through a 63 µm mesh should leave all particles smaller than this size in the water The subsequent step of gentle centrifugation most likely further concentrated these microparticles increasing their abundance above natural concentrations. <br /> (B) The gentle centrifugation likely selected for larger bacteria, as smaller cells may not be concentrated by a 5 min 4000 g run. <br /> (C) Finally, the seawater DOM on which bacteria can feed was not concentrated in this process. <br /> Therefore, the resulting inoculum used for the experiment contains a size-selected microbial community and a microparticle enrichment which in the absence of ambient DOM will rapidly drive the experiments towards consumption of the particulate organic matter at rates not representing the natural environment.

      2) The incubation time and small volumes: While samples have been collected already after 12 h the experiment ran for 166 h in a closed microwell. It has been shown by many as well as by my colleagues and I that after 24 h at the latest, the community in the experiment does not represent the environmental one (for example: Baltar et al., 2012; Ionescu et al., 2015; Herlemann et al., 2019). Therefore, seeing such long experiments conducted in fully closed systems, as in this paper, makes me wonder to what degree the rates of events observed in the lab are similar to rates in nature.

      3) One possible problem with the incubation system used, is the effect of the microwell surface on microbial activity. Ploug and Jorgensen (1999), for example, came up with the net-jet system for measuring microprofiles on organic matter aggregates. However, aside of the effect of direct contact of particles with surfaces on particle properties and the microbial activity on it, a second issue is the formation of biofilms may form on the surfaces of the incubation system. Heterotrophic activity is known to increase in closed incubation systems (e.g. Fogg and Calvario-Martinez, 1989; Ionescu et al., 2015). Though it was shown that these biasing effects will occur regardless of bottle size (Hammes et al., 2010), these will likely have a stronger effect in very small incubation volumes (Herlemann et al., 2019), consuming oxygen and nutrients. I don’t recall reading whether the O2 concentration was monitored? My guess is that the system became anoxic relatively fast, unlike it would be in a natural environment. How does this affect the nature of associated (and active) bacteria?

      Having said that, I support the authors’ overall conclusion and applaud the effort that went into the data collection and analyses I am aware from my own work on the difficulties to obtain and maintain such a large number of particles in open systems, such as the one my colleagues and I designed. However, I think that the biases introduced by an experimental system should be openly discussed in the manuscript and if possible, explain how your results remain valid despite them. This is even more important when you often discuss late-stage particles, that are the most to be affected by aspects mentioned above.

      Sincerely,

      Mina Bizc

      References

      Baltar, F. et al. (2012) Prokaryotic community structure and respiration during long-term incubations. Microbiology open, 1, 214–224.

      Bizic-Ionescu, M. et al. (2018) Organic Particles: heterogeneous hubs for microbial interactions in aquatic ecosystems. Front. Microbiol., 9.

      Fogg, G. E. and Calvario-Martinez, O. (1989) Effects of bottle size in determinations of primary productivity by phytoplankton. Hydrobiologia, 173, 89–94.

      Hammes, F. et al. (2010) Critical evaluation of the volumetric “bottle effect” on microbial batch growth. Appl. Environ. Microbiol., 76, 1278–1281.

      Herlemann, D. P. R. et al. (2019) Individual physiological adaptations enable selected bacterial taxa to prevail during long-term incubations. Appl. Environ. Microbiol., 85.

      Ionescu, D. et al. (2015) A new tool for long-term studies of POM-bacteria interactions: Overcoming the century-old Bottle Effect. Sci. Rep., 5.

      Ploug, H. and Jørgensen, B. B. (1999) A net-jet flow system for mass transfer and microsensor studies of sinking aggregates. Mar. Ecol. Prog. Ser., 176, 279–290.

    1. On 2022-02-16 01:03:10, user Michael wrote:

      Recording useful metadata in a standard format is definitely a laudable goal. Building on OME, as discussed, is great. But I find that a far larger problem than instrument settings is not having information on the biology (and other reagents). Knowing that an image was taken with a 60x N.A. 1.4 Nikon planapochromat lens at 100 ms with an Andor Zyla 5.5, LED excitation at 555 nm, bandpass emission 580+/- 40 nm, with Micromanager 2 beta with ImageJ 1.53q23 running under Java 1.8.005.93b is irrelevant if you don't know the cell type and molecule labeled. In fact, we could change every detail of the technical minutiae about the microscope to digital file and the only important thing would be the cell type and molecule labeled. Metadata need to prompt the user to include details about the experiment. Proprietary systems do save the technical details pretty well, at least for standard imaging, but none record the biology (or whatever is being imaged). One critical field that’s should be added a phrase stating the goal of the experiment (like a tweet). Every notice that when you go for a clinical diagnostic medical procedure that the technician enters all sorts of data about the patient or pulls them out of an existing record? This is what is missing from microscopy metadata. This is where there really is a crisis.<br /> Sample prep and biological samples are mentioned in the introduction, but are largely absent throughout the text. However, these are the most important data that need to be recorded with the images.

    1. On 2025-11-07 10:51:52, user Tatsuya Yamashita wrote:

      Dear authors,

      at first, congratulations to this important findings. This data, paired with other ancient DNA evidence, can further clarify the demographic patterns of the peopling of Eastern Eurasia and Oceania, as well as their interactions with archaic human groups. Different deeply branching Denisovan components can be very useful data points for possible migration routes and or population substructure scenarios.

      In your pre-print, you argued for a possible earlier southern route into Oceania, followed by a later wave of the ancestors of South Asians (AASI) and East Asians, with East Asians via a possible northerly route: "This supports an early migration of the ancestors of Oceanians through South Asia followed by the later arrival of the ancestors of present-day South Asians. East Asians do not share this Denisovan component in their genomes, suggesting that their ancestors arrived independently, perhaps by a northerly route".

      One major problem with this scenario is the observed genetic affinity between the different "basal Asian" populations (e.g. Tianyuan, Önge, Hoabinhian, Xingyi_EN, Jomon/Shiraho_27k, AASI, and Australasians/Oceanians such as Papuans); also known as "eastern non-Africans" (ENA) or "East Eurasian Core" (EEC). The aDNA data strongly suggest a single dispersal route and subsequent rapid diversification into multiple basal Asian lineages (presumably in the South-Southeast Asia region via a single Southern route).

      E.g. Oceanians/Papuans can successfully be modeled (qpAdm/qpWave) as simply Önge-like + additional Denisovan; or alternatively as Tianyuan-like + additional Denisovan. They do not fit as outgroup to "West/East Eurasians" either, but are nested within the "Eastern" clade (e.g cluster with Önge, Tianyuan, or present-day East Asians).

      Although it is possible to reproduce a signal affilated with a distinct earlier southern coastal route (proxied by ZlatyKun_45k); this wave however left only minor ancestry among present-day Oceanians/Papuans (and or South Asians), with the majority ancestry of them being derived from the same source as Önge or Tianyuan: e.g. ZlatyKun + Önge-like + extra Denisovan, in a 3–5%, 92–95%, and 2–3% ratio respectively. (qpGraph models allow higher "early ancestry" for Oceanians/Papuans: 12–24% when splitting before or at around the same time as ZlatyKun/Ranis, or up to 44% when splitting at the same time as Ust'Ishim.)

      Beyond that, a northern route entry for the ancestors of East Asians seems to be only partially possible, as the majority ancestry of East Asians seems to be from an Önge-like source (except Önge also used a northern route entry).

      This means that present-day eastern non-Africans (ENA) descend primarily from a single migration wave eastwards, presumably via a route South of the Himalayas; and which possibly absorbed an earlier less successfull wave, at least regionally (Oceania and South Asia).

      This may also have happened via a more substructured wave: e.g. both a southern coastal route (along the coast of the Indian subcontinent) and a southern interior route (via an interior route along the southern Himalayan mountain range) into Southeast Asia and beyond. – It is however well possible that the southern coastal wave pre-dated the southern interior wave, and thus display different Denisovan signals. E.g. timely separated migrations waves of a shared clade.

      Regional Denisovan admixture events (or their partial absence as in the case of Jomon HGs [see the recent paper by Jiaqi Yang et al. 2025 "An early East Asian lineage with unexpectedly low Denisovan ancestry"]) can be explained that way, without needing several different distinct waves, which would contradict the observed genetic affinities of the different Basal Asian lineages. The low Denisovan ancestry of Jomon hunter-gatherers may or may not be affilated with the Shiraho_27k specimen, who appearently contributed some ancestry to later Jomon. For more information on the Shiraho_27k specimen, please contact your co-author Svante Pääbo or Hideaki Kanzawa-Kiriyama.

      Note that Tianyuan40k can successfully be modeled as Önge-like + IUP-affilated admixture (BachoKiro_IUP); which fits the presence of IUP material sites in nearby NW China and Mongolia. Such IUP admixture has also been noted to explain the observed affinity to the GoyetQ116-1 specimen in Europe, which similarly can be modeled as Kostenki14/Sunghir_UP + BachoKiro_IUP (see Hajdinjak et al. 2021 "Initial Upper Palaeolithic humans in Europe had recent Neanderthal ancestry").

      It is possible that this Siberian IUP group absorbed the EA-specific Denisovan component and via its admixture into Tianyuan, contributed it to other Eastern Asians in lesser amount. – Present-day East Asians in turn can be successfully modeled as Tianyuan-like (c. 25%) + Önge-like (c. 75%); (see McColl et al. 2018 "The prehistoric peopling of Southeast Asia" for example). – Via Tianyuan-like or Denisovan-admixed IUP groups, this archaic ancestry may have also reached regions further West (as with the supposed Denisovan signal in Sunghir etc.).

      You can also review Bennett et al. 2024 "Reconstructing the Human Population History of East Asia through Ancient Genomics", a recent summary paper on the peopling of Eastern Asia and beyond; as well as Tianyi Wang et al. 2025 "Prehistoric genomes from Yunnan reveal ancestry related to Tibetans and Austroasiatic speakers".

      A summary of my points regarding your postulated "earlier southern route" for Oceanians and a possible "northerly route" for East Asians:

      • The available genetic data strongly suggests a single shared migration wave for the primary ancestral source of all eastern non-Africans (Papuans, AASI, East Asians, Önge/Hoabinhian, and Tianyuan). The presence of multiple deeply branching EEC lineages in Southeast Asia and southern China suggest it to be a major place of diversification from a shared ancestral source.<br /> • Papuans/Oceanians (and AASI) may have limited amounts of admixture from an earlier wave, but primarily share ancestry with Önge and Tianyuan.<br /> • Tianyuan can be modeled as either an admixture between Önge-like (61–67%) and BachoKiro_IUP-like (33–39%) ancestries; or represents a deep split from the rest of eastern non-Africans; although with some geneflow into later East Asians.<br /> • Ancient and present-day East Asians can be modeled as primarily Önge-like (c. 75%) with Tianyuan-like admixture (c. 25%).<br /> • The different Denisovan introgression events, if not shared, may have happened regionally to explain the observed affinities, but the differences in Denisovan components among each group.

      Below some qpAdm results on this (AADR v.62 + Ranis dataset); allsnps=TRUE:

      Model1<br /> target: Papuan<br /> left: Hoabinhian, ZlatyKun, Denisovan<br /> right: Mbuti, Ranis13, Ust'Ishim, BachoKiro_IUP, Tianyuan, Önge, Kostenki14, Sunghir_UP<br /> Results: Hoabinhian: 93,9%; ZlatyKun: 3,2%; Denisovan: 2,9%;<br /> p-value: 0.061

      Model2<br /> target: Papuan<br /> left: Japan_Jomon, ZlatyKun, Denisovan<br /> right: Mbuti, Ranis13, Ust'Ishim, BachoKiro_IUP, Tianyuan, Kostenki14, Sunghir_UP<br /> Results: Japan_Jomon: 94,1%; ZlatyKun: 2,4%; Denisovan: 3,5%;<br /> p-value: 0.094

      Model3<br /> target: Tianyuan<br /> left: Önge, BachoKiro_IUP<br /> right: Mbuti, Ranis13, Ust'Ishim, Oase1_IUP, Papuan, Hoabinhian, Kostenki14, Sunghir_UP<br /> Results: Önge: 65,5%%; BachoKiro_IUP: 34,5%%;<br /> p-value: 0.170

      Model4<br /> target: Japan_Jomon<br /> left: Önge, Amur33k<br /> right: Mbuti, Ranis13, Ust'Ishim, BachoKiro_IUP, Tianyuan, Papuan, Hoabinhian, Kostenki14, Sunghir_UP<br /> Results: Önge: 79,0%%; Amu33k: 21,0%;<br /> p-value: 0.053

      Model5<br /> target: Han_Chinese<br /> left: Önge, Amur33k<br /> right: Mbuti, Ranis13, Ust'Ishim, BachoKiro_IUP, Tianyuan, Papuan, Hoabinhian, Kostenki14, Sunghir_UP<br /> Results: Önge: 73,6%%; Amu33k: 26,4%;<br /> p-value: 0.090

      Etc.

      (Önge = ONG Mondal; Jomon = JpOd181/274/282).

      E.g. it is not likely that Oceanians were part of a distinct earlier wave into Oceania, separate from other mainland Asian groups, nor that East Asians reached East Asia along a distinct northern route (independently of Önge-like groups etc.). Next to qpAdm/qpWave or qpGraph models, f3/f4 statistics are quite clear on this. Papuans are (beyond their extra Denisovan ancestry and possible minor "earlier group" admixture) nested in eastern non-African diversity (e.g. EEC).

      It is plausible that after the OoA exit, and the IUP/EEC dispersals from a Hub on the Persian plateau, Eastern non-Africans (ENA/EEC) shared a secondary Hub somewhere in Northwest India, from which Oceanians expanded first, via a coastal route towards Oceania. Along the coast of the Indian subcontinent (South India?), they absorbed the Deep Denisovan ancestry and continued to expand to Oceania. – Some time afterwards, the remainder ENA/EEC group (residual) expanded along an interior route South of the Himalayas into Southeast Asia and Southern China; not admixing with the Deep Denisovan group. – There, one branch split and head towards Japan (low Denisovan), while another group headed northwards coming into contact with IUP groups & the EA-specific Denisovan (Denisovan3-like) component (=Tianyuan_40k); while the remainder absorbed a local Denisovan group in Southern China or Southeast Asia (=Önge-like). – This Önge-like groups expanded back into South Asia/India, absorbing the group with Deep Denisovan introgression (becoming the AASI). The Önge-like groups staying in Southeast Asia became the Hoabinhians, while early East Asians formed along a cline of Tianyuan-like and Önge-like ancestries.

      Of course the above scenario is just one of many possibilities; it is well possible that Oceanians used a southern route, while the ancestors of both East Asians and Önge used as northerly route. – Or any other scenario which can explain the aDNA data and genetic affinities.

      My suggestion is to define a model which alignes with both aDNA data and archaic components (for ancient and present-day populations), as well as, if possible, archaeologic and paleoenvironmental evidence.

      E.g. including a set of ancient and present-day groups to test on their Denisovan components and their overall genetic affinities (not just modern groups to prevent bias from ancient geneflow events): For South Asians: AASI-rich tribal groups from Southern India, such as Irula and Paniya; for SEA: Önge, Hoabinhians; for EA: the newly analyzed Xingyi_EN samples, Jomon, Longlin, Amur14k, Qihe3, Tianyuan and Amur33k, as well as present-day East/Southeas Asians; for Oceania: Papuans, Australians, and Aeta. Maybe a chart comparing shared/distinct Denisovan components and f3/f4 statistics of each test group to each other would help clarify the exact affinities, shared routes or geneflow events. Perhaps, your co-author Svante Pääbo can share informations on the Shiraho_27k specimen and its Denisovan components.

      A strong model should explain the genetic data/affinities of ancient/present-day populations, their different Denisovan components, and in best case also include archaeologic and paleoenvironmental data. To determine the influence of ancient geneflow, comparison between ancient specimens could help (Tianyuan vs Önge vs Jomon vs Longlin vs Amur14k etc.).

      I hope this information can help to tangle out some possible scenarios on the dispersal, contact and introgression events for the different deeply branching Denisovan components and present-day Asian populations. Or maybe inspire future studies on this topic.

      I am looking forward for the publication of your paper and more exciting findings!

      Thank you.

      Yours sincerely,<br /> Yamashita Tatsuya

    1. On 2018-07-31 11:27:24, user H. Etchevers wrote:

      Just so I remember, the article has been accepted for a forthcoming Special Issue of genesis in honor of the 150th anniversary of the discovery of the neural crest. There has been a production glitch for most of the articles in this issue, but it should be out by the autumn, 2018. The future DOI of the accepted version will be: http://dx.doi/org/10.1002/dvg.23221

    1. On 2018-01-19 02:16:43, user Xiaojian Li wrote:

      Considering the "integrate and fire" model of the neuron, Statistically the step of firing spike which is a strong non-linear phenomenon only plays the role as pulse density modulation coding for better transferring analog signals...

    1. On 2019-09-14 18:29:07, user Justin Perry wrote:

      While this is a valiant amount of work on a very important topic, the likelihood that the TCR+ macrophages you see ex vivo are because of clearance of T cells by macrophages (RNA, including polyA-RNA, is incredibly stable in the phagolysosome) is high. These would likely not be removed by any of the standard single cell-RNAseq "doublet" removal techniques. The issue of RNA "contamination" has been shown independently by Dennis Discher (https://www.ncbi.nlm.nih.go... "https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5846676/)") and Steffen Jung (https://www.ncbi.nlm.nih.go... "https://www.ncbi.nlm.nih.gov/pubmed/29777220)"), and anecdotally seen by a host of groups attempting RNAseq (especially single cell RNAseq) of macrophages. I would urge caution in interpreting TCR+ macrophages as anything other than a macrophage doing its job of efferocytosis, and be wary of interpreting much from the gene signatures of macrophages because of this potential T cell contamination. Engulfment of T cells by macrophages shows a frustratingly high level of T cell-associated genes, especially prevalent genes such as those associated with signaling. None of the data presented in this preprint negate the likelihood of efferocytosis. In fact, CD68 is most commonly associated with LAMP1 and the endo-lysosomal compartments, and is often used as a marker of phagocytic macrophages in situ. Furthermore, FACS analysis of ex vivo TAMs could just as easily be of a T cell binding to TAMs, a TAM with a partially eaten T cell, or a manifestation of the tissue digestion process, where digestion at 37C for as little as 15-30 minutes can result in transfer of intact proteins (such as intact TCR), trogocytosis, or phagocytosis (like we frustratingly observed and reported previously https://www.cell.com/immuni... "https://www.cell.com/immunity/fulltext/S1074-7613(18)30144-4?_returnURL=https%3A%2F%2Flinkinghub.elsevier.com%2Fretrieve%2Fpii%2FS1074761318301444%3Fshowall%3Dtrue)").

    1. On 2021-06-08 18:04:03, user Charles Warden wrote:

      Hi,

      Thank you very much for posting this preprint.

      I am working on trying to read through the manuscript more carefully, which I hope can improve my understanding of STARsolo as well the regular STAR alignment. I also thought the different results with varying settings of Alevin was interesting and important.

      However, in the meantime, I believe that you have a minor error in one of your references:

      [30]. R. S. Brüning et al. Comparative Analysis of Common Alignment Tools for Single Cell RNA Sequencing. preprint. Bioinformatics, 2021. doi: 10.1101/2021.02.15.430948.

      I think this should be a bioRxiv preprint (not a Bioinformatics preprint)?

      For example, the DOI leads to this reference:

      https://www.biorxiv.org/con...

      Thanks Again,<br /> Charles

    1. On 2020-05-01 23:34:33, user Aaron wrote:

      The title of the manuscript seems a bit disingenuous. The authors show an increase in prevalence for G614 alongside increasing case numbers, but that is correlation, not causation. To make the jump that this is a more transmissible form of the virus would require functional studies. It seems that authors decided to go with the more sensational title rather than the more important one that this mutation didn't show any significant difference in patient outcome.

      Importantly, we have millions of cases and only thousands of genomes sequenced at this point. There is likely to be some amount of bias in which genomes are being collected, with localized founder effects apt to skew proportions of mutations for a given country or region, i.e. a majority of sequences for a country/state coming from a single center.

    1. On 2025-02-20 12:15:37, user kei wrote:

      Thank you for your interesting paper.

      I am curious about the behavior when only the non-natural amino acids in cyclic peptides are not tokenized at the atomic level, and all cyclic peptides are represented using SMILES. I am thinking of investigating this.

      That said, I have one question:<br /> What is the input format when inferring complexes of cyclic peptides containing non-natural amino acids and proteins?<br /> (In other words, how are non-natural amino acids formatted in the input?)

      If possible, I would appreciate it if you could share an example of the YAML file or other input data used for Boltz1.

      Thank you in advance.