Open Flask: January 2020

"It's just a peptide, how hard could it be?" That was Phil's remark in late 2017, when I set out to pursue the total synthesis of the alkaloid tryptorubin A.

Fast forward two years and a bit, and the lab has published the molecule's synthesis (Download paper here). That means it's my turn to write another blog post, on topics (at least vaguely) related to that paper.

A summary of the paper if you feel the manuscript is tl;dr, or you're behind a paywall:

Tryptorubin A is a neat indole alkaloid. It's isolated from an interesting ternary symbiosis: Leaf-cutter ants pile up leaves to compost into mulch. Then, the mulch they have generated goes into little mushroom farms, where the ants grow mushrooms to eat. This is their main diet. Meanwhile, on top these farmed mushrooms live a variety of defense bacteria. One of those bacteria, a Streptomyces sp., generates tryptorubin A as a metabolite. From these bacteria Clardy et al. isolated the molecule and published its fascinating structure (link to isolation).

Actually, none of this is in the paper but it's just cool trivia about the molecule.

Although nothing is known about the bioactivity or biological role of tryptorubin A, we pursued its synthesis strictly for reasons of structural interest. This is a somewhat old-school, very academic way of thinking. The project was incepted as follows:

<scrolling through JACS ASAPS>, "Hey, that's an interesting structure. I bet it would be hard to make."
Me, to Phil: "This molecule would be hard to make."
Phil, to me: "Great. The best projects are ones where people are slightly scared at the start. Have fun."*
Me: "Damnit, now I have to make it."

Off I set to make it. The chemistry is pretty self-explanatory (tons of peptide couplings, an Ulmann and a Friedel-crafts), so I won't go into much detail there. However, I will note that in the SI, we briefly highlight a dozen or so failed strategies (SI pages 8-10), if you're curious to see some early things we tried.

The extremely short summary of chemical lessons learned is (a) don't try to close highly strained macrocycles with macrolactamizations, if possible, and (b) KISS. Fancy ring contraction or other trickery never beat out simple, straightforward old school robust chemistry.

I made what I thought was the natural product <9 months into the project. This is where stuff got interesting.

None of the NMR data matched the real natural product.
However, the 2D data unambiguously confirmed that we had the correct atomic connectivity and stereochemistry.
What the hell was going on?

At this point, we reached out to team Clardy and got all the original NMR data. I spent about 2 weeks simply staring at NMRs trying to understand what could possibly be the difference between what I had made and the natural product.

This was, by far, both the most challenging and the most rewarding couple weeks in my graduate career.
I don't have many photos relating to this project, but I have this one, which summarizes the era nicely. This was taken at about 4AM. Coffee in one hand, beer (unopened?) in the other. Cumulative sleep in the preceding 72 hours? Approximately 6 hours total. Structural assignment is great. And also terrible.

Finally, we hit the conclusion. Both natural tryptorubin A (compound 1a in the published manuscript) and what we had made (compound 1b) had identical connectivity and point chirality, but were inside-out with respect to one another. Formally, this renders them 'non-canonical atropisomers'**

We spent a lot of time thinking about how to communicate this isomerism most clearly, and our best efforts are represented with cartoon drawings in the paper, reproduced here:

We of course still had to make the natural product. Luckily, a crystal structure (compound 7) in the paper gave us a big hint: The indoline's point chirality enforced a pro-atropisomeric conformation that only allowed the bicycle to close in one way. We're not the first to use point chirality to relay to atrop-control, but to our knowledge this is the first control of such a "right-side-in vs. inside-out" molecular shape in a natural product.
We made the molecule, and everyone was happy. Well, I would be happy, except I haven't gotten my fricking molecule cake. Phil, I know you're reading this. Everyone who finishes a molecule gets a cake. Where's my cake? Cake please.
Given the interesting structural nuances of this story, we desperately attempted to get solid-state structures of the respective isomers. I launched a collaboration with Hosea Nelson @ UCLA to try to do MicroED, and also with James Nowick @ UCI to try high-throughput crystallography screening, but unfortunately, never got any structures. So, we're still reliant on NMR for all conclusions.
In parallel with finishing the synthesis, we were also lucky to tie in with the Clardy group who was further elucidating the biosynthesis of this molecule. That's an interesting story in itself, but not mine to tell -- the extremely short version is this molecule is a RiPP, not an NRPS as originally believed, and there is some very cool CYP machinery to close the necessary macrocycles. More to come on this side of the story, I hope.

Fundamentally, this is a synthesis paper that's not about synthesis -- it's about understanding molecular conformation and supramolecular isomerism, and about how we can control said isomerism in a flask. We hope it's an enjoyable read.

I want to close out by thanking the incredible team I was lucky to work with throughout this process, most notably Yang Gao, who is an all around killer chemist. Thanks also to Jon Clardy and his talented postdocs Allison, Eric, and Emily for being a pleasure to collaborate with. And Phil, of course, is an amazing boss to work with. Even when he forgets to buy cake.

Y. Gao (left) and S.H. Reisberg. The Baran lab student subset of the tryptorubin A team.

*TANGENTIAL SIDEBAR #1: "Why the heck do we still do total synthesis in the modern era?"
I want to start this sidebar with the statement that opinions here (and throughout this blog post) are strictly my own and may not represent Phil or the lab's overall perspective. For Phil's opinion on the utility of total synthesis in a modern context, it's worth reading his recent perspective.

With that note, here is my hot take on total synthesis in the abstract. My viewpoint is that the goal of a total synthesis must be considered with intentionality, and that the total synthesis, once completed, should be scored against that intentionally-designed goal. In my view, any of the following are perfectly reasonable drivers of a synthesis:

Scalable access to material is needed
SAR of unnatural analogs needs to be probed
The robustness of a method for a key step is being tested, and the synthetic target is simply an arbitrary vehicle on which to test a new methodology
Drivers for new methods are desired, and the total synthesis is being pursued to illuminate "which bonds are difficult to make with existing methods" with the explicit plan of spinning out new methods projects
Blind faith that with structural complexity comes chemical insight, and that some serendipitous discovery occurs
Sheer aesthetic work (directly analogous to a concert violinist's profession; creating beauty for the sake of creating beauty)

I feel that synthesis done under any of these umbrellas is useful. The problems start to arise when the declared goal is not actually enabled in the synthesis. It drives me up a wall when people introduce synthesis as 'a tool to make scalable amounts of material' and then make <1 mg. Because, at that point, you're not actually progressing solutions to the problem you're trying to solve. Likewise, if someone has a declared goal of exploring SAR, and subsequently make a single analog, it amounts to snakeoil salesmanship. These kinds of farces are where the field bears risk for stagnation.

So, I encourage everyone considering a synthesis to spend some deep introspection asking, "Why should I make this molecule," and subsequently throughout the project enforcing that all strategic decisions are well-aligned with the answer to that 'why.'

**TANGENTIAL SIDEBAR #2: "What's in a name?"

One issue that is worth discussing is the complexity around naming the isomeric relationship between 1a and 1b. The easy layman's explanation that 'one is inside-out relative to the other' is somewhat difficult to translate into formal language. Originally (i.e., before the peer review process), we referred to these compounds as topoisomers. If you poke around for the formal definition of topoisomer, the two definitions that pop up are "two compounds with identical connectivity but non-identical molecular graphs" and "two compounds with identical connectivity and point chirality, but that can only be interconverted via scission and reformation of bonds." Now, according to the former definition, 1a and 1b are definitely not topoisomers. According to the latter definition, things begin to get a little hairy, because if one ignores the physical limitations of bond lengths, 1a and 1b can interconvert (see Figure 4), but it's a very unphysical transformation.

However, a very thoughtful reviewer (thanks!) was kind enough to point out that physicality should be ignored in the topoisomer definition, and thus we can't think of 1a and 1b as topoisomers. For this reason, they should actually be called atropisomers. This risks swinging too far in the other direction, though: 1a and 1b's isomerism ('inside out') is categorically different from a prototypical atropisomer (e.g., binaphthyl) -- tryptorubin's isomerism has way more going on than simple sigma bond torsion.

To give a sense of just how confusing all this nomenclature can be, we decided to run a small social experiment. Phil tweeted a poll about lasso peptide isomerism:

Now, before we view the results of this poll, note that lasso peptides are just like tryptorubin: They are formally topologically trivial, but have a very defined set of shape-based isomers.

Go to twitter to read some of the fascinating commentary, but the short version is this: Even Twitter experts (people with the doctoral degree TweeHD?) have diverse and contradictory views on what this isomerism should be called.

Combining all of this, we coined the term 'non-canonical atropisomerism' to encapsulate tryptorubin's (and, for that matter, lasso peptides') shape-based isomerism. For me, the upshot of the entire nomenclature story is a classic 'a rose by any other name would smell as sweet' -- what we call the isomerism basically doesn't matter; the important part is that it's communicated clearly, and hopefully the manuscript (and especially structural graphics) make everything comprehensible.

Thursday, January 2, 2020

Tryptorubin A