Ideas: Accelerating Foundation Models Research: AI for all
www.microsoft.com
Transcript[TEASER][MUSIC PLAYS UNDER DIALOG]EVELYNE VIEGAS: So AFMR is really a program which enabled us to provide access to foundation models, but its also a global network of researchers. And so for us, I think when we started that program, it was making sure that AI was made available to anyone and not just the few, right? And really important to hear from our academic colleagues, what they were discovering and covering and what were those questions that were not even really thinking about, right? So thats how we started with AFMR.CESAR TORRES: One of the things that the AFMR program has allowed me to see is this kind of ability to better visualize the terrain of creativity. And its a little bit of a double-edged sword because when we talk about disrupting creativity and we think about tools, its typically the case that the tool is making something easier for us. So my big idea is to actually think about tools that are purposely making us slower, that have friction, that have errors, that have failures. To say that maybe the easiest path is not the most advantageous, but the one that you can feel the most fulfillment or agency towards.MUHAMMED IDRIS: For me, I think what programs like AFMR have enabled us to do is really start thinking outside the box as to how will these or how can these emerging technologies revolutionize public health? What truly would it take for an LLM to understand context? And really, I think for the first time, we can truly, truly achieve personalized, if you want to use that term, health communication.[TEASER ENDS][MUSIC PLAYS]GRETCHEN HUIZINGA: Youre listening to Ideas, a Microsoft Research podcast that dives deep into the world of technology research and the profound questions behind the code. Im Gretchen Huizinga. In this series, well explore the technologies that are shaping our future and big ideas that propel them forward.[MUSIC FADES]Im excited to share the mic today with three guests to talk about a really cool program called Accelerating Foundation Models Research, or AFMR for short. With me is Cesar Torres, an assistant professor of computer science at the University of Texas, Arlington, and the director of a program called The Hybrid Atelier. More on that soon. Im also joined by Muhammed Idris, an assistant professor of medicine at the Morehouse School of Medicine. And finally, I welcome Evelyne Viegas, a technical advisor at Microsoft Research. Cesar, Muhammed, Evelyne, welcome to Ideas!EVELYNE VIEGAS: Pleasure.CESAR TORRES: Thank you.MUHAMMED IDRIS: Thank you.HUIZINGA: So I like to start these episodes with what Ive been calling the research origin story and since there are three of you, Id like you each to give us a brief overview of your work. And if there was one, what big idea or larger than life person inspired you to do what youre doing today? Cesar lets start with you and then well have Muhammed and Evelyne give their stories as well.CESAR TORRES: Sure, thanks for having me. So, I work at the frontier of creativity especially thinking about how technology could support or augment the ways that we manipulate our world and our ideas. And I would say that the origin of why I happened into this space can really come back down to a bring your kid to work day. [LAUGHTER] My dad, who worked at Maquiladora, which is a factory on the border, took me over he was an accountant and so he first showed me the accountants and hes like look at the amazing work that these folks are doing. But the reality is that a lot of what they do is hidden behind spreadsheets and so it wasnt necessarily the most engaging. Suffice to say I did not go into accounting like my dad! [LAUGHTER] But then he showed us the chemical engineer in the factory, and he would tell me this chemical engineer holds the secret formula to the most important processes in the entire company. But again, it was this black box, right? And I got a little bit closer when I looked at this process engineer who was melting metal and pulling it out of a furnace making solder and I thought wow, thats super engaging but at the same time its like it was hidden behind machinery and heat and it was just unattainable. And so finally I saw my future career and it was a factory line worker who was opening boxes. And the way that she opened boxes was incredible. Every movement, every like shift of weight was so perfectly coordinated. And I thought, here is the peak of human ability. [LAUGHTER] This was a person who had just like found a way to leverage her surroundings, to leverage her body, the material she was working with. And I thought, this is what I want to study. I want to study how people acquire skills. And I realized that moment, I realized just how important the environment and visibility was to being able to acquire skills. And so from that moment, everything that Ive done to this point has been trying to develop technologies that could get everybody to develop a skill in the same way that I saw that factory line worker that day.HUIZINGA: Wow, well, well get to the specifics on what youre doing now and how thats relevant in a bit. But thank you for that. So Muhammed, whats the big idea behind your work and how did you get to where you are today?MUHAMMED IDRIS: Yeah, no. First off, Cesar, I think its a really cool story. I wish I had an origin story [LAUGHTER] from when I was a kid, and I knew exactly what my lifes work was going to be. Actually, my story, I figured out my why much later. Actually, my background was in finance. And I started my career in the hedge fund space at a company called BlackRock, really large financial institution you might have heard of. Then I went off and I did a PhD at Penn State. And I fully intended on going back. I was going to basically be working in spreadsheets for the rest of my life. But actually during my postdoc at the time I was living in Montreal, I actually had distant relatives of mine who were coming to Montreal to apply for asylum and it was actually in helping them navigate the process, that it became clear to me, you know, the role, it was very obvious to me, the role that technology can play in helping people help themselves. And kind of the big idea that I realized is that, you know, oftentimes, you know, the world kind of provides a set of conditions, right, that strip away our rights and our dignity and our ability to really fend for ourselves. But it was so amazing to see, you know, 10-, 12-year-old kids who, just because they had a phone, were able to help their families navigate what shelter to go to, how to apply for school, and more importantly, how do they actually start the rest of their lives? And so actually at the time, I, you know, got together a few friends, and, you know, we started to think about, well, you know, all of this information is really sitting on a bulletin board somewhere. How can we digitize it? And so we put together a pretty, I would say, bad-ass team, interdisciplinary team, included developers and refugees, and we built a prototype over a weekend. And essentially what happened was we built this really cool platform called Atar. And in many ways, I would say that it was the first real solution that leveraged a lot of the natural language processing capabilities that everyone is using today to actually help people help themselves. And it did that in three really important ways. The first way is that people could essentially ask what they needed help with in natural language. And so we had some algorithms developed that would allow us to identify somebodys intent. Taking that information then, we had a set of models that would then ask you a set of questions to understand your circumstances and determine your eligibility for resources. And then from that, wed create a customized checklist for them with everything that they needed to know, where to go, what to bring, and who to talk to in order to accomplish that thing. And it was amazing to see how that very simple prototype that we developed over a weekend really became a lifeline for a lot of people. And so thats really, I think, what motivated my work in terms of trying to combine data science, emerging technologies like AI and machine learning, with the sort of community-based research that I think is important for us to truly identify applications where, in my world right now, its really studying health disparities.HUIZINGA: Yeah. Evelyne, tell us how you got into doing what youre doing as a technical advisor. Whats the big idea behind what you do and how you got here?EVELYNE VIEGAS: So as a technical advisor in Microsoft Research, I really look for ideas out there. So ideas can come from anywhere. And so think it of scanning the horizon to look for some of those ideas out there and then figuring out, are there scientific hypotheses we should be looking at? And so the idea here is, once we have identified some of those ideas, the goal is really to help nurture a healthy pipeline for potential big bets. What I do is really about subtle science and exact art and we discover as we do and it involves a lot of discussions and conversations working with our researchers here, our scientists, but of course with the external research community. And how I got here well first I will say that I am so excited to be alive in a moment where AI has made it to industry because Ive looked and worked in AI for as long as I can remember with very different approaches. And actually as important, importantly for me is really natural languages which have enabled this big evolution. People sometimes also talk about revolution in AI, via the language models. Because when I started, so I was very fortunate growing up in an environment where my family, my extended family spoke different languages, but then it was interesting to see the different idioms in those natural languages. Just to give you an example, in English you say, it rains cats and dogs. Well, in France, in French it doesnt mean anything, right? In French, actually, it rains ropes, right? Which probably doesnt mean anything in English. [LAUGHTER] And so I was really curious about natural languages and communication. When I went to school, being good at math, I ended up doing math, realizing very quickly that I didnt want to do a career in math. You know, proofs all that is good in high school, doing a full career, was not my thing, math. You know, proofs, all that. Its good in high school, but doing a full career, it was not my thing, math. But there was that class I really, really enjoyed, which was mathematical logic. And so little by little, I started discovering people working in that field. And at the same time, I was still restless with natural languages. And so I also took some classes in linguistics on the humanity university in Toulouse in France. And I stumbled on those people who were actually working in some in linguistics, some in computer science, and then there was this lab doing computational linguistics. And then that was it for me. I was like, thats, you know, so thats how I ended up doing my PhD in computational linguistics. And the last aspect Ill talk about, because in my role today, the aspect of working with a network of people, with a global network, is still so important to me, and I think for science as a whole. At the time, there was this nascent field of computational lexical semantics. And for me, it was so important to bring people together because I realized that we all had different approaches, different theories, not even in France, but across the world, and actually, I worked with somebody else, and we co-edited the first book on computational lexical semantics, where we started exposing what it meant to do lexical semantics and the relationships between words within a larger context, with a larger context of conversations, discourse, and all those different approaches. And thats an aspect which for me to this day is so important and that was also really important to keep as we develop what were going to talk about today, Accelerating Foundation Models Research program.HUIZINGA: Yeah, this is fascinating because I didnt even know all of these stories. I just knew that there were stories here and this is the first time Im hearing them. So its like this discovery process and the sort of pushing on a door and having it be, well, thats not quite the door I want. [LAUGHTER] Lets try door number two. Lets try door number three. Well, lets get onto the topic of Accelerating Foundation Models Research and unpack the big idea behind that. Evelyne, I want to stay with you on this for a minute because Im curious as to how this initiative even came to exist and what it hopes to achieve. So, maybe start out with a breakdown of the title. It might be confusing for some people, Accelerating Foundation Models Research. What is it?VIEGAS: Yeah, thank you for the question. So I think Im going to skip quickly on accelerate research. I think people can understand its just like to bring HUIZINGA: Make it faster VIEGAS: well, faster and deeper advances. I mean, there are some nuances there, but I think the terms like foundation models, maybe thats where Ill start here. So when we talk about foundation models, just think about any model which has been trained on broad data, and which actually enables you to really do any task. Thats, I think, the simplest way to talk about it. And indeed, actually people talk a lot about large language models or language models. And so think of language models as just one part, right, for those foundation models. The term was actually coined at Stanford when people started looking at GPTs, the generative pre-trained transformers, this new architecture. And so that term was coined like to go not just talk about language models, but foundation models, because actually its not just language models, but there are also vision models. And so there are other types of models and modalities really. And so when we started with Accelerating Foundation Models Research and from now on, I will say AFMR if thats okay.HUIZINGA: Yeah. Not to be confused with ASMR, which is that sort of tingly feeling you get in your head when you hear a good sound, but AFMR, yes.VIEGAS: So with the AFMR, so actually I need to come a little bit before that and just remind us that actually that this is not just new. The point I was making earlier about its so important to engage with the external research community in academia. So Microsoft Research has been doing it for as long as Ive been at Microsoft and Ive been 25 years, I just did 25 in January.HUIZINGA: Congrats!VIEGAS: And so, I thank you! and so, its really important for Microsoft Research, for Microsoft. And so we had some programs even before the GPT, ChatGPT moment where we had engaged with the external research community on a program called the Microsoft Turing Academic Program where we provided access to the Turing model, which was a smaller model than the one then developed by OpenAI. But at that time, it was very clear that we needed to be responsible, to look at safety, to look at trustworthiness of those models. And so we cannot just drink our own Kool-Aid and so we really had to work with people externally. And so we were already doing that. But that was an effort which we couldnt scale really because to scale an effort and having multiple people that can have access to the resources, you need more of a programmatic way to be able to do that and rely on some platform, like for instance, Azure, which has security and privacy, confidentiality which enables to scale those type of efforts. And so what happens as were developing this program on the Turing model with a small set of academic people, then there was this ChatGPT moment in November 2022, which was the moment like the aha moment, I think, as I mentioned, for me, its like, wow, AI now has made it to industry. And so for us, it became very clear that we could not with this moment and the amount of resources needed on the compute side, access to actually OpenAI that new that GPT, at the beginning of GPT-3 and then 4 and then So how could we build a program? First, should we, and was there interest? And academia responded Yes! Please! Of course! right? [LAUGHTER] I mean, what are you waiting for? So AFMR is really a program which enabled us to provide access to foundation models, but its also a global network of researchers. And so for us, I think when we started that program, it was making sure that AI was made available to anyone and not just the few, right? And really important to hear from our academic colleagues, what they were discovering and covering and what were those questions that we were not even really thinking about, right? So thats how we started with AFMR.HUIZINGA: This is funny, again, on the podcast, you cant see people shaking their heads, nodding in agreement, [LAUGHTER] but the two academic researchers are going, yep, thats right. Well, Muhammed, lets talk to you for a minute. I understand AFMR started a little more than a year ago with a pilot project that revolved around health applications, so this is a prime question for you. And since youre in medicine, give us a little bit of a how it started, how its going from your perspective, and why its important for you at the Morehouse School of Medicine.IDRIS: For sure. You know, its something as we mentioned that really, I remember vividly is when I saw my first GPT-3 demo, and I was absolutely blown away. This was a little bit before the ChatGPT moment that Evelyne was mentioning, but just the possibilities, oh my God, were so exciting! And again, if I tie that back to the work that we were doing, where we were trying to kind of mimic what ChatGPT is today, there were so many models that we had to build, very complex architectures, edge cases that we didnt even realize. So you could imagine when I saw that, I said, wow, this is amazing. Its going to unlock so many possibilities. But at the same time, this demo was coming out, I actually saw a tweet about the inherent biases that were baked into these models. And Ill never forget this. I think it was at the time he was a grad student at Stanford, and they were able to show that if you asked the model to complete a very simple sentence, a sort of joke, Two Muslims walk into a bar what is it going to finish? And it was scary.HUIZINGA: Wow.IDRIS: Two thirds, it was about 66% of the time, the responses referenced some sort of violence, right? And that really was an aha moment for me personally, of course, not being that Im Muslim, but beyond that, that there are all of these possibilities. At the same time, theres a lot that we dont know about how these models might operate in the real world. And of course, the first thing that this made me do as a researcher was wonder how do these emerging technologies, how may they unintentionally lead to greater health disparities? Maybe they do. Maybe they dont. The reality is that we dont know.HUIZINGA: Right.IDRIS: Now I tie that back to something that Ive been fleshing out for myself, given my time here at Morehouse School of Medicine. And kind of what I believe is that, you know, the likely outcome, and I would say this is the case for really any sort of emerging technology, but lets specifically talk about AI, machine learning, large language models, is that if were not intentional in interrogating how they perform, then whats likely going to happen is that despite overall improvements in health, were going to see greater health disparities, right? Its almost kind of that trickle-down economics type model, right? And its really this addressing of health disparities, which is at the core of the mission of Morehouse School of Medicine. It is literally the reason why I came here a few years ago. Now, the overarching goal of our program, without getting too specific, is really around evaluating the capabilities of foundation models. And those, course, as Evelyne mentioned, are large language models. And were specifically working on facilitating accessible and culturally congruent cancer-related health information. And specifically, we need to understand that communities that are disproportionately impacted have specific challenges around trust. And all of these are kind of obstacles to taking advantage of things like cancer screenings, which we know significantly reduce the likelihood of mortality. And its going very well. We have a pretty amazing interdisciplinary team. And I think weve been able to develop a pretty cool research agenda, a few papers and a few grants. Id be happy to share about a little bit later.HUIZINGA: Yeah, thats awesome. And I will ask you about those because your project is really interesting. But I want Cesar to weigh in here on sort of the goals that are the underpinning of AFMR, which is aligning AI with human values, improving AI-human interaction, and accelerating scientific discovery. Cesar, how do these goals, writ large, align with the work youre doing at UT Arlington and how has this program helped?TORRES: Yeah, I love this moment in time that everybodys been talking about, that GPT or large language model exposure. Definitely when I experienced it, the first thing that came to my head was, I need to get this technology into the hands of my students because it is so nascent, theres so many open research questions, theres so many things that can go wrong, but theres also so much potential, right? And so when I saw this research program by Microsoft I was actually surprised. I saw that, hey, they are actually acknowledging the human element. And so the fact that there was this call for research that was looking at that human dimension was really refreshing. So like what Muhammad was saying, one of the most exciting things about these large language models is you dont have to be a computer scientist in order to use them. And it reminded me to this moment in time within the arts when digital media started getting produced. And we had this crisis. There was this idea that we would lose all the skills that we have learned from working traditionally with physical materials and having to move into a digital canvas.HUIZINGA: Right.TORRES: And its kind of this, the birth of a new medium. And were kind of at this unique position to guide how this medium is produced and to make sure that people develop that virtuosity in being able to use that medium but also understand its limitations, right? And so one of the fun projects that weve done here has been around working with our glass shop. Specifically, we have this amazing neon-bending artists here at UTA, Jeremy Scidmore and Justin Ginsberg. Weve been doing some collaborations with them, and weve been essentially monitoring how they bend glass. I run an undergraduate research program here and Ive had undergrads try to tackle this problem of how do you transfer that skill of neon bending? And the fact is that because of AFMR, here is just kind of a way to structure that undergraduate research process so that people feel comfortable to ask those dumb questions exactly where they are. But what I think is even more exciting is that they start to see that questions like skill acquisition is still something that our AI is not able to do. And so its refreshing to see; its like the research problems have not all been solved. It just means that new ones have opened and ones that we previously thought were unattainable now have this groundwork, this foundation in order to be researched, to be investigated. And so its really fertile ground. And I really thank AFMR the AFMR program for letting us have access to those grounds.HUIZINGA: Yeah. Im really eager to get into both your projects because theyre both so cool. But Evelyne, I want you to just go on this access line of thought for a second because Microsoft has given grants in this program, AFMR, to several Minority Serving Institutions, or MSIs, as theyre called, including Historically Black Colleges and Universities and Hispanic Serving Institutions, so what do these grants involve? Youve alluded to it already, but can you give us some more specifics on how Microsoft is uniquely positioned to give these and what theyre doing?VIEGAS: Yes. So the grant program, per se, is really access to resources, actually compute and API access to frontier models. So think about Azure, OpenAI but also now actually as the program evolves, its also providing access to even our research models, so Phi, I mean if you like smaller models HUIZINGA: Yeah, P-H-I.VIEGAS: Yes, Phi! [LAUGHTER] OK! So, so its really about access to those resources. Its also access to people. I was talking about this global research network and the importance of it. And Ill come back to that specifically with the Minority Serving Institutions, what we did. But actually when we started, I think we started a bit in a naive way, thinking we did an open call for proposals, a global one, and we got a great response. But actually at the beginning, we really had no participation from MSIs. [LAUGHTER] And then we thought, why? Its open its and I think what we missed there, at the beginning, is like we really focused on the technology and some people who were already a part of the kind of, this global network, started approaching us, but actually a lot of people didnt even know, didnt think they could apply, right? And so we ended up doing a more targeted call where we provided not only access to the compute resources, access to the APIs to be able to develop applications or validate or expand the work which is being done with foundation models, but also we acknowledged that it was important, with MSIs, to also enable the students of the researchers like Cesar, Muhammed, and other professors who are part of the program so that they could actually spend the time working on those projects because there are some communities where the teaching load is really high compared to other communities or other colleges. So we already had a good sense that one size doesnt fit all. And I think what came also with the MSIs and others, its like also one culture doesnt fit all, right? So its about access. Its about access to people, access to the resources and really co-designing so that we can really, really make more advances together.HUIZINGA: Yeah. Cesar lets go over to you because big general terms dont tell a story as well as specific projects with specific people. So your project is called, and Im going to read this, AI-Enhanced Bricolage: Augmenting Creative Decision Making in Creative Practices. That falls under the big umbrella of Creativity and Design. So tell our audience, and as you do make sure to explain what bricolage is and why you work in a Hybrid Atelier, terms Im sure are near and dear to Evelynes heart the French language. Talk about that, Cesar.TORRES: So at UTA, I run a lab called The Hybrid Atelier. And I chose that name because lab is almost too siloed into thinking about scientific methods in order to solve problems. And I wanted something that really spoke to the ethos of the different communities of practice that generate knowledge. And so The Hybrid Atelier is a space, its a makerspace, and its filled with the tools and knowledge that you might find in creative practices like ceramics, glass working, textiles, polymer fabrication, 3D printing. And so every year I throw something new in there. And this last year, what I threw in there was GPT and large language models. And it has been exciting to see how it has transformed. But speaking to this specific project, I think the best way I can describe bricolage is to ask you a question: what would you do if you had a paperclip, duct tape, and a chewing gum wrapper? What could you make with that, right? [LAUGHTER] And so some of us have these MacGyver-type mentalities, and that is what Claude Lvi-StraussHUIZINGA: Wow.TORRES: its been an exciting project to say the least.HUIZINGA: Okay, again, my face hurts because Im grinning so hard for so long. I have to stop. No, I dont because its amazing. You made me think of that movie Apollo 13 when theyre stuck up in space and this engineer comes in with a box of, well call it bricolage, throws it down on the table and says, we need to make this fit into this using this, go. And they didnt have AI models to help them figure it out, but they did a pretty good job. Okay, Cesar, thats fabulous. I want Muhammeds story now. I have to also calm down. Its so much fun. [LAUGHTER]IDRIS: No, know I love it. I love it and actually to bring it back to what Evelyne was mentioning earlier about just getting different perspectives in a room, I think this is a perfect example of it. Actually, Cesar, I never thought of myself as being a creative person but as soon as you said a paperclip and was it the gum wrapper HUIZINGA: Duct tape.IDRIS: duct tape or gum wrapper, I thought to myself, my first internship I was able to figure out how to make two paper clips and a rubber band into a this was of course before AirPods, right? But something that I could wrap my wires around and it was perfect! [LAUGHTER] I almost started thinking to myself, how could I even scale this, or maybe get a patent on it, but it was a paper clip yeah. Uh, so, no, no, I mean, this is really exciting stuff, yeah.HUIZINGA: Well, Muhammed, let me tee you up because I want to actually I want to say your project out loud IDRIS: Please.HUIZINGA: because its called Advancing Culturally Congruent Cancer Communication with Foundation Models. You might just beat Cesars long title with yours. I dont know. [LAUGHTER] You include alliteration, which as an English major, that makes my heart happy, but its positioned under the Cognition and Societal Benefits bucket, whereas Cesars was under Creativity and Design, but I see some crossover. Evelynes probably grinning too, because this is the whole thing about research is how do these things come together and help? Tell us, Muhammed, about this cultury culturally Tell us about your project! [LAUGHTER]IDRIS: So, you know, I think again, whenever I talk about our work, especially the mission and the why of Morehouse School of Medicine, everything really centers around health disparities, right? And if you think about it, health disparities usually comes from one of many, but lets focus on kind of three potential areas. You might not know you need help, right? If you know you need help, you might not know where to go. And if you end up there, you might not get the help that you need. And if you think about it, a lot of like the kind of the through line through all of these, it really comes down to health communication at the end of the day. Its not just what people are saying, its how people are saying it as well. And so our project focuses right now on language and text, right? But we are, as Ill talk about in a second, really exploring the kind of multimodal nature of communication more broadly and so, you know, I think another thing thats important in terms of just background context is that for us, these models are more than just tools, right? We really do feel that if were intentional about it that they can be important facilitators for public health more broadly. And thats where this idea of our project fitting under the bucket at benefiting society as a whole. Now, you know, the context is that over the past couple of decades, how weve talked about cancer, how weve shared health information has just changed dramatically. And a lot of this has to do with the rise, of course, of digital technologies more broadly, social media, and now theres AI. People have more access to health information than ever before. And despite all of these advancements, of course, as I keep saying over and over again, not everyones benefiting equally, especially when it comes to cancer screening. Now, breast and cervical cancer, thats what were focusing on specifically, are two of the leading causes of cancer-related deaths in women worldwide. And actually, black and Hispanic women in the US are at particular risk and disproportionately impacted by not just lower screening rates, but later diagnoses, and of course from that, higher mortality rates as well. Now again, an important part of the context here is COVID-19. I think there are, by some estimates, about 10 million cancer screenings that didnt happen. And this is also happening within a context of just a massive amount of misinformation. Its actually something that the WHO termed as an infodemic. And so our project is trying to kind of look for creative emerging technologies-based solutions for this. And I think were doing it in a few unique ways. Now the first way is that were looking at how foundation models like the GPTs but also open-source models and those that are, lets say, specifically fine-tuned on medical texts, how do they perform in terms of their ability to generate health information? How accurate are they? How well is it written? And whether its actually useful for the communities that need it the most. We developed an evaluation framework, and we embedded within that some qualitative dimensions that are important to health communications. And we just wrapped up an analysis where we compared the general-purpose models, like a ChatGPT, with medical and more science-specific domain models and as youd expect, the general-purpose models kind of produced information that was easier to understand, but that was of course at the risk of safety and more accurate responses that the medically tuned models were able to produce. Now a second aspect of our work, and I think this is really a unique part of not what Ive called, but actually literally theres a book called The Morehouse Model, is how is it that we could actually integrate communities into research? And specifically, my work is thinking about how do we integrate communities into the development and evaluation of language models? And thats where we get the term culturally congruent. That these models are not just accurate, but theyre also aligned with the values, the beliefs, and even the communication styles of the communities that theyre meant to serve. One of the things that were thinking, you know, quite a bit about, right, is that these are not just tools to be published on and maybe put in a GitHub, you know, repo somewhere, right? That these are actually meant to drive the sort of interventions that we need within community. So of course, implementation is really key. And so for this, you know, not only do you need to understand the context within which these models will be deployed, the goal here really is to activate you and prepare you with information to be able to advocate for yourself once you actually see your doctor, right? So that again, I think is a good example of that. But you also have to keep in mind Gretchen that, you know, our goal here is, we dont want to create greater disparities between those who have and those who dont, right? And so for example, thinking about accessibility is a big thing and thats been a part of our project as well. And so for example, were leveraging some of Azure API services for speech-to-text and were even going as far as trying to leverage some of the text-to-image models to develop visuals that address health literacy barriers and try to leverage these tools to truly, truly benefit health.HUIZINGA: One of the most delightful and sometimes surprising benefits of programs like AFMR is that the technologies developed in conjunction with people in minority communities have a big impact for people in majority communities as well, often called the Curb Cut Effect. Evelyne, I wonder if youve seen any of this happen in the short time that AFMR has been going?VIEGAS: Yeah, so, Im going to focus a bit more maybe on education and examples there where weve seen, as Cesar was also talking about it, you know for scaling and all that. But weve seen a few examples of professors working with their students where English is not the first language.HUIZINGA: Yeah VIEGAS: Another one I would mention is in the context of domains. So for domains, what I mean here is application domains, like not just in CS, but weve been working with professors who are, for instance, astronomers, or lawyers, or musicians working in universities. So they started looking actually at these LLMs as more of the super advisor helping them. And so its another way of looking at it. And actually they started focusing on, can we actually build small astronomy models, right? And Im thinking, okay, that could maybe also we learn something which could be potentially applied to some other domain. So these are some of the things we are seeing.HUIZINGA: Yes.VIEGAS: But I will finish with something which may, for me, kind of challenges this Curb Cut Effect to certain extent, if I understand the concept correctly, is that I think, with this technology and the way AI and foundation models work compared to previous technologies, I feel its kind of potentially the opposite. Its kind of like the tail catching up with the head. But here I feel that with the foundation models, I think its a different way to find information and gain some knowledge. I think that actually when we look at that, these are really broad tools that now actually can be used to help customize your own curb, as it were! So kind of the other way around.HUIZINGA: Oh, interesting VIEGAS: So I think its maybe there are two dimensions. Its not just I work on something small, and it applies to everyone. I feel there is also a dimension of, this is broad, this is any tasks, and it enables many more people. I think Cesar and Muhammed made that point earlier, is you dont have to be a CS expert or rocket scientist to start using those tools and make progress in your field. So I think that maybe there is this dimension of it.HUIZINGA: I love the way you guys are flipping my questions back on me. [LAUGHTER] So, and again, that is fascinating, you know, a custom curb, not a curb cut. Cesar, Muhammad, do you, either of you, have any examples of how perhaps this is being used in your work and youre having accidental or serendipitous discoveries that sort of have a bigger impact than what you mightve thought?TORRES: Well, one thing comes to mind. Its a project that two PhD students in my lab, Adam Emerson and Shreyosi Endow have been working on. Its around this idea of communities of practice and that is to say, when we talk about how people develop skills as a group, its often through some sort of tiered structure. And Im making a tree diagram with my hands here! [LAUGHTER] And so we often talk about what its like for an outsider to enter from outside of the community, and just how much effort it takes to get through that gate, to go through the different rungs, through the different rites of passage, to finally be a part of the inner circle, so to speak. And one of the projects that weve been doing, we started to examine these known communities of practice, where they exist. But in doing this analysis, we realized that theres a couple of folks out there that exist on the periphery. And by really focusing on them, we could start to see where the field is starting to move. And these are folks that have said, Im neither in this community or another, Im going to kind of pave my own way. While were still seeing those effects of that research go through, I think being able to monitor the communities at the fringe is a really telling sign of how were advancing as a society. I think shining some light into these fringe areas, its exactly how research develops, how its really just about expanding at some bleeding edge. And I think sometimes we just have to recontextualize that that bleeding edge is sometimes the group of people that we havent been necessarily paying attention to.HUIZINGA: Right. Love it. Muhammad, do you have a quick example or, I mean, you dont have to, but I just was curious.IDRIS: Yeah, maybe Ill just give one quick example that I think keeps me excited, actually has to do with the idea of kind of small language models, right? And so, you know, I gave the example of GPT-3 and how its trained on the entirety of the internet and with that is kind of baked in some unfortunate biases, right? And so we asked ourselves the flip side of that question. Well, how is it that we can go about actually baking in some of the good bias, right? The cultural context thats important to train these models on. And the reality is that we started off by saying, lets just have focus groups. Lets talk to people. But of course that takes time, it takes money, it takes effort. And what we quickly realized actually is there are literally generations of people who have done these focus groups specifically on breast and cervical cancer screening. And so what we actually have since done is leverage that real world data in order to actually start developing synthetic data sets that are HUIZINGA: Ahhhh.IDRIS: small enough but are of higher quality enough that allow us to address the specific concerns around bias that might not exist. And so for me, thats a really like awesome thing that we came across that I think in trying to solve a problem for our kind of specific use case, I think this could actually be a method for developing more representative, context-aware, culturally sensitive models and I think overall this contributes to the overall safety and reliability of these large language models and hopefully can create a method for people to be able to do it as well.HUIZINGA: Yeah. Evelyne, I see why its so cool for you to be sitting at Microsoft Research and working with these guys Its about now that I pose the what could possibly go wrong if you got everything right? question on this podcast. And Im really interested in how researchers are thinking about the potential downsides and consequences of their work. So, Evelyne, do you have any insights on things that youve discovered along the path that might make you take preemptive steps to mitigate?VIEGAS: Yeah, I think its coming back to actually what Muhammed was just talking about, I think Cesar, too, around data, the importance of data and the cultural value and the local value. I think an important piece of continuing to be positive for me [LAUGHTER] is to make sure that we fully understand that at the end of the day, data, which is so important to build those foundation models is, especially language models in particular, are just proxies to human beings. And I feel that its uh we need to remember that its a proxy to humans and that we all have some different beliefs, values, goals, preferences. And so how do we take all that into account? And I think that beyond the data safety, provenance, I think theres an aspect of data caring. I dont know how to say it differently, [LAUGHTER] but its kind of in the same way that we care for people, how do we care for the data as a proxy to humans? And Im thinking of, you know, when we talk about like in, especially in cases where there is no economic value, right? [LAUGHTER] And so, but there is local value for those communities. And I think actually there is cultural value across countries. So just wanted to say that there is also an aspect, I think we need to do more research on, as data as proxies to humans. And as complex humans we are, right?HUIZINGA: Right. Well, one of the other questions I like to ask on these Ideas episodes is, is about the idea of blue sky or moonshot research, kind of outrageous ideas. And sometimes theyre not so much outrageous as they are just living outside the box of traditional research, kind of the what if questions that make us excited. So just briefly, is there anything on your horizon, specifically Cesar and Muhammed, that you would say, in light of this program, AFMR, that youve had access to things that you think, boy, this now would enable me to ask those bigger questions or that bigger question. I dont know what it is. Can you share anything on that line?TORRES: I guess from my end, one of the things that the AFMR program has allowed me to see is this kind of ability to better visualize the terrain of creativity. And its a little bit of a double-edged sword because when we talk about disrupting creativity and we think about tools, its typically the case that the tool is making something easier for us. But at the same time, if somethings easier, then some other thing is harder. And then we run into this really strange case where if everything is easy, then we are faced with the blank canvas syndrome, right? Like what do you even do if everything is just equally weighted with ease? And so my big idea is to actually think about tools that are purposely making us slower HUIZINGA: Mmmmm TORRES: that have friction, that have errors, that have failures and really design how those moments can change our attitudes towards how we move around in space. To say that maybe the easiest path is not the most advantageous, but the one that you can feel the most fulfillment or agency towards. And so I really do think that this is hidden in the latent space of the data that we collect. And so we just need to be immersed in that data. We need to traverse it and really it becomes an infrastructure problem. And so the more that we expose people to these foundational models, the more that were going to be able to see how we can enable these new ways of walking through and exploring our environment.HUIZINGA: Yeah. I love this so much because Ive actually been thinking some of the best experiences in our lives havent seemed like the best experiences when we went through them, right? The tough times are what make us grow. And this idea that AI makes everything accessible and easy and frictionless is what youve said. Ive used that term too. I think of the people floating around in that movie WALL-E and all they have to do is pick whether Im wearing red or blue today and which drink I want. I love this, Cesar. Thats something I hadnt even expected you might say and boom, out of the park. Muhammad, do you have any sort of outrageous ? That was flipping it back!IDRIS: I was going to say, yeah, no, I listen, I dont know how I could top that. But no, I mean, so its funny, Cesar, as you were mentioning that I was thinking about grad school, how at the time, it was the most, you know, friction-filled life experience. But in hindsight, I wouldnt trade it in for the world. For me, you know, one of the things Im often thinking about in my job is that, you know, what if we lived in a world where everyone had all the information that they needed, access to all the care they need? What would happen then? Would we magically all be the healthiest version of ourselves? Im a little bit skeptical. Im not going to lie, right? [LAUGHTER] But thats something that Im often thinking about. Now, bringing that back down to our project, one of the things that I find a little bit amusing is that I tend to ping-pong between, this is amazing, the capabilities are just, the possibilities are endless; and then there will be kind of one or two small things where its pretty obvious that theres still a lot of research that needs to be done, right? So my whole, my big what if actually, I want to bring that back down to a kind of a technical thing which is, what if AI can truly understand culture, not just language, right? And so right now, right, an AI model can translate a public health message. Its pretty straightforward from English to Spanish, right? But it doesnt inherently understand why some Spanish speaking countries may be more hesitant about certain medical interventions. It doesnt inherently appreciate the historical context that shapes that hesitancy or what kinds of messaging would build trust rather than skepticism, right? So theres literal like cultural nuances. That to me is what, when I say culturally congruent or cultural context, what it is that I mean. And I think for me, I think what programs like AFMR have enabled us to do is really start thinking outside the box as to how will these, or how can these, emerging technologies revolutionize public health? What truly would it take for an LLM to understand context? And really, I think for the first time, we can truly, truly achieve personalized, if you want to use that term, health communication. And so thats what I would say for me is like, what would that world look like?HUIZINGA: Yeah, the big animating what if? I love this. Go ahead, Evelyne, you had something. Please.VIEGAS: Can I expand? I cannot talk. Im going to do like Muhammed, I cannot talk! Like that friction and the cultural aspect, but can I expand? And as I was listening to Cesar on the education, I think I heard you talk about the educational rite of passage at some point, and Muhammed on those cultural nuances. So first, before talking about what if? I want to say that there is some work, again, when we talk about AFMR, is the technology is all the brain power of people thinking, having crazy ideas, very creative in the research being done. And there is some research where people are looking at what it means, actually, when you build those language models and how you can take into account different language and different culture or different languages within the same culture or between different cultures speaking the same language, or So there is very interesting research. And so it made me think, expanding on what Muhammed and Cesar were talking about, so this educational rite of passage, I dont know if youre aware, so in Europe in the 17th, 18th century, there was this grand tour of Europe and that was reserved to just some people who had the funds to do that grand tour of Europe, [LAUGHTER] lets be clear! But it was this educational rite of passage where actually they had to physically go to different countries to actually get familiar and experience, experiment, philosophy and different types of politics, and So that was kind of this passage oblig we say in French. I dont know if there is a translation in English, but kind of this rite of passage basically. And so I am like, wow, what if actually we could have, thanks to the AI looking at different nuances of cultures, of languages not just language, but in a multimodal point of viewpoint, what if we could have this citizen of the world rite of passage, where we before we are really citizens of the world, we need to understand other cultures, at least be exposed to them. So that would be my what if? How do we make AI do that? And so without and for anyone, right, not just people who can afford it.HUIZINGA: Well, I dont even want to close, but we have to. And Id like each of you to reflect a bit. I think I want to frame this in a way you can sort of pick what youd like to talk about. But I often have a little bit of vision casting in this section. But there are some specific things Id like you to talk about. What learnings can you share from your experience with AFMR? Or/and whats something that strikes you as important now that may not have seemed that way when you started? And you can also, Im anticipating you people are going to flip that and say, what wasnt important that is now? And also, how do see yourself moving forward in light of this experience that youve had? So Muhammed, lets go first with you, then Cesar, and then Evelyne, you can close the show.IDRIS: Awesome. One of the things that, that Im often thinking about and one of the concepts Im often reminded of, given the significance of the work that institutions like a Morehouse School of Medicine and UT Arlington and kind of Minority Serving Institutions, right, it almost feels like there is an onslaught of pushback to addressing some of these more systemic issues that we all struggle with, is what does it mean to strive for excellence, right? So in our tradition theres a concept called Ihsan. Ihsan you know theres a lot of definitions of it but essentially to do more than just the bare minimum to truly strive for excellence and I think it was interesting, having spent time at Microsoft Research in Redmond as part of the AFMR program, meeting other folks who also participated in the program that, that I started to appreciate for myself the importance of this idea of the responsible design, development, and deployment of technologies if we truly are going to achieve the potential benefits. And I think this is one of the things that I could kind of throw out there as something to take away from this podcast, its really, dont just think of what were developing as tools, but also think of them as how will they be applied in the real world? And when youre thinking about the context within which something is going to be deployed, that brings up a lot of interesting constraints, opportunities, and just context that I think is important, again, to not just work on an interesting technology for the sake of an interesting technology, but to truly achieve that benefit for society.HUIZINGA: Hmm. Cesar.TORRES: I mean, echoing Muhammad, I think the community is really at the center of how we can move forward. I would say the one element that really struck a chord with me, and something that I very much undervalued, was the power of infrastructure and spending time laying down the proper scaffolds and steppingstones, not just for you to do what youre trying to do, but to allow others to also find their own path. I was setting up Azure from one of my classes and it took time, it took effort, but the payoff has been incredible in in so much the impact that I see now of students from my class sharing with their peers. And I think this culture of entrepreneurship really comes from taking ownership of where youve been and where you can go. But it really just, it all comes down to infrastructure. And so AFMR for me has been that infrastructure to kind of get my foot out the door and also have the ability to bring some folks along the journey with me, so HUIZINGA: Yeah. Evelyne, how blessed are you to be working with people like this? Again, my face hurts from grinning so hard. Bring us home. What are your thoughts on this?VIEGAS: Yeah, so first of all, I mean, its so wonderful just here live, like listening to the feedback from Muhammed and Cesar of what AFMR brings and has the potential to bring. And first, let me acknowledge that to put a program like AFMR, it takes a village. So Im here, the face here, or well, not the face, the voice rather! [LAUGHTER] But its so many people who have, at Microsoft on the engineering side, were just talking about infrastructure, Cesar was talking about, you know, the pain and gain of leveraging an industry-grade infrastructure like Azure and Azure AI services. So, also our policy teams, of course, our researchers. But above all, the external research community so grateful to see. Its, as you said, I feel super blessed and fortunate to be working on this program and really listening what we need to do next. How can we together do better? There is one thing for me, I want to end on the community, right? Muhammed talked about this, Cesar too, the human aspect, right? The technology is super important but also understanding the human aspect. And I will say, actually, my curb cut moment for me [LAUGHTER] was really working with the MSIs and the cohort, including Muhammed and Cesar, when they came to Redmond, and really understanding some of the needs which were going beyond the infrastructure, beyond you know a small network, how we can put it bigger and deployments ideas too, coming from the community and thats something which actually we also try to bring to the whole of AFMR moving forward. And I will finish on one note, which for me is really important moving forward. We heard from Muhammed talking about the really importance of interdisciplinarity, right, and let us not work in silo. And so, and I want to see AFMR go more international, internationality if the word exists [LAUGHTER]HUIZINGA: It does now!VIEGAS: It does now! But its just making sure that when we have those collaborations, its really hard actually, time zones, you know, practically its a nightmare! But I think there is definitely an opportunity here for all of us.HUIZINGA: Well, Cesar Torres, Muhammed Idris, Evelyne Viegas. This has been so fantastic. Thank you so much for coming on the show to share your insights on AFMR today.[MUSIC PLAYS]TORRES: It was a pleasure.IDRIS: Thank you so much.VIEGAS: Pleasure.
0 Comments ·0 Shares ·51 Views