The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact. Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy. We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship: everything you need to crush it with data science.
- 1015 - 781: Ensuring Successful Enterprise AI Deployments, with Sol Rashidi
Sol Rashidi, a distinguished data executive who has served in C-suite roles at Fortune 100 companies, joins Jon Krohn to delve into successful enterprise AI strategies and the reasons behind the high turnover among Chief Data Officers. This episode provides an in-depth look at selecting AI projects that succeed and understanding the strategic value of patents in various industries. Benefit from Sol’s extensive experience and practical advice on navigating complex corporate challenges. This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au) and AWS Trainium (https://go.aws/3ycV6K0). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Why CDOs and related roles have such high turnover [09:40] • The importance of building relationships in AI projects [17:01] • How Sol's book "The AI Survival Guide" came about [20:44] • How high-criticality, low-complexity AI projects are the ones with the highest probability of success [27:11] • How enterprise data security issues can be resolved with technologies like Protopia’s stained-glass data-masking solution [36:10] • Why having great data engineers is essential [47:57] • The value of patents [51:45] Additional materials: www.superdatascience.com/781
Tue, 07 May 2024 - 1h 04min
- 1014 - 780: How to Become a Data Scientist, with Dr. Adam Ross Nelson
Want to become a data scientist? Jon and Adam discuss the key steps to becoming a data scientist, with a focus on developing portfolio projects. Hear about the 10 project ideas Adam recommends in his book to help you stand out in the data science community. Additional materials: www.superdatascience.com/780 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 03 May 2024 - 08min
- 1013 - 779: The Tidyverse of Essential R Libraries and their Python Analogues, with Dr. Hadley Wickham
Tidyverse, ggplot2, and the secret to a tech company’s longevity: Hadley Wickham talks to Jon Krohn about Posit’s rebrand, Tidyverse and why it needs to be in every data scientist’s toolkit, and why getting your hands dirty with open-source projects can be so lucrative for your career. This episode is brought to you by Intel and HPE Ezmeral Software (https://bit.ly/hpeintel). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • All about the Tidyverse [04:46] • Hadley’s favorite R libraries [17:10] • The goal of Posit [30:29] • On bringing multiple programming languages together [36:02] • The principles for a long-lasting tech company [52:10] • How Hadley developed ggplot2 [55:24] • How to contribute to the open-source community [1:05:43] Additional materials: www.superdatascience.com/779
Tue, 30 Apr 2024 - 1h 27min
- 1012 - 778: Mixtral 8x22B: SOTA Open-Source LLM Capabilities at a Fraction of the Compute
Mixtral 8x22B is the focus of this week's Five-Minute Friday. Jon Krohn examines how this model from French AI startup Mistral leverages its mixture-of-experts architecture to redefine efficiency and specialization in AI-powered tasks. Tune in to learn about its performance benchmarks and the transformative potential of its open-source license. Additional materials: www.superdatascience.com/778 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 26 Apr 2024 - 06min
- 1011 - 777: Generative AI in Practice, with Bernard Marr
Generative AI is reshaping our world, and Bernard Marr, world-renowned futurist and best-selling author, joins Jon Krohn to guide us through this transformation. In this episode, Bernard shares his insights on how AI is transforming industries, revolutionizing daily life, and addressing global challenges. With his extensive experience advising top organizations worldwide, he also examines the ethical considerations of AI deployment. This episode is brought to you by Intel and HPE Ezmeral Software (https://bit.ly/hpeintel). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How Generative AI will transform industries [03:55] • The evolution of Generative AI [10:19] • How Generative AI will impact daily life [16:52] • The ethical challenges of AI [18:55] • How corporations can harness Generative AI for collaboration [24:36] • Industries that will be impacted by Generative AI [32:20] • How Sora-like Generative AI systems will create highly immersive entertainment [42:16] • How Generative AI could unlock 99% of business data [53:34] Additional materials: www.superdatascience.com/777
Tue, 23 Apr 2024 - 1h 08min
- 1010 - 776: Deep Utopia: AI Could Solve All Human Problems in Our Lifetime
What are the risks of AI progressing beyond a point of no return? What do we stand to gain? On this Five-Minute Friday, Jon Krohn talks ‘books’ as he outlines two nonfiction works on AI and futurism by Oxford philosopher Nick Bostrom. Listen to a breakdown of DEEP UTOPIA and SUPERINTELLIGENCE in this episode. Additional materials: www.superdatascience.com/776 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 19 Apr 2024 - 07min
- 1009 - 775: What will humans do when machines are vastly more intelligent? With Aleksa Gordić
Tech entrepreneurship, artificial superintelligence, and the future of education: Aleksa Gordić speaks to Jon Krohn about his strategies for self-directed learning, the traits that help people succeed in moving from big tech to entrepreneurship, and the social impact of artificial superintelligence. This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How to motivate yourself to become a tech entrepreneur [17:02] • Aleksa’s checklist for the perfect CTO [35:00] • Potential sustainable solutions for LLMs [41:51] • The next major developments in AI and tech [48:29] • How hobbies have a knock-on effect for a person’s career [1:01:53] • How and why formal education needs to change [1:09:24] Additional materials: www.superdatascience.com/775
Tue, 16 Apr 2024 - 1h 36min
- 1008 - 774: RFM-1 Gives Robots Human-like Reasoning and Conversation Abilities
Covariant's RFM-1: Jon Krohn explores the future of AI-driven robotics with RFM-1, a groundbreaking robotics foundation model developed by Covariant and discussed by A.I. roboticist Pieter Abbeel. Explore how this innovation aims to merge digital intelligence with the physical world, promising a new era of efficiency and autonomy. Additional materials: www.superdatascience.com/774 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 12 Apr 2024 - 12min
- 1007 - 773: Deep Reinforcement Learning for Maximizing Profits, with Prof. Barrett Thomas
Dr. Barrett Thomas, an award-winning Research Professor at the University of Iowa, explores the intricacies of Markov decision processes and their connection to Deep Reinforcement Learning. Discover how these concepts are applied in operations research to enhance business efficiency and drive innovations in same-day delivery and autonomous transportation systems. This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Barrett's start in operations logistics [02:27] • Concorde Solver and the traveling salesperson problem [09:59] • Cross-function approximation explained [19:13] • How Markov decision processes relate to deep reinforcement learning [26:08] • Understanding policy in decision-making contexts [33:40] • Revolutionizing supply chains and transportation with aerial drones [46:47] • Barrett’s career evolution: past changes and future prospects [52:19] Additional materials: www.superdatascience.com/773
Tue, 09 Apr 2024 - 1h 07min
- 1006 - 772: In Case You Missed It in March 2024
PyTorch benefits, how to get funding for your AI startup, and managing scientific silos: In our new series for SuperDataScience, “In Case You Missed It”, host Jon Krohn engages in some “reinforcement learning through human feedback” of his own with need-to-hear sound bites from past SDS episodes! Additional materials: www.superdatascience.com/772 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 05 Apr 2024 - 24min
- 1005 - 771: Gradient Boosting: XGBoost, LightGBM and CatBoost, with Kirill Eremenko
Kirill Eremenko joins Jon Krohn for another exclusive, in-depth teaser for a new course just released on the SuperDataScience platform, “Machine Learning Level 2”. Kirill walks listeners through why decision trees and random forests are fruitful for businesses, and he offers hands-on walkthroughs for the three leading gradient-boosting algorithms today: XGBoost, LightGBM, and CatBoost. This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/), and by Data Universe, the out-of-this-world data conference (https://datauniverse2024.com). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • All about decision trees [09:28] • All about ensemble models [22:03] • All about AdaBoost [38:46] • All about gradient boosting [46:51] • Gradient boosting for classification problems [1:01:26] • All about XGBoost, LightGBM and CatBoost [1:04:12] Additional materials: www.superdatascience.com/771
Tue, 02 Apr 2024 - 1h 55min
- 1004 - 770: The Neuroscientific Guide to Confidence
Explore the science of confidence with Lucy Antrobus, as she unveils neuroscience-backed strategies to build and boost confidence through practice, positive energy, and the power of laughter. An essential listen for fostering unshakable self-assurance. Additional materials: www.superdatascience.com/770 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 29 Mar 2024 - 45min
- 1003 - 769: Generative AI for Medicine, with Prof. Zack Lipton
Generative AI in medicine takes center stage as Prof. Zachary Lipton, Chief Scientific Officer at Abridge, joins host Jon Krohn to discuss the significant advancements in AI that are reshaping healthcare. This episode is brought to you by the DataConnect Conference (https://www.dataconnectconf.com/dccwest/conference), and by Data Universe, the out-of-this-world data conference (https://datauniverse2024.com). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • The inspiration for Zack to get started in ML and healthcare [03:56] • The hardware required to use Abridge [12:29] • The key data science projects at Abridge right now [35:05] • Abridge's tech stack [59:54] • How Abridge ensures reliability in a high-stakes setting like healthcare [1:07:29] • How Zack’s academic research cross-pollinates with his commercial ML projects [1:21:05] • How Zack’s jazz background molded his entrepreneur and data science journey [1:30:32] Additional materials: www.superdatascience.com/769
Tue, 26 Mar 2024 - 1h 49min
- 1002 - 768: Is Claude 3 Better than GPT-4?
Claude 3, LLMs and testing ML performance: Jon Krohn tests out Anthropic’s new model family, Claude 3, which includes the Haiku, Sonnet and Opus models (written in order of their performance power, from least to greatest). Can it stand shoulder to shoulder with other models such as GPT-4 and Gemini 1.0 Ultra? And how important is it for machine learning practitioners to try out these models with their own benchmarks? Jon walks listeners through a test of his own in this Five-Minute Friday. Additional materials: www.superdatascience.com/768 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 22 Mar 2024 - 12min
- 1001 - 767: Open-Source LLM Libraries and Techniques, with Dr. Sebastian Raschka
Jon Krohn sits down with Sebastian Raschka to discuss his latest book, Machine Learning Q and AI, the open-source libraries developed by Lightning AI, how to exploit the greatest opportunities for LLM development, and what’s on the horizon for LLMs. This episode is brought to you by the DataConnect Conference (https://www.dataconnectconf.com/dccwest/conference), and by Data Universe, the out-of-this-world data conference (https://datauniverse2024.com). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • All about Machine Learning Q and AI [04:13] • Sebastian Raschka’s role as Staff Research Engineer at Lightning AI [19:21] • PyTorch Lightning’s and Lightning Fabric’s capabilities [39:32] • Large language models: Opportunities and challenges [43:35] • DoRA vs LoRA [48:56] • How to be a successful AI educator [1:34:18] Additional materials: www.superdatascience.com/767
Tue, 19 Mar 2024 - 1h 48min
- 1000 - 766: Vonnegut's Player Piano (1952): An Eerie Novel on the Current AI Revolution
Kurt Vonnegut's "Player Piano" delivers striking parallels between its dystopian vision and today's AI challenges. This week, Jon Krohn explores the novel's depiction of a world where humans are marginalized by machines, reflecting on the impact of automation on society and the ethical considerations it raises. Tune in as we unpack the timeless relevance of Vonnegut's work to the AI era. Additional materials: www.superdatascience.com/766 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 15 Mar 2024 - 08min
- 999 - 765: NumPy, SciPy and the Economics of Open-Source, with Dr. Travis Oliphant
Explore the origins of NumPy and SciPy with their creator, Dr. Travis Oliphant. Discover the journey from personal need to global impact, the challenges overcome, and the future of these essential Python libraries in scientific computing and data science. This episode is brought to you by the DataConnect Conference (https://www.dataconnectconf.com/dccwest/conference), by Data Universe, the out-of-this-world data conference (https://datauniverse2024.com), and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Travis's journey to creating NumPy and SciPy [08:05] • How Anaconda got started [42:24] • How Numba, a high-performance Python compiler, was brought to market [54:48] • Python's influence on the thought processes of scientists and engineers [1:04:21] • The commercial projects that support Travis’s vast open-source efforts and communities [1:10:22] • How to get involved in Travis's commercial projects and communities [1:22:34] • The future of scientific computing and Python libraries [1:29:50] Additional materials: www.superdatascience.com/765
Tue, 12 Mar 2024 - 1h 37min
- 998 - 764: The Top 10 Episodes of 2023
Data science futurists, bestselling authors, and lively how-to guides from the industry’s top practitioners, which range from applying data science for good to using open-source tools for NLP: This is The Super Data Science Podcast’s top ten most listened-to episodes in 2023, hosted by Jon Krohn. A great snapshot of our great content from 2023. Additional materials: www.superdatascience.com/764 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 08 Mar 2024 - 08min
- 997 - 763: The Best A.I. Startup Opportunities, with venture capitalist Rudina Seseri
At Glasswing Ventures, Rudina Seseri wants to be able to answer the question: What has Glasswing Ventures done for the company beyond capital investment? She speaks to Jon Krohn about how her company uses data to assess venture capital investments, the secret sauce of successful AI startups, and why she feels generative AI is only the start of a much broader impact that AI will make in communities and businesses. This episode is brought to you by the DataConnect Conference (https://www.dataconnectconf.com/dccwest/conference), and by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/). Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Potential interest areas for Series A AI venture capitalists [12:22] • How Glasswing’s AI Palette helps AI startups [23:06] • How data driven the venture capital industry is [27:21] • Advice for adopting services from AI providers [47:21] • Model collapse: Causes and concerns [58:44] • Glasswing’s checklist for AI startups [1:04:59] Additional materials: www.superdatascience.com/763
Tue, 05 Mar 2024 - 1h 27min
- 996 - 762: Gemini 1.5 Pro, the Million-Token-Context LLM
Jon Krohn presents an overview of Google's Gemini 1.5 Pro, a million-token-context LLM that's transforming the landscape of AI. Discover the innovative aspects of Gemini 1.5 Pro, from its extensive context window to its multimodal functionalities, which are broadening the scope of AI technology and marking a significant leap in data science. Plus, join Jon for a practical demonstration showcasing the real-world applications, capabilities, and limitations of this advanced language model. Additional materials: www.superdatascience.com/762 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 01 Mar 2024 - 16min
- 995 - 761: Gemini Ultra: How to Release an A.I. Product for Billions of Users, with Google's Lisa Cohen
Google's Gemini Ultra takes the spotlight this week, as host Jon Krohn welcomes Lisa Cohen, Google's Director of Data Science and Engineering, for a conversation about the launch of Gemini Ultra. Discover the capabilities of this cutting-edge large language model and how it stands toe-to-toe with GPT-4. Lisa shares her insights on the development, rollout, and potential of Gemini Ultra in reshaping various sectors. Whether you're a data science professional, tech enthusiast, or curious about the future of AI, this episode offers a deep dive into one of the most significant advancements in artificial intelligence. This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/), and by Intel and HPE Ezmeral Software Solutions (https://hpe.com/ezmeral/chatbots). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Google’s Gemini model family and Lisa's key responsibilities [04:55] • How LLMs will transform the practice of Data Science [19:47] • Lisa on prompt engineering and reinforcement learning from human feedback [24:38] • How to fine-tune Gemini models with Google's Vertex AI [30:52] • How AI-assistants will transform life and work for everyone from data scientists to educators to children [47:14] • The challenges of developing a data-centric culture [57:31] • Centralized vs decentralized data science teams [1:03:50] Additional materials: www.superdatascience.com/761
Tue, 27 Feb 2024 - 1h 10min
- 994 - 760: Humans Love A.I.-Crafted Beer
AI-crafted beer, machine learning for passion projects, and self-taught data science: Jon Krohn and Beau Warren’s hotly anticipated, data-driven, punny lager Krohn&Borg is finally given a taste test in this week’s Five-Minute Friday. Heading to the Species X brewery in Columbus, Ohio, Jon Krohn and Beau Warren launched the beer that had been predicted, optimized and developed by a machine-learning model. Additional materials: www.superdatascience.com/760 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 23 Feb 2024 - 06min
- 993 - 759: Full Encoder-Decoder Transformers Fully Explained, with Kirill Eremenko
Encoders, cross attention and masking for LLMs: SuperDataScience Founder Kirill Eremenko returns to the SuperDataScience podcast, where he speaks with Jon Krohn about transformer architectures and why they are a new frontier for generative AI. If you’re interested in applying LLMs to your business portfolio, you’ll want to pay close attention to this episode! This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/), by Oracle NetSuite business software (netsuite.com/superdata), and by Intel and HPE Ezmeral Software Solutions (http://hpe.com/ezmeral/chatbots). Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How decoder-only transformers work [15:51] • How cross-attention works in transformers [41:05] • How encoders and decoders work together (an example) [52:46] • How encoder-only architectures excel at understanding natural language [1:20:34] • The importance of masking during self-attention [1:27:08] Additional materials: www.superdatascience.com/759
Tue, 20 Feb 2024 - 1h 43min
- 992 - 758: The Mamba Architecture: Superior to Transformers in LLMs
Explore the groundbreaking Mamba model, a potential game-changer in AI that promises to outpace the traditional Transformer architecture with its efficient, linear-time sequence modeling. Additional materials: www.superdatascience.com/758 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 16 Feb 2024 - 08min
- 991 - 757: How to Speak so You Blow Listeners' Minds, with Cole Nussbaumer Knaflic
Explore mind-blowing storytelling with Cole Nussbaumer Knaflic in this episode. Audience favorite and author of "Storytelling with You," Cole returns to share essential tips for crafting impactful presentations, emphasizing narrative construction and audience engagement. Learn how to effectively communicate data and stories, enhancing your presentations with insights from a leading expert in the field. This episode is brought to you by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How to become a confident communicator [11:59] • How to get rid of filler words [26:32] • How facts alone can't make a strong impact [41:44] • Cole's overview of her book Storytelling with You [55:19] • How to craft an effective presentation [1:00:24] • Common mistakes in virtual presentations [1:09:48] • Cole's virtual presentation setup [1:15:33] • Cole's next book Daphne Draws Data [1:20:23] Additional materials: www.superdatascience.com/757
Tue, 13 Feb 2024 - 1h 29min
- 990 - 756: AlphaGeometry: AI is Suddenly as Capable as the Brightest Math Minds
AlphaGeometry, intuitive AI, and geometric deduction: In this week’s Five-Minute Friday, Super Data Science host Jon Krohn looks into developments from DeepMind, Google’s ground-breaking AI lab, and explores how this is a critical step towards a future of broadly accessible AI solutions across scientific disciplines. Additional materials: www.superdatascience.com/756 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 09 Feb 2024 - 08min
- 989 - 755: Brewing Beer with A.I., with Beau Warren
ChatGPT applications and data-driven beer: Beer brewer and Super Data Science regular listener Beau Warren talks to Jon Krohn about the wonders of “sweaty ales”, how to brew beer with data, and how to get started on creative machine learning projects even without a degree in data science. This episode is brought to you by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • About Species X [06:31] • How to become a certified beer taster [12:37] • How Beau checks the quality of his beer [25:01] • Beau and Jon’s machine learning project [38:02] • About genetic algorithms [52:35] • How to get creativity out of LLMs [1:24:46] Additional materials: www.superdatascience.com/755
Tue, 06 Feb 2024 - 1h 35min
- 988 - 754: A Code-Specialized LLM Will Realize AGI, with Jason Warner
Explore the future of coding with poolside co-founder and CEO Jason Warner as he explores the potential of code-specialized LLMs and their revolutionary impact on the developer's role. Tune in for insights on the shift towards an AI-led development paradigm. Additional materials: www.superdatascience.com/754 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 02 Feb 2024 - 37min
- 987 - 753: Blend Any Programming Languages in Your ML Workflows, with Dr. Greg Michaelson
Explore the future of collaborative ML workflows in this engaging episode with Dr. Greg Michaelson, Co-Founder of Zerve. Dr. Michaelson introduces the groundbreaking Zerve IDE and Pypelines project, addressing the critical gap in AutoML for commercial use and pinpointing why many A.I. projects don't meet their objectives. Gain insights into steering AI initiatives towards success and enhancing project communication, all in this insightful session. This episode is brought to you by Oracle NetSuite business software (https://netsuite.com/superdata), and by Prophets of AI (https://prophetsofai.com), the leading agency for AI experts. Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Why Zerve IDE is so sorely needed [04:50] • Pypelines: open-source AutoML in Python [30:00] • Why most commercial A.I. projects fail and how to ensure they succeed [47:45] • How AutoML will impact the role of the data scientist [53:21] • Greg's background as a pastor and working at DataRobot [1:03:40] • How to develop impressive communication and storytelling skills [1:16:16] Additional materials: www.superdatascience.com/753
Tue, 30 Jan 2024 - 1h 26min
- 986 - 752: AI is Disadvantaging Job Applicants, But You Can Fight Back
Jon Krohn interviews Hilke Schellmann about the ethics of recruitment algorithms, the field’s current state of play, and what can be improved about AI used in recruiting. Additional materials: www.superdatascience.com/752 Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information.
Fri, 26 Jan 2024 - 50min
- 985 - 751: How to Found and Fund Your Own A.I. Startup, with Dr. Rasmus Rothe
Venture capital and AI, and how to succeed with an AI company in 2024: Rasmus Rothe, Cofounder of Merantix, speaks to Jon Krohn about the Merantix campus in Berlin, how a venture capitalist identifies the best AI startups, the surefire ways for AI company founders to raise venture capital, and the jobs that are most and least vulnerable to disruption by automation. This episode is brought to you by Oracle NetSuite business software (netsuite.com/superdata), by QuickChat customized AI assistants (https://quickchat.ai), and by Prophets of AI (https://prophetsofai.com), the leading agency for AI experts. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How Merantix started [05:17] • How Merantix works and how to apply for funding [08:19] • How to secure AI funding [21:02] • How AI companies can prove competitiveness [33:46] • Ensuring AI regulation [41:17] • How AI will change the future of work [56:56] Additional materials: www.superdatascience.com/751
Tue, 23 Jan 2024 - 1h 18min
- 984 - 750: How A.I. is Transforming Science
Explore the transformative power of AI in science. Jon Krohn reviews the groundbreaking AI-driven discoveries at MIT and beyond, showcasing how AI is reshaping various scientific fields, from pharmaceuticals to climate science, and pondering the balance between AI's capabilities and human ingenuity. Additional materials: www.superdatascience.com/750 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 19 Jan 2024 - 09min
- 983 - 749: Data Science for Clean Energy, with Emily Pastewka
Data science for clean energy takes center stage as Emily Pastewka from Palmetto joins Jon Krohn this week, exploring innovative paths to a sustainable future. This episode covers the impact of AI on smart energy choices, the creation of a smart grid, and the wide array of professionals required to bring cleantech data solutions to life. This episode is brought to you by Prophets of AI (https://prophetsofai.com), the leading agency for AI experts. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Emily on her Master's in Deep Learning [08:20] • Using AI to solve clean energy challenges at Palmetto [17:22] • The different roles needed to solve cleantech problems [27:33] • How econometrics impacts consumer decision-making [38:56] • How Emily manages high-performing teams [56:30] • The tools and technologies that drive small teams [1:06:58] Additional materials: www.superdatascience.com/749
Tue, 16 Jan 2024 - 1h 16min
- 982 - 748: The Five Levels of AGI
Artificial General Intelligence gets a new definition: This episode introduces Google DeepMind’s paper, “Levels of AGI: Operationalizing Progress on the Path to AGI”. Hear how its authors have organized narrow and general AI into hierarchical categories defined by human capability, from Level 0 (no AI) and Level 1 (equal to or somewhat better than an unskilled human) to Level 5 (able to outperform 100% of humans). A scary thought? Or a vision of a better future? Host Jon Krohn details the strengths of this research in this Five-Minute Friday. Additional materials: www.superdatascience.com/748 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 12 Jan 2024 - 11min - 981 - 747: Technical Intro to Transformers and LLMs, with Kirill Eremenko
Attention and transformers in LLMs, the five stages of data processing, and a brand-new Large Language Models A-Z course: Kirill Eremenko joins host Jon Krohn to explore what goes into well-crafted LLMs, what makes Transformers so powerful, and how to succeed as a data scientist in this new age of generative AI. This episode is brought to you by Intel and HPE Ezmeral Software Solutions (https://hpe.com/ezmeral/chatwithyourdata), and by Prophets of AI (https://prophetsofai.com), the leading agency for AI experts. Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Supply and demand in AI recruitment [08:30] • Kirill and Hadelin's new course on LLMs, “Large Language Models (LLMs), Transformers & GPT A-Z” [15:37] • The learning difficulty in understanding LLMs [19:46] • The basics of LLMs [22:00] • The five building blocks of transformer architecture [36:29] - 1: Input embedding [44:10] - 2: Positional encoding [50:46] - 3: Attention mechanism [54:04] - 4: Feedforward neural network [1:16:17] - 5: Linear transformation and softmax [1:19:16] • Inference vs training time [1:29:12] • Why transformers are so powerful [1:49:22] Additional materials: www.superdatascience.com/747
Tue, 09 Jan 2024 - 2h 06min - 980 - 746: A Continuous Calendar for 2024
Jon’s continuous calendar for 2024 is here! Now in an updated format, learn about its unique layout and benefits, and how it can revolutionize your planning for the new year. Additional materials: www.superdatascience.com/746 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Fri, 05 Jan 2024 - 02min - 979 - 745: 2024 Data Science Trend Predictions
2024 data science trends take the spotlight in this special episode, where Jon joins Sadie St. Lawrence to analyze last year's predictions and delve into the emerging technologies reshaping the field. From AI hardware accelerators to the transformative role of large language models, this episode is a treasure trove of insights for anyone interested in the future of data science. This episode is brought to you by CloudWolf (www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Reviewing predictions for 2023 [05:56] • Sadie's trend predictions for 2024 [20:49] - 1: Hardware evolution [21:17] - 2: LLMOS [35:30] - 3: Slow-thinking model [48:18] - 4: Tool consolidation [54:46] - 5: Workforce Upheaval [58:06] • Jon's predictions [1:06:26] - 1: AI bubble bursting [1:08:11] - 2: Breakthroughs in Edge AI [1:12:22] • Sadie on her productivity planner [1:17:50] Additional materials: www.superdatascience.com/745
Tue, 02 Jan 2024 - 1h 30min - 978 - 744: To a Peaceful 2024
2023: A year of great movement and change. Technological developments have rocketed generative AI’s capabilities into the stratosphere of possibilities for future approaches to work, health, and play. Host Jon Krohn recognizes the benefits we have seen over the past year, discusses the important role we all have in ensuring ethics remains at the core of AI development and use, and he ends the year with a musical surprise for his listeners! Additional materials: www.superdatascience.com/744 Interested in sponsoring a SuperDataScience Podcast episode? Visit http://passionfroot.me/superdatascience for sponsorship information.
Fri, 29 Dec 2023 - 06min - 977 - 743: How to Integrate Generative A.I. Into Your Business, with Piotr Grudzień
Chatbots, large language models and generative AI: Founder of Quickchat AI Piotr Grudzień believes the key to any successful AI platform is to ensure it can be tailored to a company’s specific needs. He speaks to host Jon Krohn about helping clients generate realistic and satisfying conversations that help their customer base find what they need quickly. This episode is brought to you by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader, and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit http://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • About Quickchat AI and how it works [02:46] • How to successfully set up a conversational AI [23:58] • What “temperature” is in the context of AI [38:38] • How the LLM landscape has changed in recent years [40:24] • The future of generative AI [57:43] • The advantages of an AI accelerator [1:09:38] Additional materials: www.superdatascience.com/743
Tue, 26 Dec 2023 - 1h 19min - 976 - 742: Happy Holidays from All of Us
Join us on a brief journey through the AI world in 2023. A year ago, GPT-3.5 crafting our holiday message was a marvel, but now, with GPT-4's arrival, we're seeing an even more astounding evolution in AI. As we wave goodbye to the trend of generative AI, the Super Data Science Podcast team is bringing a personal touch back. Tune in for our heartfelt Happy Holidays message and a big thank you to all our listeners for your unwavering support. Additional materials: www.superdatascience.com/742 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 22 Dec 2023 - 02min - 975 - 741: How to Visualize Data Effectively, with Prof. Alberto Cairo
Data visualization remains at the forefront as Dr. Alberto Cairo from the University of Miami guides us beyond numerical figures, exploring the art of weaving compelling narratives through data. In his book, "The Art of Insight," he reveals the varied motivations driving visualization experts and highlights the serene, meditative process inherent in crafting visualizations. Emphasizing the fusion of scientific principles and personal style for effective data communication, Dr. Cairo also discusses with Jon the impending impact of AI on both interactive and static graphics. This episode is brought to you by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader, by Intel and HPE Ezmeral Software Solutions (https://hpe.com/ezmeral/chatwithyourdata), and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Alberto's book, The Art of Insight [04:07] • How to transform data into engaging visuals [07:06] • What it takes to enter a meditation-like flow state when creating visualizations [11:21] • How to balance the science of visualization with one’s personal style [29:29] • The importance of Smart Brevity for great data visualizations [37:32] • How data visualization can drive social change [42:31] • How diversity in designers enriches the field [52:07] • The future of data visualizations [59:10] Additional materials: www.superdatascience.com/741
Tue, 19 Dec 2023 - 1h 18min - 974 - 740: Q*: OpenAI's Rumored AGI Breakthrough
Sam Altman’s exit and rehiring, AGI, and OpenAI’s Q*: In this week’s Five-Minute Friday, Jon Krohn peeks behind the curtains of OpenAI, where development of the world’s first model that can solve complex, nonlinear logical problems, Q*, might be well underway. This episode casts light on the rumors behind OpenAI’s Q*, what its emergence could mean for the future of AI, and the controversies already surrounding an agent that has not yet reached the market. Additional materials: www.superdatascience.com/740 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 15 Dec 2023 - 11min - 973 - 739: AI is Eating Biology and Chemistry, with Dr. Ingmar Schuster
AI protein design, machine learning and cancer care, and pharmaceuticals: At Exazyme, CEO and Co-Founder Ingmar Schuster uses AI to design proteins. He speaks with Jon Krohn about their wider applications in pharmaceuticals and chemistry, how kernel methods make the design of synthetic biological catalysts more efficient, and when to use shallow machine learning over deep learning. This episode is brought to you by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • On designing proteins with AI [03:14] • Designing proteins at Exazyme [08:22] • About kernel methods [18:10] • The importance of human-led approaches in protein research [35:44] • Europe’s focus on AI regulation [43:45] • Deep vs shallow in AI [59:35] • How a background in academia helps with entrepreneurship [1:09:17] Additional materials: www.superdatascience.com/739
Tue, 12 Dec 2023 - 1h 21min - 972 - 738: Engineering Biomaterials with Generative AI, with Dr. Pierre Salvy
Bioengineering and Generative AI converge under the visionary leadership of Dr. Pierre Salvy at Cambrium GmbH, propelling material science into uncharted territories. He sits down with Jon Krohn live at Merantix A.I. Campus in Berlin to discuss how he's transforming material design, exemplified by his swift development of NovaColl, a vegan collagen crafted within two years. Additional materials: www.superdatascience.com/738 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 08 Dec 2023 - 22min - 971 - 737: scikit-learn's Past, Present and Future, with scikit-learn co-founder Dr. Gaël Varoquaux
scikit-learn co-founder Gaël Varoquaux and Jon Krohn are live at the historic Sorbonne in Paris, where they discuss the evolution of scikit-learn. From its origins as a memory-efficient Python implementation of support vector machines to its present-day status as a pivotal resource in machine learning, Gaël paints a vivid picture of its remarkable growth. Join us for a glimpse into scikit-learn's evolution, the realm of open-source collaboration, and the transformative power of data-driven insights in today's dynamic data landscape. This episode is brought to you by Gurobi (gurobi.com/sds), the Decision Intelligence Leader, by Data Universe (https://datauniverse2024.com), the out-of-this-world data conference, and by CloudWolf (www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • The early beginnings and growth of scikit-learn [05:34] • Development principles of scikit-learn [18:05] • How to apply scikit-learn to your ML problem [21:16] • Resource-efficiency and scikit-learn development [25:32] • How to contribute to an open-source project like scikit-learn yourself [38:21] • The future of scikit-learn [51:13] • Gaël on the social-impact data projects in his Soda lab [1:02:33] • Why domain expertise and statistical rigor are more important than ever [1:11:24] Additional materials: www.superdatascience.com/737
Tue, 05 Dec 2023 - 1h 30min - 970 - 736: How to Officially Certify your AI Model, with Jan Zawadzki
AI certification and EU regulation: Jan Zawadzki, CTO and Co-Managing Director of Certif.ai, talks to Jon Krohn about the future of certification for AI startups and keeping within rigorous international regulations. Additional materials: www.superdatascience.com/736 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 01 Dec 2023 - 14min - 969 - 735: A.I. Product Management, with Google DeepMind's Head of Product, Mehdi Ghissassi
Artificial General Intelligence, AlphaGo, and Google DeepMind: Jon Krohn speaks to Mehdi Ghissassi, Director of Product Management at Google DeepMind, about the ethics and social impact of AI, keeping up with AI releases with safety in mind, and other pressing AI problems that keep him awake at night. In this episode, Mehdi and Jon also take a broader look at the current AI landscape, the opportunities for AI investors and startups, and what AI product managers need to get ahead. This episode is brought to you by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader, and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • How DeepMind seeks to ‘solve intelligence’ [05:14] • The impact of AGI’s capabilities on medicine [16:37] • How the general public might come to apply future AI systems [28:09] • How working on product development for Africa has shaped Mehdi’s perspective on AI’s potential and challenges [37:17] • How to stay on top of rapid changes in AI [39:17] • What investors look for in AI startups [59:16] • Tips for product managers [1:03:34] Additional materials: www.superdatascience.com/735
Tue, 28 Nov 2023 - 1h 21min - 968 - 734: Humanoid Robot Soccer, with the Dutch RoboCup Team
Robot Soccer takes center stage as Jon Krohn and Dário Catarrinho, Secretary of the Dutch Nao Team and an AI student at the University of Amsterdam, discuss the intricate machine learning that enables robots to navigate the field, make decisions in real-time, respond to sound, and compete against each other in a gripping display of skill and strategy. Additional materials: www.superdatascience.com/734 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 24 Nov 2023 - 17min - 967 - 733: OpenAssistant: The Open-Source ChatGPT Alternative, with Dr. Yannic Kilcher
Yannic Kilcher, a leading ML YouTuber and DeepJudge CTO, teams up with Jon Krohn this week to delve into the open-source ML community, the technology powering Yannic’s Swiss-based startup, and the significant implications of adversarial examples in ML. Tune in as they also unpack Yannic's approach to tracking ML research, future AI prospects and his startup challenges. This episode is brought to you by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader, and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • About the OpenAssistant project [03:39] • Alignment issues in open-source vs closed-source [08:36] • Alternative formulas vital for crafting superior LLMs [20:29] • Strategies to foster open-source LLM ecosystems [27:07] • Yannic's pioneering work in legal document processing at DeepJudge [31:31] • Comprehensive overview of adversarial examples [1:04:02] • The future AI landscape [1:18:08] • Startup challenges [1:25:35] Additional materials: www.superdatascience.com/733
Tue, 21 Nov 2023 - 1h 40min - 966 - 732: Data Science for Astronomy, with Dr. Daniela Huppenkothen
In this episode, Jon Krohn explores our vast universe with Daniela Huppenkothen at the University of Amsterdam's astronomy department in a wide-ranging discussion about building instrumentation for telescopes, collecting data from outer space, and how to sort out astronomy’s problem of enormous amounts of data. Additional materials: www.superdatascience.com/732 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 17 Nov 2023 - 44min - 965 - 731: A.I. Agents Will Develop Their Own Distinct Culture, with Nell Watson
Ethics and machine intelligence pioneer Nell Watson speaks to host Jon Krohn about the differences between AI ethics and AI safety, how crying wolf may result in future complications for AI development and the importance of ensuring IEEE standards to mitigate and regulate AI risks. She also touches on what she considers a “second Enlightenment”, in which we may start to form intimate relationships with AI—to both parties’ benefit. This episode is brought to you by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader, and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • AI ethics and AI safety [05:30] • How "moving fast" could break the world [18:07] • The shifting relationship between humans and machines [29:54] • International ethics standards, and their review process [52:10] • Current and future ethical standards [1:05:31] • Building a universal basic income with AI [1:19:23] Additional materials: www.superdatascience.com/731
Tue, 14 Nov 2023 - 1h 28min - 964 - 730: How GitHub Operationalizes AI for Teamwide Collaboration and Productivity
In this episode, Kyle Daigle, COO of GitHub, joins Jon Krohn to discuss the transformative impact of generative AI tools like GitHub Copilot. Learn how these tools streamline software development, enhance collaboration, and accelerate code reviews. Discover innovative approaches to collaboration and innersourcing, reshaping the future of teamwork in the digital age. Additional materials: www.superdatascience.com/730 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 10 Nov 2023 - 18min - 963 - 729: Universal Principles of Intelligence (Across Humans and Machines), with Prof. Blake Richards
Dr. Blake Richards discusses the world of AI and human cognition this week. Learn about the essence of intelligence, the ways AI research informs our understanding of the human brain, and discover the potential future scenarios where AI and humanity might intersect. This episode is brought to you by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader, and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Blake's research and his take on intelligence [09:56] • How we can evaluate progress in artificial general intelligence [15:54] • Blake's thoughts on biomimicry [20:57] • Why Blake thinks the fears regarding AI are overdone [25:38] • The most effective strategies to mitigate AI fears without hindering innovation [35:31] • What steps can we take to ensure that AI supports human flourishing [45:23] • The importance of interpreting neuroscience data through the lens of ML [55:08] • Backpropagation, gradient descent and the brain [1:17:32] Additional materials: www.superdatascience.com/729
Tue, 07 Nov 2023 - 1h 46min - 962 - 728: Use Contrastive Search to get Human-Quality LLM Outputs
Learn how to achieve human-like outputs from LLMs in this week’s Five-Minute Friday with Jon Krohn. Understand the various current methods available to decode and generate text, as well as the differences between them. Find out about greedy search, beam search, sampling, and contrastive search, and how you can use them to create incredibly useful LLMs. Additional materials: www.superdatascience.com/728 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 03 Nov 2023 - 05min - 961 - 727: Unmasking A.I. Injustice, with Dr. Joy Buolamwini
Coded bias, intersectionality in AI, and computer vision: Founder of the Algorithmic Justice League Joy Buolamwini talks to host Jon Krohn about the impact of exclusion and inclusion in datasets, the need to address intersectionality when identifying racial, age, or gender-based prejudice in machine learning tools, protections for artists and creative practitioners against AI, and the role that AI may have in combating systemic racism. This episode is brought to you by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader, and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What coded bias is [06:49] • The problem with bias in machine learning datasets [18:41] • The Incoding Movement [42:08] • About the Pilot Parliaments Benchmark [52:07] • Ethics and the future of AI [1:20:10] • The potential for AI to end systemic racism [1:32:59] Additional materials: www.superdatascience.com/727
Tue, 31 Oct 2023 - 1h 45min - 960 - 726: Seven Factors for Successful Data Leadership
Ben Jones, CEO of Data Literacy, discusses the seven crucial components of effective data leadership. From ethics to technology and fostering a data-centric culture, Jones provides actionable insights and practical examples. Tune in to empower your organization with purposeful and ethical data strategies from day one. Additional materials: www.superdatascience.com/726 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 27 Oct 2023 - 32min - 959 - 725: Neuroscience + Machine Learning, with Google DeepMind's Dr. Kim Stachenfeld
Dr. Kim Stachenfeld, Research Scientist at Google DeepMind and Affiliate Professor at Columbia University, delves into the realms of AI and neuroscience as she discusses computer-based simulations of the human brain, the efficiency of language in compression, and the neuroscience theories shaping the future of artificial intelligence. Discover the secrets behind memory formation, cognitive enhancement, and the potential of Artificial General Intelligence (AGI) in this thought-provoking episode. This episode is brought to you by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader, and by ODSC (https://odsc.com), the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • The importance of simulations in the context of human intelligence [05:44] • The basic approach to simulating human intelligence or physical systems [09:30] • Will simulations help us realize AGI? [37:21] • The cross-disciplinary potential of LLMs [40:20] • The special role of our brain’s hippocampus in memory formation [1:05:15] • Kim's research on reinforcement learning and neural representation [1:15:02] • Compression in representation learning [1:38:51] • What skills should an aspiring computational neuroscientist hone [1:50:30] Additional materials: www.superdatascience.com/725
Tue, 24 Oct 2023 - 1h 58min - 958 - 724: Decoding Speech from Raw Brain Activity, with Dr. David Moses
In this Friday episode, host Jon Krohn talks to UCSF’s David Moses about BRAVO (Brain-Computer Interface Restoration of Arm and Voice), a study led by Edward Chang and Karunesh Ganguly that helps patients who have lost the ability to speak to communicate once again via a speech neuroprosthesis. Postdoctoral engineer David Moses, who is a part of BRAVO, reveals the data and machine learning models that help BRAVO predict the words and facial expressions that a paralyzed patient is trying to form via their brain activity, crucially helping patients to communicate with medical practitioners and loved ones. Additional materials: www.superdatascience.com/724 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 20 Oct 2023 - 42min - 957 - 723: Mathematical Optimization, with Jerry Yurchisin
Mathematical optimization should be known to every data scientist: Jon Krohn speaks to Jerry Yurchisin, Data Science Strategist at Gurobi, the decision-making technology and best-kept secret of 80% of America’s leading enterprises. This episode is brought to you by the Zerve data science dev environment (https://zerve.ai), the Decision Intelligence Leader, by ODSC (https://odsc.com), the Open Data Science Conference, and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What mathematical optimization is [04:27] • How Gurobi solver works [29:01] • How to use Gurobi with Python [36:08] • Coding and algebra resources [41:14] • When to use mathematical optimization and machine learning together [54:23] • Using mathematical optimization in natural language processing [1:01:00] Additional materials: www.superdatascience.com/723
Tue, 17 Oct 2023 - 1h 37min - 956 - 722: AI Emits Far Less Carbon Than Humans (Doing the Same Task)
This episode delves into an intriguing research paper from top institutions like UC Irvine and MIT, analyzing the carbon emissions of AI-driven writing and illustrating versus traditional human methods. The findings might surprise you. Is AI the more eco-friendly option? Listen now to explore this compelling intersection of technology and sustainability. Additional materials: www.superdatascience.com/722 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 13 Oct 2023 - 07min - 955 - 721: Quantum Machine Learning, with Dr. Amira Abbas
Dr. Amira Abbas, Quantum Computing Researcher at the University of Amsterdam, explores the captivating world of Quantum Machine Learning. Learn about the distinct characteristics of qubits and the vital processes of Quantum ML. For those keen on exploring further, Amira offers noteworthy ML tool suggestions to kickstart your journey in Quantum Computing. This episode is brought to you by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader, by ODSC (https://odsc.com), the Open Data Science Conference, and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Quantum computing vs classical computing [03:42] • What is quantum entanglement [11:45] • What is a qubit [15:07] • The best problems for quantum ML [30:08] • Three distinct steps in quantum ML and its potential [39:06] • Quantum neural networks [49:03] • What Amira's working on at the moment [1:10:20] • How to get started in quantum ML [1:21:06] • Amira's recommended ML tools for quantum computing [1:30:39] Additional materials: www.superdatascience.com/721
Tue, 10 Oct 2023 - 1h 42min - 954 - 720: OpenAI’s DALL-E 3, Image Chat and Web Search
DALL-E may no longer be playing second fiddle to Midjourney, thanks to OpenAI’s latest model for generative AI art, DALL-E 3. Host Jon Krohn breaks down how the newest model goes beyond producing incredible artistic images by following your written brief to the letter. Additional materials: www.superdatascience.com/720 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 06 Oct 2023 - 12min - 953 - 719: Computational Mathematics and Fluid Dynamics, with Prof. Margot Gerritsen
In this episode, Margot Gerritsen and Jon Krohn discuss the fundamentals of computational mathematics and its application in studying fluid dynamics. Margot also talks about how her synesthesia led to a lifelong interest in math, using computational mathematics to predict airflow, and why it is so important that underrepresented groups in data science become more visible through organizations like Women in Data Science. This episode is brought to you by the Zerve data science dev environment (https://zerve.ai), by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader, and by ODSC (https://odsc.com), the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • About computational mathematics and its relation to data science [03:19] • Margot’s current research into emissions simulation [15:05] • Computational Mathematics: Real-World Applications [33:18] • The importance of wind tunnels in testing designs [47:54] • The beauty of linear algebra [1:05:59] • Synesthesia: Seeing Numbers as Colors [1:16:33] • About Women in Data Science [1:24:59] Additional materials: www.superdatascience.com/719
Tue, 03 Oct 2023 - 1h 47min - 952 - 718: ChatGPT Custom Instructions: A Major, Easy Hack for Data Scientists
Elevate your ChatGPT game with a useful custom instruction. Tune in to hear Jon’s trick for maximizing ChatGPT’s potential. Additional materials: www.superdatascience.com/718 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 29 Sep 2023 - 05min - 951 - 717: Overcoming Adversaries with A.I. for Cybersecurity, with Dr. Dan Shiebler
Dr. Dan Shiebler, Head of ML at Abnormal Security, joins Jon Krohn this week and unveils the intricacies of cybercrime detection and email protection, and the role of AI in future challenges. This episode is brought to you by Grafbase (https://grafbase.com), the unified data layer, by ODSC (https://odsc.com/), the Open Data Science Conference, and by Modelbit (https://modelbit.com), for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • The heuristic and “intermediate” ML models that they develop at Abnormal Security [07:08] • How Dan uses LLMs at Abnormal Security [15:46] • How false negatives are individually the biggest classification error to avoid in cybersecurity [20:49] • How head-to-head competitor analysis helps refine models [34:34] • Resilient ML in cybersecurity [38:36] • Abnormal Security’s routine for updating their models [52:37] • AI's impact on the urban world [1:09:57] • How to stay updated in data science and AI [1:13:46] Additional materials: www.superdatascience.com/717
Tue, 26 Sep 2023 - 1h 20min - 950 - 716: Happiness and Life-Fulfillment Hacks
Jon Krohn's 94-year-old grandmother, Annie, who's bursting with life and wisdom, shares her recipe for lifelong happiness and how relationships and daily intentions play an integral role. Annie also shares her curious take on modern technology. Get inspired by her infectious joy and perspective on life. Additional materials: www.superdatascience.com/716 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 22 Sep 2023 - 13min - 949 - 715: Make Better Decisions with Data, with Dr. Allen Downey
Join us as Dr. Allen Downey, renowned author and professor, shares insights from his upcoming book 'Probably Overthinking It,' breaking down underused techniques like Survival Analysis, explaining common paradoxes, and discussing the dynamic Overton Window. This episode is brought to you by the Zerve data science dev environment (https://zerve.ai), by Modelbit (https://modelbit.com), for deploying models in seconds, and by Grafbase (https://grafbase.com), the unified data layer. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Why interpreting data is not always easy [06:21] • What is Survival Analysis [15:32] • Preston's Paradox [22:09] • Are you Normal? [36:52] • How to better prepare for rare “Black Swan” events [42:48] • What is an Overton Window? [53:06] • What is the base rate fallacy? [1:23:31] • How to protect yourself from biased samples [1:33:39] • Simpson’s Paradox [1:42:43] Additional materials: www.superdatascience.com/715
Tue, 19 Sep 2023 - 1h 55min - 948 - 714: Using A.I. to Overcome Blindness and Thrive as a Data Scientist
In this Friday episode, guest Tim Albiges explores with host Jon Krohn how people with blindness can have a lucrative and fulfilling career in data science, how Tim’s PhD thesis applied machine learning to help diagnose chronic respiratory diseases, and the communication tools that blind people can use to live a full and independent life. Additional materials: www.superdatascience.com/714 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 15 Sep 2023 - 36min - 947 - 713: Llama 2, Toolformer and BLOOM: Open-Source LLMs with Meta's Dr. Thomas Scialom
Artificial General Intelligence, RLHF’s application in AI, and how entrepreneurs can enter the AI industry: Meta’s AI Research Scientist Thomas Scialom gives us behind-the-scenes insights into developing Llama 2 and what’s in the works for Llama 3. With host Jon Krohn, he discusses the future of Artificial General Intelligence, why the Galactica science-focused LLM was taken down, and what he learned from it. This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au), by Grafbase (https://grafbase.com), the unified data layer, and by Modelbit (https://modelbit.com), for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Llama 2: Behind the Scenes of Today’s Top Open-Source LLM [05:04] • Responsible use of Llama 2 [15:26] • Toolformer: LLM That Learns How to Use External Tools [24:57] • Galactica: The Science-Specific LLM and Why It Was Brought Down [36:57] • Is AGI Around the Corner? [57:03] • Advice for AI entrepreneurs [1:05:46] • How Thomas develops and manages large-scale AI projects [1:14:42] Additional materials: www.superdatascience.com/713
Tue, 12 Sep 2023 - 1h 25min - 946 - 712: Code Llama
Code Llama might just be starting the revolution for how data scientists code. In this Five-Minute Friday, host Jon Krohn investigates the suite of models under the free-to-use Code Llama and how to find the best fit for your project’s needs. Additional materials: www.superdatascience.com/712 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 08 Sep 2023 - 06min - 945 - 711: Image, Video and 3D-Model Generation from Natural Language, with Dr. Ajay Jain
In this episode, host Jon Krohn explores with his guest Ajay Jain, Co-Founder of Genmo.ai, how creative general intelligence could take the video industry by storm. They also discuss the models that got Genmo to this point, the applications of NeRF, and how understanding human psychology is so essential to developing models that output high-fidelity video. This episode is brought to you by the Zerve data science dev environment (https://zerve.ai), by Grafbase (https://grafbase.com), the unified data layer, and by Modelbit (https://modelbit.com), for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • About Genmo.ai and the term “creative general intelligence” [03:47] • Why Ajay started Genmo.ai [09:26] • The increased performance of multimodal models [21:12] • All about Denoising Diffusion Probabilistic Models (DDPMs) [31:03] • The application of Neural Radiance Fields (NeRF) [55:26] • Predicting pedestrian behavior at Uber [1:01:50] • How to save money in the process of training models [1:12:42] Additional materials: www.superdatascience.com/711
Tue, 05 Sep 2023 - 1h 26min - 944 - 710: LangChain: Create LLM Applications Easily in Python
Discover the power of Large Language Models with Kris Ograbek as he unravels the intricacies of LangChain and showcases a chatbot in action, all while putting our host Jon Krohn in the hot seat! Additional materials: www.superdatascience.com/710 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 01 Sep 2023 - 1h 03min - 943 - 709: Big A.I. R&D Risks Reap Big Societal Rewards, with Meta's Dr. Laurens van der Maaten
Meta's Senior Research Director, Dr. Laurens van der Maaten, takes center stage to unravel the captivating realm of AI innovation. Learn about his groundbreaking contributions, including pioneering the t-SNE dimensionality reduction technique and harnessing AI for novel protein synthesis, climate change mitigation, and wearable materials simulation. Join us to explore the transformative power of AI across diverse domains and gain a glimpse into its future societal implications. This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au), by Modelbit (https://modelbit.com), for deploying models in seconds, and by Grafbase (https://grafbase.com), the unified data layer. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Large-scale learning of image recognition models on web data [05:05] • Evolutionary Scale Modeling protein models [16:45] • Fighting climate change by building an A.I. model [29:49] • The CrypTen privacy-preserving ML framework [38:36] • Concerns about adversarial examples [53:25] • Laurens’ t-SNE algorithm [58:56] • How to make a big impact [1:07:25] Additional materials: www.superdatascience.com/709
Tue, 29 Aug 2023 - 1h 20min - 942 - 708: ChatGPT Code Interpreter: 5 Hacks for Data Scientists
On this week’s Five-Minute Friday, host Jon Krohn gives five reasons why he is so excited about ChatGPT’s Code Interpreter and walks listeners through its capabilities with a practical example. Additional materials: www.superdatascience.com/708 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 25 Aug 2023 - 22min - 941 - 707: Vicuña, Gorilla, Chatbot Arena and Socially Beneficial LLMs, with Prof. Joey Gonzalez
LLM Vicuña, Chatbot Arena, and the race to increase LLM context windows: This episode’s guest Joey Gonzalez talks to Jon Krohn about developing models and platforms that leverage and improve LLMs, as well as the future of AI development and access. This episode is brought to you by the AWS Insiders Podcast (https://pod.link/1608453414), by Modelbit (https://modelbit.com), for deploying models in seconds, and by Grafbase (https://grafbase.com), the unified data layer. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Vicuña: How the revolutionary LLM came to be [03:35] • Chatbot Arena: The leading LLM leaderboard [09:47] • Trusting LLM results [17:54] • Gorilla: The open-source ChatGPT plugin alternative [32:13] • About LMSYS and long context windows [47:48] • Open- vs closed-source LLMs: Which is better? [1:01:39] • Aqueduct [1:16:49] • Founding GraphLab [1:27:02] • How AI will positively impact society in the coming decades [1:33:23] Additional materials: www.superdatascience.com/707
Tue, 22 Aug 2023 - 1h 47min - 940 - 706: Large Language Model Leaderboards and Benchmarks
In this episode, Caterina Constantinescu dives deep into Large Language Models (LLMs), spotlighting top leaderboards, evaluation benchmarks, and real-world user perceptions. Plus, discover the challenges of dataset contamination and the intricacies of platforms like HELM and Chatbot Arena. Additional materials: www.superdatascience.com/706 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 18 Aug 2023 - 33min - 939 - 705: Feeding the World with ML-Powered Precision Agriculture
Join Jon Krohn as he chats with Syngenta Group's Feroz Sheikh, Jeremy Groeteke, and Thomas Jung about the digital revolution in agriculture. Learn how data science is transforming farming, from precision techniques to global food solutions. A compelling blend of tech and nature. This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au) and by Modelbit (https://modelbit.com), for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What is precision agriculture? [09:43] • What is computational agronomy? [12:30] • How Syngenta helps growers optimize yields [21:37] • How to bridge the gap between R&D and the real world [33:58] • What is generative chemistry? [37:52] • How generative chemistry accelerates the discovery of new compounds [41:55] • How you could make a big social impact in agriculture with data science [56:22] • How to go about designing ML models for agriculture [1:00:27] Additional materials: www.superdatascience.com/705
Tue, 15 Aug 2023 - 1h 29min - 938 - 704: Jon’s “Generative A.I. with LLMs” Hands-on Training
Take on the world of GPT and learn to develop your own, commercially successful Large Language Models (LLMs) with Jon Krohn’s comprehensive, guided training video for generative AI. Get to grips with the technology, learn which tools to use, and find out how to get an eye for business-viable models with Jon’s (ad-)free educational video. Additional materials: www.superdatascience.com/704 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 11 Aug 2023 - 04min - 937 - 703: How Data Happened: A History, with Columbia Prof. Chris Wiggins
The history of statistics, interdisciplinarity, and data and society: Chris Wiggins talks with Jon Krohn about the power dynamics of data, the transformation of the field of biology through data-driven approaches to genetic sequencing, and the New York Times data science team’s cutting-edge approach to its tech stack. This episode is brought to you by the AWS Insiders Podcast (https://pod.link/1608453414) and by Modelbit (https://modelbit.com), for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • The importance of the humanities in data science [09:18] • How data science “rearranges” power [17:19] • An overview of How Data Happened [20:36] • The controversial nature of Bayes’ theorem [29:16] • Why we need to consider data ethics [34:00] • How biology came to adopt data science into its field [45:44] • The data science tech stack at the New York Times [49:18] Additional materials: www.superdatascience.com/703
Tue, 08 Aug 2023 - 1h 09min - 936 - 702: Llama 2 — It's Time to Upgrade your Open-Source LLM
This week, Jon Krohn is examining Meta's newly released open-source large language model, Llama 2, highlighting its commercial prospects, immense capacity, model variety, and unique 'time awareness' feature. He also discusses its innovative two-stage RLHF approach that enhances its performance. Additional materials: www.superdatascience.com/702 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 04 Aug 2023 - 10min - 935 - 701: Generative A.I. without the Privacy Risks (with Prof. Raluca Ada Popa)
Dr. Raluca Ada Popa, renowned computer scientist, entrepreneur, and President of Opaque Systems, joins Jon Krohn to share her insights on securely interacting with AI APIs like OpenAI's GPT-4, the pros and cons of open vs. closed-source AI development, and the seamless operation of compute pipelines across multiple clouds. This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au) and by Modelbit (https://modelbit.com), for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What is a confidential computing platform? [04:31] • How to get started with confidential computing [12:10] • The challenges of confidential computing and LLMs [21:11] • How to safeguard your data while using commercial LLMs like GPT-4 [38:00] • Open-source vs closed-source [52:28] • Raluca's PreVail cybersecurity company [1:01:50] • Combining entrepreneurship and academic career [1:04:03] • DARE Program [1:10:39] Additional materials: www.superdatascience.com/701
Tue, 01 Aug 2023 - 1h 21min - 934 - 700: "The Dream of Life" by Alan Watts
Yoga and Hindu mythology: This special episode continues the thread of our centenary episodes, SDS 500: Yoga Nidra with Jes Allen and SDS 600: Yoga Nidra Practice with Steve Fazzari, which talked through guided meditation techniques to help improve posture and sleep and to expand consciousness. Inspired by these sessions, host Jon Krohn explores Hindu mythology via Alan Watts’ “The Dream of Life”. Additional materials: www.superdatascience.com/700 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 28 Jul 2023 - 04min - 933 - 699: The Modern Data Stack, with Harry Glaser
Model deployment, data warehouse options for running models, and how to best leverage BI tools: Harry Glaser and Jon Krohn discuss Modelbit’s capabilities for automatically turning ML models in notebooks into production-ready models, reducing the time and effort spent ‘translating’ information from one mode to another. Harry also expands on the importance of automating this task, and on how developments in ML modeling have widened access so that entire teams can analyze data, whatever their level of expertise. This episode is brought to you by the AWS Insiders Podcast (https://pod.link/1608453414). Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What the modern data stack is [03:28] • Version control for data scientists [13:30] • CI/CD, load balancing and logging [20:38] • Snowflake vs. Redshift [30:10] • How tools like Looker and Tableau help monitor models [35:26] Additional materials: www.superdatascience.com/699
Tue, 25 Jul 2023 - 50min - 932 - 698: How Firms Can Actually Adopt A.I., with Rehgan Avon
Company-wide AI adoption can take a lot of persuasion. Rehgan Avon talks to host Jon Krohn about why AI has become necessary for forward-thinking businesses and the steps to implement AI in an institution so that everyone benefits. Additional materials: www.superdatascience.com/698 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 21 Jul 2023 - 27min - 931 - 697: The (Short) Path to Artificial General Intelligence, with Dr. Ben Goertzel
AI visionary and CEO of SingularityNET Dr. Ben Goertzel provides a deep dive into the possible realization of Artificial General Intelligence (AGI) within 3-7 years. Explore the intriguing connections between self-awareness, consciousness, and the future of Artificial Super Intelligence (ASI) and discover the transformative societal changes that could arise. This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au), the AWS Insiders Podcast (https://pod.link/1608453414), and by Modelbit (https://modelbit.com), for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Decentralized and benevolent AGI [03:13] • The SingularityNET ecosystem [13:10] • Dr. Goertzel's vision for realizing AGI - combining DL with neuro-symbolic systems, genetic algorithms and knowledge graphs [25:50] • How reaching AGI will trigger Artificial Super Intelligence [38:51] • Dr. Goertzel's approach to AGI using OpenCog Hyperon [42:34] • Why Dr. Goertzel believes AGI will be positive for humankind [53:07] • How to ensure the AGI is benevolent [1:06:43] • How AGI or ASI may act ethically [1:13:50] Additional materials: www.superdatascience.com/697
Tue, 18 Jul 2023 - 1h 27min - 930 - 696: Brain-Computer Interfaces and Neural Decoding, with Prof. Bob Knight
Jon Krohn welcomes Professor Dr. Bob Knight to explore human intelligence, the prefrontal cortex, and the transformative potential of brain implants for data collection. Discover the pivotal role of machine learning in treating Parkinson's and delve into exciting future advancements. Additional materials: www.superdatascience.com/696 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 14 Jul 2023 - 1h 02min - 929 - 695: NLP with Transformers, feat. Hugging Face's Lewis Tunstall
What are transformers in AI, and how do they help developers to run LLMs efficiently and accurately? This is a key question in this week’s episode, where Hugging Face’s ML Engineer Lewis Tunstall sits down with host Jon Krohn to discuss encoders and decoders, and the importance of continuing to foster democratic environments like GitHub for creating open-source models. This episode is brought to you by the AWS Insiders Podcast (https://pod.link/1608453414), by https://WithFeeling.ai, the company bringing humanity into AI, and by Modelbit (https://modelbit.com), for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What a transformer is, and why it is so important for NLP [04:34] • Different types of transformers and how they vary [11:39] • Why it’s necessary to know how a transformer works [31:52] • Hugging Face’s role in the application of transformers [57:10] • Lewis Tunstall’s experience of working at Hugging Face [1:02:08] • How and where to start with Hugging Face libraries [1:18:27] • The necessity to democratize ML models in the future [1:25:25] Additional materials: www.superdatascience.com/695
Tue, 11 Jul 2023 - 1h 38min - 928 - 694: CatBoost: Powerful, efficient ML for large tabular datasets
Modeling tabular data and spreadsheets doesn’t have to be tedious with CatBoost’s open-source tree-boosting algorithm. CatBoost does what it says on the tin, blending categories with boosting that allows you to train your models faster and handle large datasets for ML tasks across multiple GPUs. In this week’s Five-Minute Friday, host Jon Krohn gets to grips with the technical components of CatBoost that give it the speed and accuracy so acclaimed by its users. Additional materials: www.superdatascience.com/694 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 07 Jul 2023 - 07min - 927 - 693: YOLO-NAS: The State of the Art in Machine Vision, with Harpreet Sahota
Harpreet Sahota, a data science expert and deep learning developer at Deci AI, joins Jon Krohn to explore the fascinating realm of object detection and the revolutionary YOLO-NAS model architecture. Discover how machine vision models have evolved and the techniques driving compute-efficient edge device applications. This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au), by https://WithFeeling.ai, the company bringing humanity into AI, and by Modelbit (https://modelbit.com), for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What is machine vision? [07:02] • Object detection and YOLO architectures [13:00] • Deci's YOLO-NAS: Optimal object detection model architecture [23:39] • Developer Relations [1:00:16] • Harpreet's 'top-down' approach to learning Deep Learning [1:06:50] Additional materials: www.superdatascience.com/693
Tue, 04 Jul 2023 - 1h 20min - 926 - 692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU
Join Jon as he navigates listeners through the innovative SpQR approach—a cutting-edge, lossless LLM weight compression technique that harnesses the power of quantization. Tune in as Jon delves into the four steps behind this groundbreaking method in this week's episode. Additional materials: www.superdatascience.com/692 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 30 Jun 2023 - 07min - 925 - 691: A.I. Accelerators: Hardware Specialized for Deep Learning
GPUs vs CPUs, chip design and the importance of chips in AI research: This highly technical episode is for anyone who wants to learn what goes into chip development and how to get into the competitive industry of accelerator design. With advice from expert guest Ron Diamant, Senior Principal Engineer at AWS, you’ll get a breakdown of the need-to-know technical terms, what chip engineers need to think about during the design phase and what the future holds for processing hardware. This episode is brought to you by Posit, the open-source data science company (https://posit.co), by the AWS Insiders Podcast (https://pod.link/1608453414), and by https://WithFeeling.ai, the company bringing humanity into AI. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What CPUs and GPUs are [05:29] • The differences between accelerators used for deep learning [14:31] • Trainium and Inferentia: AWS's A.I. Accelerators [22:10] • If model optimizations will lead to lower demand for hardware to process them [43:14] • How a chip designer goes about production [48:34] • Breaking down the technical terminology for chips (accelerator interconnect, dynamic execution, collective communications) [55:29] • The importance of AWS Neuron, a software development kit [1:15:42] • How Ron got his foot in the door with chip design [1:26:40] Additional materials: www.superdatascience.com/691
Tue, 27 Jun 2023 - 1h 34min - 924 - 690: How to Catch and Fix Harmful Generative A.I. Outputs
Krishna Gade, the founder and CEO of Fiddler.AI, discusses the challenges faced by Large Language Models (LLMs) in Generative AI, including inaccuracies, biases, and privacy risks. He emphasizes the importance of monitoring to build trust in AI and highlights Fiddler's explainability algorithms and pre-built bias detection tools as vital solutions. Additional materials: www.superdatascience.com/690 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 23 Jun 2023 - 26min - 923 - 689: Observing LLMs in Production to Automatically Catch Issues
Arize's Amber Roberts and Xander Song join Jon Krohn this week, sharing invaluable insights into ML Observability, drift detection, retraining strategies, and the crucial task of ensuring fairness and ethical considerations in AI development. This episode is brought to you by Posit, the open-source data science company (https://posit.co), by AWS Inferentia (go.aws/3zWS0au), and by Anaconda, the world's most popular Python distribution (https://superdatascience.com/anaconda). Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What is ML Observability [05:07] • What is Drift [08:18] • The different kinds of model drift [15:31] • How frequently production models should be retrained [25:15] • Arize's open-source product, Phoenix [30:49] • How ML Observability relates to discovering model biases [50:30] • Arize case studies [57:13] • What is a developer advocate [1:04:51] Additional materials: www.superdatascience.com/689
Tue, 20 Jun 2023 - 1h 18min - 922 - 688: Six Reasons Why Building LLM Products Is Tricky
Prompt injection, prompt engineering, context windows, and more: In this week’s Five-Minute Friday, Jon explains why anyone looking to build their own product leveraging LLMs should stop to consider these and three more issues before jumping in. Phillip Carter first outlined these six issues in his article “All the Hard Stuff Nobody Talks About when Building Products with LLMs”. Additional materials: www.superdatascience.com/688 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 16 Jun 2023 - 14min - 921 - 687: Generative Deep Learning, with David Foster
Autoencoders, transformers, latent space: Learn the elements of generative AI and hear what data scientist David Foster has to say about the potential for generative AI in music, as well as the role that world models play in blending generative AI with reinforcement learning. This episode is brought to you by Posit, the open-source data science company (https://posit.co), by Anaconda, the world's most popular Python distribution (superdatascience.com/anaconda), and by https://WithFeeling.ai, the company bringing humanity into AI. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Generative modeling vs discriminative modeling [04:21] • Generative AI for Music [13:12] • On the threats of AI [23:15] • Autoencoders Explained [38:36] • Noise in Generative AI [48:11] • What CLIP models are (Contrastive Language-Image Pre-training) [54:07] • What World Models are [1:00:40] • What a Transformer is [1:11:14] • How to use transformers for music generation [1:19:50] Additional materials: www.superdatascience.com/687
Tue, 13 Jun 2023 - 1h 46min - 920 - 686: Open-Source "Responsible A.I." Tools, with Ruth Yakubu
Microsoft’s Ruth Yakubu joins Jon Krohn to discuss Responsible AI principles and the open-source Responsible AI Toolbox, which allows users to assess their models for fairness, inclusiveness, privacy, explainability, accountability, and reliability before deployment. Additional materials: www.superdatascience.com/686 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 09 Jun 2023 - 29min - 919 - 685: Tools for Building Real-Time Machine Learning Applications, with Richmond Alake
Richmond Alake, a Machine Learning Architect at Slalom Build, sits down with Jon to share real-time ML insights, tools, and career experiences in a high-energy, high-impact episode. From his work at Slalom Build to his two AI startups, discover the software choices, ML tools, and front-end development techniques used by a leader in the field. This episode is brought to you by Posit, the open-source data science company (https://posit.co), by AWS Inferentia (go.aws/3zWS0au), and by https://WithFeeling.ai, the company bringing humanity into AI. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • What is a Machine Learning Architect? [03:09] • Richmond's startups [12:07] • Why Richmond started a podcast [29:51] • Richmond's new course on feature stores [38:05] • Why Richmond produces data science content [43:25] • Why All Data Scientists Should Write [51:30] Additional materials: www.superdatascience.com/685
Tue, 06 Jun 2023 - 1h 06min - 918 - 684: Get More Language Context out of your LLM
Open-source LLMs, FlashAttention and generative AI terminology: Host Jon Krohn gives us the lift we need to explore the next big steps in generative AI. Listen to the specific way in which Stanford University’s “exact attention” algorithm, FlashAttention, could become a competitor for GPT-4’s capabilities. Additional materials: www.superdatascience.com/684 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 02 Jun 2023 - 05min - 917 - 683: Contextual A.I. for Adapting to Adversaries, with Dr. Matar Haller
Monitoring malicious, user-generated content; contextual AI; adapting to novel evasion attempts: Matar Haller speaks to Jon Krohn about the challenges of identifying, analyzing and flagging malicious information online. In this episode, Matar explains how contextual AI and a “database of evil” can help resolve the multiple challenges of blocking dangerous content across a range of media, even those that are live-streamed. This episode is brought to you by Posit, the open-source data science company (posit.co), by Anaconda, the world's most popular Python distribution (superdatascience.com/anaconda), and by https://WithFeeling.ai, the company bringing humanity into AI. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • How ActiveFence helps its customers to moderate platform content [05:36] • How ActiveFence finds extreme social media users trying to evade detection [16:32] • How to monitor live-streaming content and analyze it for dangerous material [29:13] • The technologies ActiveFence uses to run its platform [35:54] • Matar’s experience of the Insight Fellows Program (Data Science Fellowship) [40:28] • Leadership opportunities for women in STEM [1:00:41] • Israel’s R&D edge for AI [1:13:19] Additional materials: www.superdatascience.com/683
Tue, 30 May 2023 - 1h 20min - 916 - 682: Business Intelligence Tools, with Mico Yuk
In this week's episode, Mico Yuk, host of 'Analytics on Fire', joins Jon Krohn to share her effective business intelligence and analytics framework, BIDS, for persuading key decision makers. She crowns one "power" tool as the analytics king and discusses emerging tools that could challenge its dominance. Tune in for unapologetic insights on future and current BI trends and happenings from the world of BI and analytics. Additional materials: www.superdatascience.com/682 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Fri, 26 May 2023 - 27min