Ben Lorica (original) (raw)
Ben Lorica is the former Chief Data Scientist at O’Reilly Media, and the former Program Chair of: the Strata Data Conference, the O’Reilly Artificial Intelligence Conference, and TensorFlow World. Ben is also an advisor to a few exciting startups and organizations: Databricks, Alluxio, Matroid, Anodot, Determined AI, Anyscale.io, Faculty.ai , Graphistry, Yakit, and The Center for Data Intensive Science + Open Commons Consortium (University of Chicago). He is the host and organizer of thedataexchange.media podcast.
Radar
The road to Software 2.0
December 10, 2019
It’s clear that AI can and will have a big influence on how we develop software.
A world of deepfakes
November 7, 2019
We need to remember that creating fakes is an application, not a tool—and that malicious applications are not the whole story.
Building and deploying AI applications and systems at scale
October 16, 2019
Ben Lorica and Roger Chen review how companies are building AI applications today.
Recent trends in data and machine learning technologies
September 25, 2019
Ben Lorica dives into emerging technologies for building data infrastructures and machine learning platforms.
How new tools in data and AI are being used in health care and medicine
September 3, 2019
An overview of applications of new tools for overcoming silos, and for creating and sharing high-quality data.
How organizations are sharpening their skills to better understand and use AI
August 26, 2019
To successfully implement AI technologies, companies need to take a holistic approach toward retraining their workforces.
Got speech? These guidelines will help you get started building voice applications
August 8, 2019
Speech adds another level of complexity to AI applications—today’s voice applications provide a very early glimpse of what is to come.
Managing machine learning in the enterprise: Lessons from banking and health care
July 15, 2019
A look at how guidelines from regulated industries can help shape your ML strategy.
RISELab’s AutoPandas hints at automation tech that will change the nature of software development
July 1, 2019
Neural-backed generators are a promising step toward practical program synthesis.
One simple chart: Who is interested in Spark NLP?
June 27, 2019
As we close in on its two-year anniversary, Spark NLP is proving itself a viable option for enterprise use.
AI and machine learning will require retraining your entire organization
June 26, 2019
To successfully integrate AI and machine learning technologies, companies need to take a more holistic approach toward training their workforce.
What are model governance and model operations?
June 19, 2019
A look at the landscape of tools for building and deploying robust, production-ready machine learning models.
The quest for high-quality data
June 18, 2019
Machine learning solutions for data integration, cleaning, and data generation are beginning to emerge.
AI adoption is being fueled by an improved tool ecosystem
June 11, 2019
We now are in the implementation phase for AI technologies.
Becoming a machine learning company means investing in foundational technologies
May 21, 2019
Companies successfully adopt machine learning either by building on existing data products and services, or by modernizing existing models and algorithms.
How AI and machine learning are improving customer experience
May 14, 2019
From data quality to personalization, to customer acquisition and retention, and beyond, AI and ML will shape the customer experience of the future.
Sustaining machine learning in the enterprise
May 1, 2019
Drawing insights from recent surveys, Ben Lorica analyzes important trends in machine learning.
Checking in on AI tools
April 17, 2019
Ben Lorica and Roger Chen assess the state of AI technologies and adoption in 2019.
The evolution and expanding utility of Ray
February 21, 2019
There are growing numbers of users and contributors to the framework, as well as libraries for reinforcement learning, AutoML, and data science.
Three surveys of AI adoption reveal key advice from more mature practices
February 20, 2019
An overview of emerging trends, known hurdles, and best practices in artificial intelligence.
Artificial intelligence and machine learning adoption in European enterprise
February 4, 2019
How companies in Europe are preparing for and adopting AI and ML technologies.
How companies are building sustainable AI and ML initiatives
January 29, 2019
A recent survey investigated how companies are approaching their AI and ML practices, and measured the sophistication of their efforts.
Overcoming barriers to AI adoption
January 16, 2019
The program for our Artificial Intelligence Conference in New York City will showcase tools, best practices, and use cases from companies leading the way in AI adoption.
9 AI trends on our radar
January 15, 2019
How new developments in automation, machine deception, hardware, and more will shape AI.
7 data trends on our radar
January 8, 2019
From infrastructure to tools to training, Ben Lorica looks at what’s ahead for data.
Managing risk in machine learning
November 13, 2018
Considerations for a world where ML models are becoming mission critical.
The state of automation technologies
October 10, 2018
Ben Lorica and Roger Chen highlight recent trends in data, compute, and machine learning.
Why it’s hard to design fair machine learning models
September 27, 2018
The O’Reilly Data Show Podcast: Sharad Goel and Sam Corbett-Davies on the limitations of popular mathematical formalizations of fairness.
Preserving privacy and security in machine learning
September 12, 2018
Ben Lorica offers an overview of recent tools for building privacy-preserving and secure machine learning products and services.
Unlocking innovation in AI
September 6, 2018
Ben Lorica and Roger Chen provide a glimpse into tools and trends poised to accelerate AI innovation.
3 promising areas for AI skills development
August 14, 2018
O'Reilly survey results and usage data reveal growing trends and topics in artificial intelligence.
5 findings from O’Reilly’s machine learning adoption survey companies should know
August 7, 2018
New survey results highlight the ways organizations are handling machine learning's move to the mainstream.
How companies can get started with AI
July 30, 2018
The program for our Artificial Intelligence Conference in London is structured to help companies that are still very much in the early stages of AI adoption.
How to take machine learning from exploration to implementation
July 23, 2018
Recognizing the interest in ML, the Strata Data Conference program is designed to help companies adopt ML across large sections of their existing operations.
What machine learning means for software development
July 11, 2018
“Human in the loop” software development will be a big part of the future.
How to think about AI and machine learning technologies, and their roles in automation
June 11, 2018
An overview and framework, including tools that can be used to enable automation.
Building a stronger data ecosystem
May 23, 2018
Ben Lorica looks at the problems we’re facing as we collect and store data, particularly when our machine learning models require huge amounts of labeled data.
Understanding automation
May 1, 2018
Ben Lorica and Roger Chen discuss the state of reinforcement learning and automation.
4 things business leaders should know as they explore AI and deep learning
April 10, 2018
Our survey reveals how organizations are using tools, techniques, and training to apply AI through deep learning.
Privacy in the age of machine learning
March 7, 2018
Ben Lorica explores emerging security best practices for business intelligence, machine learning, and mobile computing products.
We need to build machine learning tools to augment machine learning engineers
January 11, 2018
As the use of analytics proliferate, companies will need to be able to identify models that are breaking bad.
5 AI trends to watch in 2018
January 9, 2018
From methods to tools to ethics, Ben Lorica looks at what's in store for artificial intelligence.
8 fintech trends on our radar
January 3, 2018
AI, blockchain, payment regionalization, and other fintech trends to watch.
What lies ahead for data
January 2, 2018
How new developments in algorithms, machine learning, analytics, infrastructure, data ethics, and culture will shape the data world.
The state of AI adoption
December 18, 2017
An overview of adoption, and suggestions to companies interested in AI technologies.
Practical applications of reinforcement learning in industry
December 14, 2017
An overview of commercial and industrial applications of reinforcement learning.
Responsible deployment of machine learning
December 6, 2017
Ben Lorica explains how to guard against flaws and failures in your machine learning deployments.
How companies can navigate the age of machine learning
October 24, 2017
To become a “machine learning company,” you need tools and processes to overcome challenges in data, engineering, and models.
The age of machine learning
September 27, 2017
Ben Lorica discusses the state of machine learning.
The state of AI adoption
September 19, 2017
AI Conference chairs Ben Lorica and Roger Chen reveal the current AI trends they've observed in industry.
The current state of applied data science
August 24, 2017
Recent trends in practical use and a discussion of key bottlenecks in supervised machine learning.
Why continuous learning is key to AI
August 7, 2017
A look ahead at the tools and methods for learning from sparse feedback.
What are machine learning engineers?
June 6, 2017
A new role focused on creating data products and making data science work in production.
7 AI trends to watch in 2017
January 3, 2017
From tools, to research, to ethics, Ben Lorica looks at what’s in store for artificial intelligence in 2017.
8 data trends on our radar for 2017
From deep learning to decoupling, here are the data trends to watch in the year ahead.
What is Artificial Intelligence?
June 29, 2016
Mike Loukides and Ben Lorica examine factors that have made AI a hot topic in recent years, today's successful AI systems, and where AI may be headed.
How intelligent data platforms are powering smart cities
October 14, 2015
Smart cities and smart nations run on data.
Big data’s big ideas
October 28, 2014
From cognitive augmentation to artificial intelligence, here's a look at the major forces shaping the data world.
Content
One simple chart: Who is interested in Apache Pulsar?
August 28, 2019
Multi-layer architecture, scalability, multitenancy, and durability are just some of the reasons companies have been using Pulsar.
One simple graphic: Researchers love PyTorch and TensorFlow
July 25, 2019
Interest in PyTorch among researchers is growing rapidly.
Specialized tools for machine learning development and model governance are becoming essential
April 2, 2019
Why companies are turning to specialized machine learning tools like MLflow.
Sustaining machine learning in the enterprise
March 27, 2019
Drawing insights from recent surveys, Ben Lorica analyzes important trends in machine learning.
You created a machine learning application. Now make sure it’s secure.
February 28, 2019
The software industry has demonstrated, all too clearly, what happens when you don’t pay attention to security.
Core technologies and tools for AI, big data, and cloud computing
February 11, 2019
Highlights and use cases from companies that are building the technologies needed to sustain their use of analytics and machine learning.
Deep automation in machine learning
December 19, 2018
We need to do more than automate model building with autoML; we need to automate tasks at every stage of the data pipeline.
Assessing progress in automation technologies
December 6, 2018
When it comes to automation of existing tasks and workflows, you need not adopt an “all or nothing” attitude.
Notes from the first Ray meetup
August 15, 2018
Ray is beginning to be used to power large-scale, real-time AI applications.
Data collection and data markets in the age of privacy and machine learning
July 18, 2018
While models and algorithms garner most of the media coverage, this is a great time to be thinking about building tools in data.
A new benchmark suite for machine learning
May 16, 2018
MLPerf is a new set of benchmarks compiled by a growing list of industry and academic contributors.
How to build analytic products in an age when data privacy has become critical
May 3, 2018
Privacy-preserving analytics is not only possible, but with GDPR about to come online, it will become necessary to incorporate privacy in your data products.
Building tools for the AI applications of tomorrow
April 26, 2018
We’re currently laying the foundation for future generations of AI applications, but we aren’t there yet.
How companies around the world apply machine learning
April 3, 2018
Strata Data London will introduce technologies and techniques; showcase use cases; and highlight the importance of ethics, privacy, and security.
What happens when AI experts from Silicon Valley and China meet
March 7, 2018
Why we're taking the AI Conference to Beijing.
Introducing RLlib: A composable and scalable reinforcement learning library
January 19, 2018
RISE Lab’s Ray platform adds libraries for reinforcement learning and hyperparameter tuning.
Put machine learning to work in the real world
January 17, 2018
The sessions and training courses at Strata Data San Jose 2018 will focus on practical use cases of machine learning for data scientists, engineers, managers, and executives.
Use deep learning on data you already have
January 18, 2017
Putting deep learning into practice with new tools, frameworks, and future developments.
3 ideas to add to your data science toolkit
July 27, 2016
Techniques to address overfitting, hyperparameter tuning, and model interpretability.
The next 10 years of Apache Hadoop
July 18, 2016
Doug Cutting, Tom White, and Ben Lorica explore Hadoop's role over the coming decade.
The next 10 years of Apache Hadoop
March 31, 2016
Apache Hadoop co-founders Doug Cutting and Mike Cafarella explore the future of Hadoop.
Compressed representations in the age of big data
January 21, 2016
Emerging trends in intelligent mobile applications and distributed computing.
10 data trends on our radar for 2016
January 3, 2016
Promising topics in data that we'll be watching closely in the year ahead.
Ask your data new questions
November 3, 2015
Consolidating data across silos improves business insight.
We need open and vendor-neutral metadata services
October 11, 2015
Comprehensive metadata collection and analysis can pave the way for many interesting applications.
Specialized and hybrid data management and processing engines
September 30, 2015
A new crop of interesting solutions for the complexity of operating multiple systems in a distributed computing setting.
Showcasing the real-time processing revival
August 31, 2015
Tools and learning resources for building intelligent, real-time products.
Learning Paths: a new way to build data skills
August 18, 2015
Logical and well-crafted collections of data video courses get you where you need to go.
Why data preparation frameworks rely on human-in-the-loop systems
July 1, 2015
The O'Reilly Data Show Podcast: Ihab Ilyas on building data wrangling and data enrichment tools in academia and industry.
Apache Spark: Powering applications on-premise and in the cloud
June 3, 2015
The O'Reilly Data Show Podcast: Patrick Wendell on the state of the Spark ecosystem.
Announcing Cassandra certification
May 26, 2015
A new partnership between O’Reilly and DataStax offers certification and training in Cassandra.
More tools for managing and reproducing complex data projects
April 28, 2015
A survey of the landscape shows the types of tools remain the same, but interfaces continue to improve.
A real-time processing revival
April 1, 2015
Things are moving fast in the stream processing world.
Let’s build open source tensor libraries for data science
March 16, 2015
Tensor methods for machine learning are fast, accurate, and scalable, but we'll need well-developed libraries.
Network structure and dynamics in online social systems
February 4, 2015
Understanding information cascades, viral content, and significant relationships.
The evolution of GraphLab
January 28, 2015
The O'Reilly Data Show Podcast: Carlos Guestrin on the early days of GraphLab and the evolution of GraphLab Create.
Building and deploying large-scale machine learning pipelines
January 22, 2015
We need primitives, pipeline synthesis tools, and most importantly, error analysis and verification.
Lessons from next-generation data wrangling tools
January 7, 2015
Drawing inspiration from recent advances in data preparation.
Building Apache Kafka from scratch
December 3, 2014
In this episode of the O'Reilly Data Show Podcast, Jay Kreps talks about data integration, event data, and the Internet of Things.
The science of moving dots: The O’Reilly Data Show Podcast
November 19, 2014
Rajiv Maheswaran talks about the tools and techniques required to analyze new kinds of sports data.
Active learning: Best practices for creating labeled data sets
August 21, 2014
Learn simple ways to improve data models by cleaning up and tweaking the distribution of training data.
Scaling up data frames
August 7, 2014
New frameworks for interactive business analysis and advanced analytics fuel the rise in tabular data objects.
There are many use cases for graph databases and analytics
July 9, 2014
Business users are becoming more comfortable with graph analytics.
Streamlining feature engineering
June 14, 2014
Researchers and startups are building tools that enable feature discovery.
A growing number of applications are being built with Spark
June 1, 2014
Many more companies want to highlight how they're using Apache Spark in production.
Welcome to Intelligence Matters
May 13, 2014
Casting a critical eye on the exciting developments in the world of AI.
What I use for data visualization
January 25, 2014
An array of tools for tackling data visualizations.
IPython: A unified environment for interactive data analysis
January 18, 2014
It has roots in academic scientific computing, but has features that appeal to many data scientists.
Six reasons why I recommend scikit-learn
December 29, 2013
It's an extensive, well-documented, and accessible, curated library of machine-learning models
Data scientists and data engineers like Python and Scala
November 30, 2013
Python and Scala are popular among members of several well-attended SF Bay Area Meetups
Data wrangling gets a fresh look
November 23, 2013
We are in the early days of productivity technology in data science
How companies are using Spark
November 9, 2013
The inaugural Spark Summit will feature a wide variety of real-world applications
Stream processing and mining just got more interesting
September 21, 2013
A general purpose stream processing framework from the team behind Kafka and new techniques for computing approximate quantiles.
How Twitter monitors millions of time series
September 14, 2013
A distributed, near real-time system simplifies the collection, storage, and mining of massive amounts of event data
Data analysis: Just one component of the data science workflow
September 7, 2013
Specialized tools run the risk of being replaced by others that have more coverage.
Data analysis tools target non-experts
August 24, 2013
Tools simplify the application of advanced analytics and the interpretation of results
Interactive big data analysis using approximate answers
August 17, 2013
As data sizes continue to grow, interactive query systems may start adopting the sampling approach central to BlinkDB.
Surfacing anomalies and patterns in machine data
August 10, 2013
Compelling large-scale data platforms originate from the world of IT Operations
Data scientists tackle the analytic lifecycle
July 13, 2013
A new crop of data science tools for deploying, monitoring, and maintaining models
Improving options for unlocking your graph data
May 18, 2013
Graph data is an area that has attracted many enthusiastic entrepreneurs and developers
11 essential features that visual analysis tools should have
May 12, 2013
Visual analysis tools are adding advanced analytics for big data
Tachyon: An open source, distributed, fault-tolerant, in-memory file system
April 27, 2013
Tachyon enables data sharing across frameworks and performs operations at memory speed
The re-emergence of time-series
April 6, 2013
Researchers begin to scale up pattern recognition, machine-learning, and data management tools.
Python data tools just keep getting better
March 23, 2013
A variety of tools are making data science tasks easy to do in Python
Shark: Real-time queries and analytics for big data
November 26, 2012
Shark is 100X faster than Hive for SQL, and 100X faster than Hadoop for machine-learning
Seven reasons why I like Spark
August 21, 2012
Spark is becoming a key part of a big data toolkit.