Ben Lorica (original) (raw)

Ben Lorica

Ben Lorica is the former Chief Data Scientist at O’Reilly Media, and the former Program Chair of: the Strata Data Conference, the O’Reilly Artificial Intelligence Conference, and TensorFlow World. Ben is also an advisor to a few exciting startups and organizations: Databricks, Alluxio, Matroid, Anodot, Determined AI, Anyscale.io, Faculty.ai , Graphistry, Yakit, and The Center for Data Intensive Science + Open Commons Consortium (University of Chicago). He is the host and organizer of thedataexchange.media podcast.

Radar

The road to Software 2.0

December 10, 2019

It’s clear that AI can and will have a big influence on how we develop software.

A world of deepfakes

November 7, 2019

We need to remember that creating fakes is an application, not a tool—and that malicious applications are not the whole story.

Building and deploying AI applications and systems at scale

October 16, 2019

Ben Lorica and Roger Chen review how companies are building AI applications today.

September 25, 2019

Ben Lorica dives into emerging technologies for building data infrastructures and machine learning platforms.

How new tools in data and AI are being used in health care and medicine

September 3, 2019

An overview of applications of new tools for overcoming silos, and for creating and sharing high-quality data.

How organizations are sharpening their skills to better understand and use AI

August 26, 2019

To successfully implement AI technologies, companies need to take a holistic approach toward retraining their workforces.

Got speech? These guidelines will help you get started building voice applications

August 8, 2019

Speech adds another level of complexity to AI applications—today’s voice applications provide a very early glimpse of what is to come.

Managing machine learning in the enterprise: Lessons from banking and health care

July 15, 2019

A look at how guidelines from regulated industries can help shape your ML strategy.

RISELab’s AutoPandas hints at automation tech that will change the nature of software development

July 1, 2019

Neural-backed generators are a promising step toward practical program synthesis.

One simple chart: Who is interested in Spark NLP?

June 27, 2019

As we close in on its two-year anniversary, Spark NLP is proving itself a viable option for enterprise use.

AI and machine learning will require retraining your entire organization

June 26, 2019

To successfully integrate AI and machine learning technologies, companies need to take a more holistic approach toward training their workforce.

What are model governance and model operations?

June 19, 2019

A look at the landscape of tools for building and deploying robust, production-ready machine learning models.

The quest for high-quality data

June 18, 2019

Machine learning solutions for data integration, cleaning, and data generation are beginning to emerge.

AI adoption is being fueled by an improved tool ecosystem

June 11, 2019

We now are in the implementation phase for AI technologies.

Becoming a machine learning company means investing in foundational technologies

May 21, 2019

Companies successfully adopt machine learning either by building on existing data products and services, or by modernizing existing models and algorithms.

How AI and machine learning are improving customer experience

May 14, 2019

From data quality to personalization, to customer acquisition and retention, and beyond, AI and ML will shape the customer experience of the future.

Sustaining machine learning in the enterprise

May 1, 2019

Drawing insights from recent surveys, Ben Lorica analyzes important trends in machine learning.

Checking in on AI tools

April 17, 2019

Ben Lorica and Roger Chen assess the state of AI technologies and adoption in 2019.

The evolution and expanding utility of Ray

February 21, 2019

There are growing numbers of users and contributors to the framework, as well as libraries for reinforcement learning, AutoML, and data science.

Three surveys of AI adoption reveal key advice from more mature practices

February 20, 2019

An overview of emerging trends, known hurdles, and best practices in artificial intelligence.

Artificial intelligence and machine learning adoption in European enterprise

February 4, 2019

How companies in Europe are preparing for and adopting AI and ML technologies.

How companies are building sustainable AI and ML initiatives

January 29, 2019

A recent survey investigated how companies are approaching their AI and ML practices, and measured the sophistication of their efforts.

Overcoming barriers to AI adoption

January 16, 2019

The program for our Artificial Intelligence Conference in New York City will showcase tools, best practices, and use cases from companies leading the way in AI adoption.

January 15, 2019

How new developments in automation, machine deception, hardware, and more will shape AI.

January 8, 2019

From infrastructure to tools to training, Ben Lorica looks at what’s ahead for data.

Managing risk in machine learning

November 13, 2018

Considerations for a world where ML models are becoming mission critical.

The state of automation technologies

October 10, 2018

Ben Lorica and Roger Chen highlight recent trends in data, compute, and machine learning.

Why it’s hard to design fair machine learning models

September 27, 2018

The O’Reilly Data Show Podcast: Sharad Goel and Sam Corbett-Davies on the limitations of popular mathematical formalizations of fairness.

Preserving privacy and security in machine learning

September 12, 2018

Ben Lorica offers an overview of recent tools for building privacy-preserving and secure machine learning products and services.

Unlocking innovation in AI

September 6, 2018

Ben Lorica and Roger Chen provide a glimpse into tools and trends poised to accelerate AI innovation.

3 promising areas for AI skills development

August 14, 2018

O'Reilly survey results and usage data reveal growing trends and topics in artificial intelligence.

5 findings from O’Reilly’s machine learning adoption survey companies should know

August 7, 2018

New survey results highlight the ways organizations are handling machine learning's move to the mainstream.

How companies can get started with AI

July 30, 2018

The program for our Artificial Intelligence Conference in London is structured to help companies that are still very much in the early stages of AI adoption.

How to take machine learning from exploration to implementation

July 23, 2018

Recognizing the interest in ML, the Strata Data Conference program is designed to help companies adopt ML across large sections of their existing operations.

What machine learning means for software development

July 11, 2018

“Human in the loop” software development will be a big part of the future.

How to think about AI and machine learning technologies, and their roles in automation

June 11, 2018

An overview and framework, including tools that can be used to enable automation.

Building a stronger data ecosystem

May 23, 2018

Ben Lorica looks at the problems we’re facing as we collect and store data, particularly when our machine learning models require huge amounts of labeled data.

Understanding automation

May 1, 2018

Ben Lorica and Roger Chen discuss the state of reinforcement learning and automation.

4 things business leaders should know as they explore AI and deep learning

April 10, 2018

Our survey reveals how organizations are using tools, techniques, and training to apply AI through deep learning.

Privacy in the age of machine learning

March 7, 2018

Ben Lorica explores emerging security best practices for business intelligence, machine learning, and mobile computing products.

We need to build machine learning tools to augment machine learning engineers

January 11, 2018

As the use of analytics proliferate, companies will need to be able to identify models that are breaking bad.

January 9, 2018

From methods to tools to ethics, Ben Lorica looks at what's in store for artificial intelligence.

January 3, 2018

AI, blockchain, payment regionalization, and other fintech trends to watch.

What lies ahead for data

January 2, 2018

How new developments in algorithms, machine learning, analytics, infrastructure, data ethics, and culture will shape the data world.

The state of AI adoption

December 18, 2017

An overview of adoption, and suggestions to companies interested in AI technologies.

Practical applications of reinforcement learning in industry

December 14, 2017

An overview of commercial and industrial applications of reinforcement learning.

Responsible deployment of machine learning

December 6, 2017

Ben Lorica explains how to guard against flaws and failures in your machine learning deployments.

How companies can navigate the age of machine learning

October 24, 2017

To become a “machine learning company,” you need tools and processes to overcome challenges in data, engineering, and models.

The age of machine learning

September 27, 2017

Ben Lorica discusses the state of machine learning.

The state of AI adoption

September 19, 2017

AI Conference chairs Ben Lorica and Roger Chen reveal the current AI trends they've observed in industry.

The current state of applied data science

August 24, 2017

Recent trends in practical use and a discussion of key bottlenecks in supervised machine learning.

Why continuous learning is key to AI

August 7, 2017

A look ahead at the tools and methods for learning from sparse feedback.

What are machine learning engineers?

June 6, 2017

A new role focused on creating data products and making data science work in production.

January 3, 2017

From tools, to research, to ethics, Ben Lorica looks at what’s in store for artificial intelligence in 2017.

From deep learning to decoupling, here are the data trends to watch in the year ahead.

What is Artificial Intelligence?

June 29, 2016

Mike Loukides and Ben Lorica examine factors that have made AI a hot topic in recent years, today's successful AI systems, and where AI may be headed.

How intelligent data platforms are powering smart cities

October 14, 2015

Smart cities and smart nations run on data.

Big data’s big ideas

October 28, 2014

From cognitive augmentation to artificial intelligence, here's a look at the major forces shaping the data world.

Content

One simple chart: Who is interested in Apache Pulsar?

August 28, 2019

Multi-layer architecture, scalability, multitenancy, and durability are just some of the reasons companies have been using Pulsar.

One simple graphic: Researchers love PyTorch and TensorFlow

July 25, 2019

Interest in PyTorch among researchers is growing rapidly.

Specialized tools for machine learning development and model governance are becoming essential

April 2, 2019

Why companies are turning to specialized machine learning tools like MLflow.

Sustaining machine learning in the enterprise

March 27, 2019

Drawing insights from recent surveys, Ben Lorica analyzes important trends in machine learning.

You created a machine learning application. Now make sure it’s secure.

February 28, 2019

The software industry has demonstrated, all too clearly, what happens when you don’t pay attention to security.

Core technologies and tools for AI, big data, and cloud computing

February 11, 2019

Highlights and use cases from companies that are building the technologies needed to sustain their use of analytics and machine learning.

Deep automation in machine learning

December 19, 2018

We need to do more than automate model building with autoML; we need to automate tasks at every stage of the data pipeline.

Assessing progress in automation technologies

December 6, 2018

When it comes to automation of existing tasks and workflows, you need not adopt an “all or nothing” attitude.

Notes from the first Ray meetup

August 15, 2018

Ray is beginning to be used to power large-scale, real-time AI applications.

Data collection and data markets in the age of privacy and machine learning

July 18, 2018

While models and algorithms garner most of the media coverage, this is a great time to be thinking about building tools in data.

A new benchmark suite for machine learning

May 16, 2018

MLPerf is a new set of benchmarks compiled by a growing list of industry and academic contributors.

How to build analytic products in an age when data privacy has become critical

May 3, 2018

Privacy-preserving analytics is not only possible, but with GDPR about to come online, it will become necessary to incorporate privacy in your data products.

Building tools for the AI applications of tomorrow

April 26, 2018

We’re currently laying the foundation for future generations of AI applications, but we aren’t there yet.

How companies around the world apply machine learning

April 3, 2018

Strata Data London will introduce technologies and techniques; showcase use cases; and highlight the importance of ethics, privacy, and security.

What happens when AI experts from Silicon Valley and China meet

March 7, 2018

Why we're taking the AI Conference to Beijing.

Introducing RLlib: A composable and scalable reinforcement learning library

January 19, 2018

RISE Lab’s Ray platform adds libraries for reinforcement learning and hyperparameter tuning.

Put machine learning to work in the real world

January 17, 2018

The sessions and training courses at Strata Data San Jose 2018 will focus on practical use cases of machine learning for data scientists, engineers, managers, and executives.

Use deep learning on data you already have

January 18, 2017

Putting deep learning into practice with new tools, frameworks, and future developments.

3 ideas to add to your data science toolkit

July 27, 2016

Techniques to address overfitting, hyperparameter tuning, and model interpretability.

The next 10 years of Apache Hadoop

July 18, 2016

Doug Cutting, Tom White, and Ben Lorica explore Hadoop's role over the coming decade.

The next 10 years of Apache Hadoop

March 31, 2016

Apache Hadoop co-founders Doug Cutting and Mike Cafarella explore the future of Hadoop.

Compressed representations in the age of big data

January 21, 2016

Emerging trends in intelligent mobile applications and distributed computing.

January 3, 2016

Promising topics in data that we'll be watching closely in the year ahead.

Ask your data new questions

November 3, 2015

Consolidating data across silos improves business insight.

We need open and vendor-neutral metadata services

October 11, 2015

Comprehensive metadata collection and analysis can pave the way for many interesting applications.

Specialized and hybrid data management and processing engines

September 30, 2015

A new crop of interesting solutions for the complexity of operating multiple systems in a distributed computing setting.

Showcasing the real-time processing revival

August 31, 2015

Tools and learning resources for building intelligent, real-time products.

Learning Paths: a new way to build data skills

August 18, 2015

Logical and well-crafted collections of data video courses get you where you need to go.

Why data preparation frameworks rely on human-in-the-loop systems

July 1, 2015

The O'Reilly Data Show Podcast: Ihab Ilyas on building data wrangling and data enrichment tools in academia and industry.

Apache Spark: Powering applications on-premise and in the cloud

June 3, 2015

The O'Reilly Data Show Podcast: Patrick Wendell on the state of the Spark ecosystem.

Announcing Cassandra certification

May 26, 2015

A new partnership between O’Reilly and DataStax offers certification and training in Cassandra.

More tools for managing and reproducing complex data projects

April 28, 2015

A survey of the landscape shows the types of tools remain the same, but interfaces continue to improve.

A real-time processing revival

April 1, 2015

Things are moving fast in the stream processing world.

Let’s build open source tensor libraries for data science

March 16, 2015

Tensor methods for machine learning are fast, accurate, and scalable, but we'll need well-developed libraries.

Network structure and dynamics in online social systems

February 4, 2015

Understanding information cascades, viral content, and significant relationships.

The evolution of GraphLab

January 28, 2015

The O'Reilly Data Show Podcast: Carlos Guestrin on the early days of GraphLab and the evolution of GraphLab Create.

Building and deploying large-scale machine learning pipelines

January 22, 2015

We need primitives, pipeline synthesis tools, and most importantly, error analysis and verification.

Lessons from next-generation data wrangling tools

January 7, 2015

Drawing inspiration from recent advances in data preparation.

Building Apache Kafka from scratch

December 3, 2014

In this episode of the O'Reilly Data Show Podcast, Jay Kreps talks about data integration, event data, and the Internet of Things.

The science of moving dots: The O’Reilly Data Show Podcast

November 19, 2014

Rajiv Maheswaran talks about the tools and techniques required to analyze new kinds of sports data.

Active learning: Best practices for creating labeled data sets

August 21, 2014

Learn simple ways to improve data models by cleaning up and tweaking the distribution of training data.

Scaling up data frames

August 7, 2014

New frameworks for interactive business analysis and advanced analytics fuel the rise in tabular data objects.

There are many use cases for graph databases and analytics

July 9, 2014

Business users are becoming more comfortable with graph analytics.

Streamlining feature engineering

June 14, 2014

Researchers and startups are building tools that enable feature discovery.

A growing number of applications are being built with Spark

June 1, 2014

Many more companies want to highlight how they're using Apache Spark in production.

Welcome to Intelligence Matters

May 13, 2014

Casting a critical eye on the exciting developments in the world of AI.

What I use for data visualization

January 25, 2014

An array of tools for tackling data visualizations.

IPython: A unified environment for interactive data analysis

January 18, 2014

It has roots in academic scientific computing, but has features that appeal to many data scientists.

Six reasons why I recommend scikit-learn

December 29, 2013

It's an extensive, well-documented, and accessible, curated library of machine-learning models

Data scientists and data engineers like Python and Scala

November 30, 2013

Python and Scala are popular among members of several well-attended SF Bay Area Meetups

Data wrangling gets a fresh look

November 23, 2013

We are in the early days of productivity technology in data science

How companies are using Spark

November 9, 2013

The inaugural Spark Summit will feature a wide variety of real-world applications

Stream processing and mining just got more interesting

September 21, 2013

A general purpose stream processing framework from the team behind Kafka and new techniques for computing approximate quantiles.

How Twitter monitors millions of time series

September 14, 2013

A distributed, near real-time system simplifies the collection, storage, and mining of massive amounts of event data

Data analysis: Just one component of the data science workflow

September 7, 2013

Specialized tools run the risk of being replaced by others that have more coverage.

Data analysis tools target non-experts

August 24, 2013

Tools simplify the application of advanced analytics and the interpretation of results

Interactive big data analysis using approximate answers

August 17, 2013

As data sizes continue to grow, interactive query systems may start adopting the sampling approach central to BlinkDB.

Surfacing anomalies and patterns in machine data

August 10, 2013

Compelling large-scale data platforms originate from the world of IT Operations

Data scientists tackle the analytic lifecycle

July 13, 2013

A new crop of data science tools for deploying, monitoring, and maintaining models

Improving options for unlocking your graph data

May 18, 2013

Graph data is an area that has attracted many enthusiastic entrepreneurs and developers

11 essential features that visual analysis tools should have

May 12, 2013

Visual analysis tools are adding advanced analytics for big data

Tachyon: An open source, distributed, fault-tolerant, in-memory file system

April 27, 2013

Tachyon enables data sharing across frameworks and performs operations at memory speed

The re-emergence of time-series

April 6, 2013

Researchers begin to scale up pattern recognition, machine-learning, and data management tools.

Python data tools just keep getting better

March 23, 2013

A variety of tools are making data science tasks easy to do in Python

Shark: Real-time queries and analytics for big data

November 26, 2012

Shark is 100X faster than Hive for SQL, and 100X faster than Hadoop for machine-learning

Seven reasons why I like Spark

August 21, 2012

Spark is becoming a key part of a big data toolkit.