Rapid Advances in Computer Science and Opportunities for Society - - PowerPoint PPT Presentation

▶

Oct 14, 2023 243 likes •1.03k views

Rapid Advances in Computer Science and Opportunities for Society European CS Presentation, October 2010 Alfred Spector VP, Research and Special Initiatives Rapid Advances in Computer Science & Opportunities for Society Abstract

SLIDE 1

Rapid Advances in Computer Science and Opportunities for Society

European CS Presentation, October 2010

Alfred Spector VP, Research and Special Initiatives

SLIDE 2

Rapid Advances in Computer Science & Opportunities for Society

Information and Communication Technologies have had a rapid impact on society, and –amazingly—the pace of innovation continues to

accelerate. This innovation is catalyzed by ever-increasing hardware and

networking capabilities, the growth in internet usage, as well as important advances in basic and applied computer science. In this talk, I will describe some of the research that Google is undertaking (for example, in machine translation, semantic processing, and information management), and discuss some of the likely beneficial impacts on our society – for example, in science, the humanities, education, philanthropic activities, and more. I’ll conclude my presentation with some interesting challenges from both a technology and policy point of view.

Abstract

SLIDE 3

Outline

Google Prodigiousness Advances in the Field: examples Translation Speech Vision Cloud-based collaboration around structured-data Operations Research Semantic Processing Beneficial Societal Impacts: examples Earth Engine Google Health Other Health Efforts Crisis Response Digital Humanities Education A Technical Themes Challenges

SLIDE 4

Mission

Organizing the world’s information and Making it universally accessible and useful.

SLIDE 5

Google and Commerce

Over 1 million AdWords advertisers worldwide Over 1 million AdSense publishers worldwide Via the Google Ad Network, AdSense publishers reach over 80% of global internet users in 100 countries and 20 languages YouTube is monetizing over a billion video views per week globally In 2009, Google generated $54 billion of economic activity for American businesses, website publishers, and non-profits

SLIDE 6

Prodigiousness

Giga 109, Tera 1012, Peta 1015, Exa 1018, Zetta1021 Publicized: Bigtable of 70 petabytes, 10M ops/sec. Warehouse computing possibilities? 100 x 10 x 20 x 20 x 40 = 16,000,000 nodes… Some representative numbers: Storage: 1018 -> 1020-21 Users: 109 -> 1010 Devices: 10? -> 1012 Network: 1020, now, ->1021/yr 32 KB/sec. for 1B people Apps: 105 -> 106-7 or more E.g., embedded car systems: 30-50 ECUs, 100M lines of code

SLIDE 7

A variety of science engineering challenges

SLIDE 8

Focus on Innovation that Benefits our Users Focus on Research and Engineering

Commitment to advancing technology Rich domain of work due to our mission Grand challenge problems Internal consensus that production issues are

ften as challenging/fun as pure invention

Technical leverage

1. Google Common Distributed System
2. A Focus on Services
3. Empiricism and a Holistic Approach to Design

SLIDE 9

Our Innovation Culture

Focus on talent Distributed across the organization Impacting Google necessitates broad, diverse involvement in science and engineering Research is done both in our research team and in

ur engineering organization, organized
pportunistically

Teams benefit greatly From mutual talent From Google’s comparative advantages to our scale and broad use From service-based architecture (“ease” of working in vivo)

SLIDE 10

Ideal Distributed Computing

Device s

SLIDE 11

Research Challenges in Ideal Distributed Computing

Alternative designs that would give better energy efficiency at lower utilization Server O.S. design aimed at many highly-connected machines in

ne building

Unifying abstractions for exploiting parallelism beyond inter- transaction parallelism and map-reduce Latency reduction A general model of replication including consistency choices, explained and codified Machine learning techniques applied to monitoring/controlling such systems Automatic, dynamic world-wide placement of data & computation to minimize latency and/or cost, given constraints on Building retrieval systems that efficiently and usably deal with ACLs Holistic models of privacy The user interface to the user’s diverse processing and state

SLIDE 12

Totally Transparent Processing

D: The set of all end-user access devices L: The set of all human languages M: The set of all modalities C: The set of all corpora

Personal Computers Phone Media Players/Readers Telematics Set-top Boxes Appliances Health devices … Current languages Historical languages Other forms of human notation Possible language specialization Formal languages … Text Image Audio Video Graphics Other sensor-based data … The normal web The deep web Periodicals Books Catalogs Blogs Geodata Scientific datasets Health data …

For all d in D, all l in L, all m in M, and all c in C

SLIDE 13

Totally Transparent Processing

SLIDE 14

“Hybrid” Intelligence

To extend the capability of people, not in isolation Aggregation of empirical signal is exceedingly valuable Ex: Feedback in Information Retrieval; e.g., in ranking or spelling correction Machine learning; e.g., image content analysis, speech recognition with semi-supervised learning

SLIDE 15

Research Challenges in Transparent Computing & Hybrid Intelligence

Endless applications, with very new user interface implications Addressing limits to data Techniques to integrate user-feedback in acceptable fashions Approaches to new signal Explanation, scale, and variance minimization in machine learning Information fusion/learning across diverse signals – The Combination Hypothesis, more generally Usability: devices and subpopulations Privacy

SLIDE 16

Domains of Application

Search engines Translation Speech recognition Vision Remedial Education Personal health Epidemiology Economic prediction Societal/environmental optimization Social Networking in ever more clever/useful ways Humanities and Social Sciences Multi-player gaming

SLIDE 17

Translation

SLIDE 18

Machine Translation @ Google

Statistical Machine Translation Model translation process with a statistical model Learning from data: monolingual & bilingual More data: better translation quality Computationally expensive approach Models have many hundreds of Gigabyte of data (Moore's law helps here) Applying syntax information as a signal Results: Much better translation quality Ongoing progress More research groups, ... 58 languages (so far) recently: Haitian Creole, Urdu, Georgian, ..., Latin

SLIDE 19

Grand Challenges

Morphology: translating into morphologically rich languages e.g. Russian, Hungarian need: morphology-aware translation models Reliability: some translation mistakes more severe than others:

hotel - Montreal Heath Ledger - Tom Cruise

Research: How to detect 'crazy' translations? Long-distance reordering: 'simple' case: SVO, SOV (one) approach: parse source & reorder issue: parsing accuracy for

ut-of-domain texts

Finding all Training Data

SLIDE 20

How about Poetry?

Paper at EMNLP 2010 conference: “Poetic” Statistical Machine Translation: Rhyme and Meter, D.Genzel, J.Uszkoreit, F.Och, EMNLP, 2010. Approach: Enforce meter and rhyme as extra constraints (similar to language model) E.g. iambic pentameter: stress pattern 0101010101 Produce most 'probable' translation that obeys constraints ("Function follows form") Example output (couplet in amphibrachic tetrameter An officer stated that three were arrested and that the equipment is currently tested.

SLIDE 21

Speech

SLIDE 22

Goals for Speech Technology at Google

Much of the world’s information is spoken – we need to recognize it before we can organize it:

YouTube transcription and translation (breaking the language barrier for YouTube access) Voicemail transcription

Mobile is the fastest growing and most widespread platform for communication and services that has ever existed

Spoken input and output is key to usability Our goal is completely ubiquitous availability of speech i/o (every application/service, every usage scenario, every language)

How do we get there?

Delivery from the cloud – support constant iteration and refinement Operating at large scale – train huge statistical models on huge amounts of data

SLIDE 23

Learning from use - without human transcription Challenges:

How do we grow the model to take advantage of the data? (richer models of accent, speaker, noise, etc.) Huge computational demands Infrastructure demands – parallelization – leverage Google software environment

Training Acoustic Models w/Unsupervised Learning

Supervised vs. unsupervised training - hours of data vs. error rate

SLIDE 24

Vision

SLIDE 25

Computer Vision

Advance state-of-the art in 3 key areas of image/audio/video analysis and apply results to our multimedia products. Semantic Interpretation: Generate human understandable description of content. (eg. auto-tagging videos on YouTube, Image annotation, porn classification etc.) Matching: Find similar entities from a large corpus. (eg. "find similar" on image search, video fingerprinting for YouTube etc ). Synthesis: Generate better images/video by understanding the statistics of a large corpus of images. (eg. better facades in 3D building on Google Earth, automatic shadow removal from areal images etc.)

SLIDE 26

Semantic Interpretation sample problem - Video Annotation

Video metadata has a cognitive cost on the user because they have to type it in, be careful about what keywords they use, and in general try to make their video searchable Many uploaders don’t have the motivation, or energy to provide proper metadata Noisy metadata hurts everyone – spam, misspellings, 1337, acronyms, etc.

SLIDE 27

Cloud-based Computing Structured Data

SLIDE 28

Structured Data on the Web

Discovery and search for structured data: The deep Web -- significant gap in coverage Structured tables on the Web -- not leveraged in search Enable easy creation, management, sharing and publishing of structured data: Fusion Tables: www.google.com/fusiontables

SLIDE 29

Google Fusion Tables

host, manage, collaborate on, visualize, and publish data tables online

What can I do with Fusion Tables? Host data online - and stay in control control can be at the level of columns or rows Re-use data without making copies Collaborate on the details Merge data from multiple tables Comment on individual rows, columns or cells Make a map (or chart or timeline) in minutes! Manage data via our site or an API

Fusion Table Example Gallery

SLIDE 30

Easy Data Upload, Attribution recorded

SLIDE 31

Easily Create Informative Maps

SLIDE 32

Easily Create Informative Maps

baby steps towards the dream platform DEMO:

circle of blue

SLIDE 33

Cloud-based Computing Prediction

SLIDE 34

1. Upload
2. Train

Upload your training data to Google Storage Build a model from your data Make new predictions

3. Predict

Machine learning as a web service

Smart Apps for every developer

RESTful HTTP service
Simple integration

Prediction API

SLIDE 35

Under the hood, many classifiers/regressors. Recent research, efficient and theoretically principled methods for distributed learning (NIPS-09, HLT-10):

Network costs can be reduced by an order of magnitude with minimal loss in classifier accuracy.

Under the API

SLIDE 36

Operations Reseach and Optimization

SLIDE 37

Operations Research Challenges

Size: Optimization is often NP Complete Increasing the size by 1 doubles the search space. The tools are barely keeping up with the problems. Uncertainty: Data is often fuzzy. How do you route cars when there are roadblocks, new

ne-ways, traffic jams?

Can you use optimization in on-line algorithm connected to users? How well can you optimize against forecasted data, how do you react if the forecast is bad? User expectation and requirements: The definition of problems is also unclear. What is the

bjective? What is a good solution? Can I violate this

requirement? By how much?

SLIDE 38

Operations Research Opportunities

Machine Learning can help us in two ways: By providing guidance towards good solutions. By qualifying valid solutions. By reducing the search space. Large computing resources means we can try a bit harder. Crowd-Sourcing means better data, better feedback, better evaluations of algorithms and solutions. Having all our code open-source means we can collaborate on building the best set of tools. See http://code.google.com/p/or-tools

SLIDE 39

Semantic Processing

SLIDE 40

Web Inference and Learning

Goal: better understanding of Web content and user intent Method: algorithms that draw reliable semantic inferences from the wealth of evidence implicit in massive Web data

How to interpret this term in this context? Does this sentence answer that question? Will this user click on that ad?

Learning: create concise representations to support good inferences

SLIDE 41

Meaning from the Web

Elementary semantic inference: what are the possible classes for each instance?

SLIDE 42

Combination wins

Combined graph: 1.4M nodes, 75M edges

SLIDE 43

Applications to Society Follow

SLIDE 44

Earth Engine

SLIDE 45

Motivation - Carbon Forest Tracking

UNEP: "Atlas of our Changing Environment"

1975 1989 2001

Rondonia, Brazil

SLIDE 46

A Sampling of Other Use Cases

Disease Early Warning: Remote surveillance of

disease and prediction of epidemics.

Population Census: Supplements traditional census

and mapping in developing regions.

Humanitarian Crisis Mapping: Can detect and

monitor a growing range of crisis types.

Water Resources: Monitor water quality and

availability and alleviate water shortage problems.

Food Security: Famine early warning, rainfall and

water requirements estimations, agr production estimates and irrigation and fertilizer supply & demand.

Global Education Programs

SLIDE 47

Parallel Geo-Processing “in the cloud” (a brief illustration)

SLIDE 48

Original image

SLIDE 49

Original image ... is divided into 256px sub-units.

SLIDE 50

Sub-units are distributed

SLIDE 51

Sub-units are distributed ... to separate machines.

SLIDE 52

Sub-units are distributed ... to separate machines ... where they can be processed in parallel.

SLIDE 53

Thousands can be processed simultaneously

SLIDE 54

Result is reassembled

SLIDE 55

Result is reassembled ... into a finished image

SLIDE 56

Global-scale earth observation and informatics platform

For public benefit, and to support emerging green economy Help science come out of research lab and into operational use, at scale Unprecedented catalog of earth observation data for mining and analysis Promote transparency, reproducibility, collaboration, “open science”

Very fast computation of scientific map products

Intrinsically-parallel pixel processing system Built-in Google algorithms as well as user-supplied Earth Engine API for 3rd party algorithm development Access control, versioning, provenance Online and desktop versions (open source desktop version)

On a lot of useful data

Every available Landsat and MODIS scene (more satellites coming) Commercial datasets (very high resolution satellite imagery) Environmental data (atmospheric, ocean, terrestrial) User-supplied (ex: in-situ data collected via Android phones)

Overview

SLIDE 57

Scale of Data

US Satellite imagery dataset: Landsat

Peru: 60 Landsat scenes (3Gpix, 20GB) per covering World: 8000 Landsat scenes (2TB) per covering Complete global coverage every 16 days Operating since 1972, historical archive holds ~4PB US, NASA EOS approaching ~10PB of Earth images

Europe: European Space Agency (ESA)

ESA satellite missions: MERIS, Envisat, others SpotImage (France): 20M SPOT images since 1986, 10,000 new images collected daily, 5+ PB archive ESA Launching Sentinel-1 in 2011

Representative Examples

Envisat : Gulf Oil Spill June 2010 (ESA)) MERIS: Hurricane Isabel Sept 2003 (ESA) Spot Image: Xingu Brazilian Amazon

SLIDE 58

View on YouTube

SLIDE 59

Health (US)

SLIDE 60

Google Health personal health dashboard

Launched in May 2008 Major update Sept 2010 User controls and

wns his/her data

A platform that…

Provides a dashboard for wellness information & medical records Allows user to connect and interact with a broad group of “add on” services Includes a non- tethered PHR

SLIDE 61

Crisis Response

SLIDE 62

Person Finder

SLIDE 63

SLIDE 64

Pre-Earthquake - Aug 26, 2009

SLIDE 65

1 Day After Earthquake - Jan 13, 2010

SLIDE 66

13 Days After Earthquake - Jan 25, 2010

SLIDE 67

Digital Humanities

SLIDE 68

Illuminating the Humanities

Q: What can you do with: 12 million books in

ver 400 languages

comprised of 5 billion pages and 2 trillion words ...all digitized?

A: Look to the humanities for new questions...

How would you (re)define Victorian literature? What are the differences between the English and Latin editions of Hobbes’ Leviathan? How have places changed over the course of history?

SLIDE 69

Digital Humanities Awards

Research program supporting university research taking a computational approach to traditional humanist questions. US program, Summer 2010 12 projects 23 researchers 15 universities European program, Winter 2010 10 projects planned

$1M total funding

SLIDE 70

Education

SLIDE 71

Curriculum Development

Seeding and supporting computing curriculum development

Exploring computational thinking in K12 (launch late Oct) CS4HS: High school computer science (cs4hs.com) Undergraduate open source CS curriculum: Google Code University (code.google.com.edu) Lantern platform: Wiki for open source curriculum development (in collaboration with Khan Academy)

SLIDE 72

Talent Development: Google Summer of CodeTM

Program Genesis ”Flip bits not burgers” during summer holidays Exposure to real-world software development Students paired with mentor from OS community Execute to milestones laid out in accepted application Stipend allows students to concentrate on OS development 2010 1026 students 150 organizations 69 countries

SLIDE 73

Technology Leadership: App Inventor for Android

Visual programming environment for Android mobile devices Helping people become creators (rather than consumers) of technology Launched in Google Labs July 12, 2010 http://appinventor.googlelabs.com/about/

SLIDE 74

CS4HS European Workshops

École Polytechnique Fédérale de Lausanne (EPFL), Switzerland: 'Building and Programming Robots' ETH Zurich, Switzerland: 'ABZ Ausbildung - und Beratungszentrum fuer Informatikunterricht' Makerere University, Uganda: 'Grassroot approach to improve the quality of applicants of computing programs at Makerere University' Manchester University, United Kingdom: 'Animation11' Oslo University, Norway: 'TENK' Queen Mary University, United Kingdom: 'cs4fn magazine' RWTH Aachen, Germany: 'Bright Brains in Computer Science' Sapienza University of Rome, Italy: 'Challenge and Fun with the CS Olympiads' University of Stuttgart, Germany: 'UniS2010' Technion, Israel: 'High School Computer Science Female Students', Visits in Google: 'Impressions, Conceptions and Influences' Trinity College Dublin, Ireland: 'Computer Programming Outreach, B2C' University of Cape Town, South Africa: 'Project Umonya' University College Dublin, Ireland: 'CS Summer School' University of Warsaw, Poland: 'Mastering Programing Skills, Workshops for Teachers'

SLIDE 75

Technology Leadership: Google Code University

Course content on current computing technologies and paradigms CS Curriculum Search Tech Talks on CS Topics Tutorials, lecture slides, problem sets for a variety of topic areas: AJAX Programming Algorithms Distributed Systems Web Security Languages Practical Skills (MySQL, Linux) http://code.google.com/edu/

SLIDE 76

Supporting our Academic Institutions

Research Awards Programs - 230+ projects funded in the last year Next Due dates August 15 (CS Awards), October 15 (Marketing Awards) Research-awards@google.com Focused Grant Program Visiting Faculty Program - 20 faculty (ongoing) University-relations@google.com Ph.D Fellowship Program 2009: 13 students supported in North America 2010: 15 in North America, 16 outside North America Over 150 other scholarships ~1000 interns worldwide CS4HS: 1500+ teachers (~100,000 students): US & EMEA

SLIDE 77

Final Thoughts

Scale of Communication and Computing is profound Endless opportunity for technical growth Some large themes Major new application domains

Google rapidly to innovate in science/technology and value to consumers We are providing increased support for academic institutions in computer science and related areas It's a most exciting area in which to innovate