wolfram alpha
play

Wolfram|Alpha - Answering Questions with the Worlds Factual Data - PDF document

Slide 1 of 25 Wolfram|Alpha - Answering Questions with the Worlds Factual Data Joshua Martell - Feb 2nd @ 5pm, Mission City B5 2 StrataConf-WolframAlpha.nb Slide 2 of 25 What is Wolfram|Alpha? - Computational Knowledge Engine - the


  1. Slide 1 of 25 Wolfram|Alpha - Answering Questions with the World’s Factual Data Joshua Martell - Feb 2nd @ 5pm, Mission City B5

  2. 2 StrataConf-WolframAlpha.nb Slide 2 of 25 What is Wolfram|Alpha? - Computational Knowledge Engine - the world’s systematic knowledge computable and accessible to everyone - to compute whatever can be computed about anything - Integrate the world’s facts, data, and algorithms - 5 years of R&D - launched in May 2009

  3. StrataConf-WolframAlpha.nb 3 Slide 3 of 25 Wolfram|Alpha by example - when is valentines day? - how long was t � rex? - how many calories is in a burger?

  4. 4 StrataConf-WolframAlpha.nb Slide 4 of 25 Examples: Math - what is 2 � 2? - plot sin � x^2 � y^2 � - int e^t sin � 5t � dt - product � 1 � 1 � n^4 � , n � 2 to infinity

  5. StrataConf-WolframAlpha.nb 5 Slide 5 of 25 Examples: Data lookups - population of china - GDP of the EU - flight time from seattle to tokyo - AAPL MSFT

  6. 6 StrataConf-WolframAlpha.nb Slide 6 of 25 Examples: Visualization - earthquakes dec 2004 - qr code: http: �� strataconf.com - caffeine

  7. StrataConf-WolframAlpha.nb 7 Slide 7 of 25 Examples: Formulas - mortgage 5 � 20 yr - RLC circuit

  8. 8 StrataConf-WolframAlpha.nb Slide 8 of 25 Examples: Fun - scrabble quixotic - what's the meaning of life? - airspeed velocity of an unladen swallow

  9. StrataConf-WolframAlpha.nb 9 Slide 9 of 25 Elements of the output Described in the tour - input field - assumptions (sometimes) - input interpretation - results pod - other pods - buttons and pull-down menus - output extras under the pod - source information

  10. 10 StrataConf-WolframAlpha.nb Slide 10 of 25 More about the input interpretation - getting from the free form input to here is our secret sauce - combination of heuristics, algorithms, and developer curation - W|A has formed an exact expression representing your input

  11. StrataConf-WolframAlpha.nb 11 Slide 11 of 25 More about pods - W|A looks for components that can report about the input - results pod is the “answer” - pods load asynchronously - related cross-domain information in other pods - entity only inputs

  12. 12 StrataConf-WolframAlpha.nb Slide 12 of 25 Finding data - want to have the best information available - quality, breadth, and technical assessment of each source - web searches, evaluation of source materials - prefer primary sources, prefer digital sources

  13. StrataConf-WolframAlpha.nb 13 Slide 13 of 25 Finding data - technical considerations: Print >> PDF >> HTML >> DB >> CSV - discuss with world experts to understand the data - PD & US Gov have simple licensing terms - corporate deals are more complicated, but data is better quality, documentation, assistance

  14. 14 StrataConf-WolframAlpha.nb Slide 14 of 25 Aggregating data - rare that one source has it all - fill in with secondary sources - alignment is difficult, troublesome, and error prone - use common identifiers, verify data across sources - hand checking....does this value make sense? - automating updates

  15. StrataConf-WolframAlpha.nb 15 Slide 15 of 25 Data cleaning and curation - we want the best data, and we’re willing to work for it - automate as much as possible, do the rest by hand - use Mathematica to find outliers and oddballs, explore data, verify quality - takes time and attention to detail

  16. 16 StrataConf-WolframAlpha.nb Slide 16 of 25 Making it computable - teach W|A about the domain and its relationships to existing domains - natural language parsing for entities and properties - data becomes a building block for the inspiration of users - GDP Greece � population of Italy - questions thus far?

  17. StrataConf-WolframAlpha.nb 17 Slide 17 of 25 Data storage and retrieval - read heavy system - writes from feeds and developers - some results involve only computation, but usually some data is used - elaborate tracking of data changes

  18. 18 StrataConf-WolframAlpha.nb Slide 18 of 25 Versioning and deployment - significant development effort - different tools for programmatic changes verses hand curation - versioning is closely tied to deployment; deploy only new or updated values - weekly deployment of a new revision of W|A with data updates and code changes - content distribution system updates colocations

  19. StrataConf-WolframAlpha.nb 19 Slide 19 of 25 Purely computational data - formulas, encodings, etc. - internal APIs are very flexible - data comes from a wide variety of computational and unconventional sources - many, many built in algorithms in Mathematica

  20. 20 StrataConf-WolframAlpha.nb Slide 20 of 25 Computation and visualization - Mathematica as a development platform - our (not so) secret weapon - functional, very high level, symbolic programming language - built in everything - statistics, numerics, advanced plots, charts, file formats - large collection of algorithms - database/Java/.NET integration - C interface - used by - all 15 major US Federal government departments - all Fortune 50 companies - all 50 largest universities worldwide

  21. StrataConf-WolframAlpha.nb 21 Slide 21 of 25 Web components - web Mathematica = Mathematica powered web pages - Mathematica integrates into the servlet engine - majority of code is Mathematica

  22. 22 StrataConf-WolframAlpha.nb Slide 22 of 25 Other Technologies: API - REST API, various language bindings - returns XML encoded HTML, plaintext, images, etc - used for W|A iPhone iPad / Android app - used by Bing to display W|A results into search results - free to try out 2GB over a T1 » Result 10363 seconds

  23. StrataConf-WolframAlpha.nb 23 Slide 23 of 25 Other Technologies: Wolfram|Alpha appliance - W|A in your data center - consulting to integrate corporate data into W|A system

  24. 24 StrataConf-WolframAlpha.nb Slide 24 of 25 Data Summit 2011 - Wolfram Data Summit - Washington DC, Sep 7,8, 9 - not W|A specific - many corporate, non-profit, government attendees - meeting place to discuss common issues and solutions

  25. StrataConf-WolframAlpha.nb 25 Slide 25 of 25

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend