Below is an blog post from Norm Matloff the author of the Art of R - a
http://blog.revolutionanalytics.com/2014/08/statistics-losing-ground-to-cs-losing-image-among-students.html
perceived to be unfashionable. The reference to a CS usurpation problem is
probably 2 very valid sides to that coin.
reveal an opportunity.
rebuttal of a viewpoint I share but Matloff really didn't deal with it well.
have written myself.
management. That market is all the rage now, but the R and Python
communities are probably lost causes.
Julia.
Post by Ihe OnwukaWell he didn't comment on SQL for JSON per se but saying that RDBMS are
sub-optimal for everything is a tacit repudiation of SQL is it not?
No, because he said exploitively that the *internals* of a database will
be different (columnar, main memory, streaming, etc)âŠ.. the
programming language will STILL be SQL. Or at least for all those
databases for whom the data model is STILL relational.
He buys into the notion that there will be swarms of data scientists doing
clever things with data which will need a different language.
Yes. SQL clearly doesnât solve the R use cases. So yes, R is on the
âacceptable OTHER languagesâ list.
But thatâs not clear that what we (aka the XML community see) as ânormalâ
data processing use cases will be considered necessary use cases
for the JSON/NoSQL community.
E.g. scanning the data and automatically extracting a schema. Is this an
acceptable use case for JSON ? Or not ?
If yes, then XQuery has a chance, because XQuery can do that and SQL cannot.
If no, people will stick to what they know : SQL.
He is right that statistical packages suck at data management but that
won't isn't going to deter the R community.
Yes, the R implementations (I looked at them in details about 2 years ago)
have NO IDEA about how to deal with large volumes
of data, so probably a mix between data technologies and database
technologies is necessary.
However, donât underestimate companies like Oracle. They are not dummies,
and the know what the market wants.
R is supported natively inside the Oracle database now.
I think that Stonebreaker exaggerates when he says that relational
databases will disappear in 10 years. Well⊠I donât think
this will happen so quickly.
Do you see XQuery fitting anywhere in this vision. It has potential as a
pipeling technology as does for that matter SQL. I think it will always be
problematic to do analytics on the source data because it is too dirty.
XQuery COULD be a very good âglueâ language between data in various
formats (CSV, Excel, PDF, HTML, XML, JSON, relational, whatever).
But I say âCOULDâ not âCANâ.
It needs many extensions to be good at that: scripting, support for JSON,
modules to support a variety of data formats and data processing services.
Best regards
Dana
P.S.
I am continually surprised that people this smart believe that there is
such a pool of data scientists people to draw from.
Me too. I fell down from my chair when I saw the article saying that US
needs 5 million data scientists in the next 2 years, aka, about 5% of the
US working population. Not sure if this for laughing, or for crying.
[[ aka, we will not have cashiers at Safeway anymore âcause they are all
data scientistsâŠ.]]
Someone up there doing the math in this article doesnât understand jack
nothing about numbers and statistics âŠâŠ.
http://www.nature.com/news/irreproducible-biology-research-costs-put-at-28-billion-per-year-1.17711?utm_content=buffer95bfb&utm_medium=social&utm_source=linkedin.com&utm_campaign=buffer
God knows how many medicines are wrongly given to sick people, because
nobody knows how to do a proper case study âŠ
REALLY scary ⊠but thatâs another discussion.
Again the same discussion comes up: DONâT look for 5 million data
scientists. Just do with a smaller number of smart ones, but GIVE
THEM BETTER TOOLS and AUTOMATIZE THE PROCESS.
But hey, how can you stop such a wold wide enthusiasm for âdata
scientistsâ !?? Logic doesnât do itâŠ.
Post by daniela florescuIhe,
I had discussions with Michael Stonebreaker for 20 years about about the fact that
XML âexistsâ or not. With Jim Gray too, before he disappeared. They were
both extremely
supportive for me, yet were both thinking that I am crazy to waste my
research career on XML.
Stonebreakerâs opinion: he doesnât believe that XML âexistsâ in industry.
So he will not mention it, because it doesnât exist :-)
But you have to remember that Stonebreaker is a database person. Probably he will not
understand the facet of XML which is âXML as documentsâ. It took me and
the other database
people involved in XQuery years before we swallowed it. (Don Chamberlin of SQL fame
famously once said âwho in the world would care about such a corner case
as mixed content !?").
Donât blame the database people that they donât âgetâ XML. On one hand,
it has never been explained
to them properly.
And again, Stonebreaker, being a database person, he will look at âXML as
dataâ aspect of the story.
And this today is INDEED non-existing in industry, or almost. Or, when t
is, it is mostly for log analysis.
============
JSON will completely change the landscape, in surprising ways, that none
of us can predict.
And no, I trust that Michael Stonebreaker is too smart to believe that
SQL is a solution to process JSON.
But time will tell.
Best regards
Dana
http://youtu.be/9K0SWs1mOD0
By implication it puts the kibosh on SQL as the basis of a solution for
the future.
_______________________________________________
http://x-query.com/mailman/listinfo/talk