Geeks With Blogs
Josh Reuben
11
Apr 16
The JVM (Java Virtual Machine) is a virtual "execution engine" instance that executes the bytecodes in Java class files on a CPU - Knowing how to tune its myriad flags affects how your application executes. JIT Compiler TuningHotspot Compilation Mechanism Selection-client - client compiler (C1) - begins compiling earlier -> optimize startup time. ......

When constructing Spark Machine Learning Pipelines - I find it really helpful to maintain a bird's eye view of the various transformers and estimators available.in a nutshell: fit trainingData (train a model), transform testData (predict with model)Transformer: DataFrame => DataFrameEstimator: DataFrame => TransformerTransformersToke... sentence ......

System Architecture patternsN-TierEvent-Driven - Mediator / BrokerMicrokernelMicroServi... - MVC / MVP / MVVMserver - RPC / Remoting / WS / SOA / RESTSpace-BasedSOA PatternsFoundational StructuralService Host - infraActive Service - worker thread for upstream pre-fetchTransactional ServiceWorkflowEdge ComponentQoS PatternsDecoupled Invocation ......

Overview Developed by Facebook HiveQL is a SQL-like framework for data warehousing on top of MapReduce over HDFS. converts SQL query into a series of jobs for execution on a Hadoop cluster. Organizes HDFS data into tables - attaching structure. Schema on Read Versus Schema on Write - doesn’t verify the data when it is loaded, but rather when a query ......

Big Data has a plethora of Data File Formats - its important to understand their strengths and weaknesses. Most explorers start out with some NoSQL exported JSON data. However, specialized data structures are required - because putting each blob of binary data into its own file just doesn’t scale across a distributed filesystem. TL/DR; Choose Parquet ......

As chief architect of an Ad-Tech startup, part of my role involves hiring and mentoring software engineering candidates. Our technology stack includes Scala, SBT, Akka, Spray, Spark + MLlib, AWS, Ecmascript 6, BeEF, Linux environment, Git, Docker, Bash, Kafka, ELK, NGinX, and as of today, Mesosphere. I can tell you, it ain't easy to find Scala Devs. ......

Scala IO is somewhat lacking at this point in time - often requiring a fallback to Java APIs. I did not have time to write this out in Scala - this post will contain a few Java snippets. What counts is the concepts - Buffers, Streams, Channels and IO vs NIO vs NIO2. Also, Scala per se' does not natively support a "try with resources' concept - for this ......

Last year I spent some time debugging Qt C++11 code using the GNU Debugger GDB https://www.gnu.org/softwar... (as well as profiling with Valgrind http://valgrind.org/ ).To skill up on this, I read the "Art of Debugging with GDB, DDD & Eclipse" http://www.amazon.com/The-A... (from the excellent as always No-Starch ......

Annotations (Metadata) embed metadata in source code, for design-time/compile-time/ru... processing Syntax - @<annotation-name>(&l... initializations>) the bedrock of JEE Annotation Definition Import java.lang.annotation package @interface - indicates an annotation type annotation types automatically extend Annotation interface - specifies ......

Overview A scalable cloud based architecture has converged upon a common design meta-pattern leveraging the following components, which are explained below: A Forward-Cache Proxy An elastic service Load Balancer Stateless Server Facade Nodes A Shared Memory State Cache Client requests hit the Forward-Cache Proxy which returns any memoized service requests ......

Copyright © JoshReuben | Powered by: GeeksWithBlogs.net | Join free