NoSQL Weekly 279

Articles, Tutorials and Talks

NoSQL Injection in Modern Web Applications
The threat of SQL injection in modern web applications has fallen by the wayside with the rise of NoSQL databases. Unfortunately, a new but fundamentally similar threat has taken its place: NoSQL injection. Let's take an in-depth look at this type of attack and the steps we can take to protect ourselves from it.

NoSQL Python
Anyone who's dealt with a CSV file that contains arrays or a JSON document with nested fields knows the pain of shoehorning non-rectangular data into standard Python data tools, such as data frames. This presentation will show you Python best practices for managing such non-rectangular data and highlight new opportunities for using "NoSQL" Python for interesting and painless analyses of real-world data.

Secure MongoDB with X.509 Authentication
This tutorial describes the detailed process of setting up X.509-based authentication, for both cluster inter-member authentication and client authentication, using a local CA (Certificate Authority).

FPGA-based hardware acceleration for a key-value store database
This thesis investigates the use of a Field-Programmable Gate Array (FPGA) as a hardware accelerator for a key-value database. Utilized as a platform of reconfigurable logic, the FPGA offers massively parallel usability at a much faster pace than a traditional software-enabled database system. This project implements a key-value store database hardware accelerator in order to investigate the potential improvements in performance. Furthermore, as new technologies in materials science and computer architecture arise, a revision in database design welcomes the use of hardware for maximizing key-value database performance.

Geospatial Indexing US Congressional Districts with Neo4j-spatial
In this post I will show you how we can use geospatial indexing with Neo4j. We will extend the legis-graph dataset (a graph of US Congress), adding US Congressional districts to the graph. We will use neo4j-spatial, a plugin for Neo4j, to index the geometries of the Congressional districts. Finally, I will show how we can connect Neo4j to Mapbox, a mapping framework, to make an interactive geographical visualization in the browser.

Using Spark and Zeppelin to process big data on Kubernetes 1.2

Storing, Indexing and Searching files with Couchbase and Spring Boot

8 Solid Tips for Succeeding with Neo4j

Simple CouchDB and Cloudant Backup

Books

Advanced Analytics with Spark: Patterns for Learning from Data at Scale
You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques--classification, collaborative filtering, and anomaly detection among others--to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find these patterns useful for working on your own data applications.

Interesting Projects, Tools and Libraries

Feather
Feather: fast, interoperable binary data frame storage for Python, R, and more, powered by Apache Arrow.

REEF
REEF, the Retainable Evaluator Execution Framework, is our approach to simplify and unify the lower layers of big data systems on modern resource managers. For managers like Apache YARN, Apache Mesos, Google Omega, and Facebook Corona, REEF provides a centralized control plane abstraction that can be used to build a decentralized data plane for supporting big data systems. Special consideration is given to graph computation and machine learning applications, both of which require data retention on allocated resources to execute multiple passes over the data.

redis-api
A Redis API for use with Slack.

SneakerJS
Entity relationship automation for AngularJS and CouchDB (or PouchDB, etc.).

xlator
Translation service prototype that is backed by a hosted ElastiCache (Redis) instance.

Upcoming Events and Webinars

Webinar: Faster Big Data Analytics with MongoDB
Join us for a 40-minute session showcasing a real-world case study of a unique and innovative big data analytics solution that CIGNEX Datamatics architected, implemented and delivered using MongoDB for one of its clients in Silicon Valley, enabling effective lead generation for data-driven sales, marketing and business development teams.

Webinar: Building a Real-time Streaming Platform Using Kafka Streams and Kafka Connect
This presentation will give a brief introduction to Apache Kafka and describe its usage as a platform for streaming data. It will explain how Kafka serves as a foundation for both streaming data pipelines and applications that consume and process real-time data streams. It will introduce some of the newer components of Kafka that help make this possible, including Kafka Connect, a framework for capturing continuous data streams, and Kafka Streams, a lightweight stream processing library. Finally, it will describe the lessons learned by companies like LinkedIn building massive streaming data architectures.

Using Memory-Based NoSQL Data Structures to Eliminate the Network Bottleneck - New York, NY
This session will explore how IMDGs (in-memory data grids) can implement user-defined data structures by leveraging their NoSQL, object-oriented, data-parallel data model. We will also discuss techniques that let applications define object-oriented methods which execute within the IMDG to implement a user-defined data structure. Other techniques extend this approach for scalability by distributing stored data across the IMDG's cluster of servers and performing data-parallel operations across the distributed data structure. Several examples will be described that demonstrate both the power of this approach and its ability to reduce network congestion.

Cypher for SQL Developers - London, United Kingdom
This session is an introduction to graphs for SQL developers. We'll start with an overview of Neo4j and its query language, Cypher, before moving on to a worked example where we'll query the same data set using a relational database and Neo4j. We'll also cover graph modeling and query optimisation, and show an example of procedures, which will be available in the 3.0 version of Neo4j.
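
To give a flavour of the "same data set, two query languages" comparison that the Cypher session describes, here is a minimal sketch. It is not taken from the session: the `person`/`knows` schema, the sample names, and the friends-of-friends question are all invented for illustration. The SQL runs against an in-memory SQLite database; the Cypher query is shown only as text for side-by-side reading and is not executed.

```python
import sqlite3

# Hypothetical schema, invented for this illustration only.
SCHEMA = """
CREATE TABLE person (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
CREATE TABLE knows (
    src INTEGER REFERENCES person(id),
    dst INTEGER REFERENCES person(id)
);
"""

# Relational version: each hop along a relationship is another join.
SQL_FRIENDS_OF_FRIENDS = """
SELECT DISTINCT fof.name
FROM person AS p
JOIN knows AS k1 ON k1.src = p.id
JOIN knows AS k2 ON k2.src = k1.dst
JOIN person AS fof ON fof.id = k2.dst
WHERE p.name = ? AND fof.id <> p.id
ORDER BY fof.name
"""

# Graph version of the same question, assuming (:Person)-[:KNOWS]->(:Person)
# nodes and relationships. Shown as text only; nothing here talks to Neo4j.
CYPHER_FRIENDS_OF_FRIENDS = """
MATCH (p:Person {name: $name})-[:KNOWS]->(:Person)-[:KNOWS]->(fof:Person)
WHERE fof <> p
RETURN DISTINCT fof.name AS name
ORDER BY name
"""


def build_db():
    """Create an in-memory database with a tiny, made-up social graph."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO person (id, name) VALUES (?, ?)",
        [(1, "Ann"), (2, "Bob"), (3, "Cat"), (4, "Dan")],
    )
    conn.executemany(
        "INSERT INTO knows (src, dst) VALUES (?, ?)",
        [(1, 2), (2, 3), (2, 4), (3, 1)],
    )
    conn.commit()
    return conn


def friends_of_friends(conn, name):
    """Return the sorted names reachable in exactly two KNOWS hops."""
    rows = conn.execute(SQL_FRIENDS_OF_FRIENDS, (name,)).fetchall()
    return [row[0] for row in rows]


if __name__ == "__main__":
    print(friends_of_friends(build_db(), "Ann"))
```

In the SQL form every additional hop adds another self-join on `knows`, whereas the Cypher pattern grows by appending one more `-[:KNOWS]->` step to the path.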