Tech Talk: Building Systems that Effectively and Cryptographically Protect User Privacy

Ghostor-and-MAGE_Blog-banner-image_380-x-403

Tech Talk: Building Systems that Efficiently and Cryptographically Protect User Privacy George Nychis About Our Guest Speaker Soroco invited Sam Kumar, a current 5th year Ph.D. student from U.C. Berkeley and prior U.C. Berkeley B.S. CS graduate, to give a talk on his and his colleague’s novel computer security and privacy research. The work that […]

Increasing the Accuracy of Textual Data Analysis on a Corpus of 2,000,000,000 Words: Part 1

At Soroco, natural language processing and machine learning-based classification of text are foundational to many of our products. In some instances, we may ingest between 200,000,000 and 2,000,000,000 words over the course of model training and analysis for a single team of workers using our Scout product. In this blog post, we will address some tips and tricks which we have found to significantly increase the accuracy of our models, including appropriate processing of text for the purpose of leveraging standard techniques from machine learning. Many advanced methods for performing text classification require careful modifications so as to respect the structure of multi-field textual data for optimal performance.

Abstract Syntax Tree for Patching Code and Assessing Code Quality

Abstract Syntax Tree is an extremely useful tool. It provides a tree representation of a source code, that can be traversed to parse the code while understanding the code’s language constructs. In this article, we explain how we have used ASTs to write IntelliPatch scripts that can programmatically apply patch to code. Further, we explain how ASTs can be used to assess code quality and create call graphs.

Building Large Scale Systems and Products with Python

Building Large Scale Systems and Products with Python George Nychis  Overview At the beginning of Soroco’s journey, we had to answer a question that many engineering organizations have had to answer before. What programming language were we going to use when building and scaling our products? The reason that each organization needs to answer the […]

Introducing PYCE: Executing Encrypted Python with No Performance Penalty

Introducing PYCE: Executing Encrypted Python with No Performance Penalty Learn how to modify Python’s import!   Wolfgang Richter  Overview Soroco is excited to write its first blog post on how we modified the Python language to make deploying Python code more secure in production. Though Python is a powerful and productive language for building systems, […]