rslt OPS = Value stream, removing constraints

🎭 index references    elucidation    metier 🎭
👐 top    mid    bottom   👐

⚙  bpm   sdlc   bianl   data   meta   math   ⚙
  
⚒   Intro   Soar CSI   Big data   ER-star ALC3   High-variability   Results evaluate   ⚒
  
⚒   System Performance   System Configuration   Flow load slicing   Flow monitor   dataops   Retrospective   ⚒

Data modelling, process scheduling.

Defining the work of users & colleagues made easier.

(figure: working with colleagues, full circle - controls for data information security, test automation in the release train, the release train, and the data flow)
Working experiences shaped by some controls. The SIAR cycle is drawn using the corners 👓, with the topic cycle in the center.

SIAR:
S Situation
I Initiatives
A Actions
R Realisations

🔰 Lost here? Then back to: devops sdlc.

Progress


Contents

Reference Topic Squad
Intro Defining the work of users & colleagues made easier. 01.01
Soar CSI Analysing Computer System Information. 02.01
Big data Big data low variability. 03.01
ER-star ALC3 The star ER-model - ALC type3. 04.01
High-variability High variability but not that big. 05.02
Results evaluate Evaluating results - impact value. 06.01
Words of thanks to my former colleagues 06.02
System Performance System Performance - users, groups, business lines. 51.01
System Configuration RACF, SMF System performance, system sizing. 52.01
Flow load slicing Big data low variability, ER model. 53.01
Flow monitor Monitoring, Logging, tuning - Service Delivery. 54.01
dataops Dataops, data model, ALC type3. 55.01
Retrospective Evaluating realisations. 56.01



Analysing Computer System Information.


Understanding the system: CSI is not Crime Scene Investigation, although the difference would be minor when there is a cybercrime to investigate. Anything that can be observed is potentially important. There are many overlapping operational tasks; availability of systems is one of those.

SOC, the Security Operations Center, and Computer Operations.
The SOC, in a very simplified approach, only has the interface of security access.
System Management Facilities (SMF) collects and records system and job-related information that your installation can use for a lot of goals with different kinds of usage. ETL, DWH, in a figure:
(figure: basic SoC / CSI)

Add data for: the data lake for Computer System Information (CSI) is able to support security operations and others. It started with those others. Having a generic system (SMF) avoids the need to define a data pipeline for every subsystem.

System Trend database.
💡 In this approach the SDDBs are databases collecting summarised history, updated by increments in time. This saves storage, anonymises the data and makes retention policies for detailed information realistically implementable.
The only disadvantage is that when a new or changed trend analysis is defined, it has to be rebuilt from what is available.
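A minimal sketch of such an incremental trend update, in SAS as used in these projects; the dataset names (work.smf_detail, trend.sddb_cpu) and variables are illustrative assumptions, not the actual SDDB layout:

```sas
/* Summarise the newly arrived detail records per day and subsystem.  */
/* All dataset and variable names here are illustrative placeholders. */
proc summary data=work.smf_detail nway;
   class run_date subsystem;
   var cpu_seconds io_count;
   output out=work.increment (drop=_type_ _freq_) sum=;
run;

/* Append the increment to the summarised history (the SDDB).          */
/* Detail records can then be removed according to the retention rule. */
proc append base=trend.sddb_cpu data=work.increment;
run;
```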

Big data low variability.


Big data, AI and ML (machine learning) are getting a lot of attention as the new way of business rules and modelling. The practical operational side, how to run the processes, is neglected. New requirements on how the model is used in an explainable way and what the model does in operations are being ignored.

Running predictable planned processes ⚙
Planning an operational process is dictated by agreed interfaces. Those interfaces decouple responsibilities, needing actions both for what the goal is when all goes as intended and for what to do when one of the deadlines in the interfaces is missed by any party.
(figure: computer operations, scheduling)
The scheduling:
This setup is the basic design of the model deployment. Fine-tuning details, changing models and changing data are the daily improvement activity.

Conditions using scheduling 🎭
Scheduling is done in a very basic way. There are many technical options. With the first four, most of what is wanted can be automated.
  1. Moment in time. The one everybody knows.
  2. CC Condition Code of another program.
  3. Existence yes/no of some file.
  4. Calendar functionality for repeating flows.
  5. Some Database event (not generic).
  6. Load on system resources.
  7. Agreed moment a process has to be ready.
The Condition Code logic is a basic coding instruction. It instructs developers to set it according to functional logic.
💣 Do not try to implement a Condition Code by doing log analyses on the code output. That is prone to errors and adds unnecessary complexity (muda, mura, muri).
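A minimal sketch of setting the condition code from functional logic, assuming a hypothetical delivery dataset work.delivery; the check (an empty delivery) is only an example of a functional rule:

```sas
/* Count the records in the delivery; names are illustrative only. */
proc sql noprint;
   select count(*) into :nrecs trimmed from work.delivery;
quit;

/* Set the job condition code from the functional result, not from */
/* scanning the log afterwards.                                    */
data _null_;
   if &nrecs = 0 then do;
      put 'WARNING: empty delivery, signalling the scheduler.';
      abort return 8;
   end;
run;
```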

Building modular programs for scheduling 📚
Every program should run in a limited time (wall clock, elapsed time) and be logically consistent and restartable. These requirements are important for predictable operations.
Not every functionally good working code is good in efficiency (performance) or technically good (unnecessary constraints, locks). There are many caveats. Making the wrong choices in the engineering details means it has to be rebuilt from scratch. A proof of concept does not cover this.
💡 Solving all three requirements to an acceptable level as early as possible in the development process is real lean.
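A small sketch of what restartable can look like, assuming a hypothetical checkpoint dataset ctrl.step_done (columns step and run_date) maintained by the flow itself:

```sas
/* Skip a step that already completed for this run date (restart). */
/* Library, dataset and macro names are illustrative placeholders. */
%macro run_step(stepname);
   proc sql noprint;
      select count(*) into :done trimmed
      from ctrl.step_done
      where step = "&stepname" and run_date = "&sysdate9"d;
   quit;

   %if &done > 0 %then %do;
      %put NOTE: step &stepname already completed, skipping.;
      %return;
   %end;

   /* ... the actual modular work for this step goes here ... */

   /* Register completion so a restart can skip this step. */
   proc sql;
      insert into ctrl.step_done (step, run_date)
      values ("&stepname", "&sysdate9"d);
   quit;
%mend run_step;
```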

The star ER-model - ALC type3.


Data warehousing has a long history of delivering reports for decision makers. Doing machine learning is doing data processing in a different way. The goal is to give an indication on some element; features are descriptions associated with that element.

The ER-star diagram - predictable planned process ⚖
The previous coding guidelines (modular programs, scheduling for a predictable planned process) still apply. The processing changed to a full daily synchronisation with an incremental update every 15 minutes (near real time).

The ER-star diagram - ML data preparation ⚖
Grouping features around an element has similarities to the star-model in dimensional modelling used for a long time.
Differences to common dwh star model are:
  1. Data is not completely consistent nor does it have complete integrity.
  2. More data pipelines are needed.
  3. Dimensions are not normalised; denormalised dimensions on topic are sufficient.
  4. When doing partial updates, which objects have been touched.
(figure: star model - ALC type3)
The object that is going to be scored is, in the data model, in the same position as the fact table in a star schema used in a dwh data model.
There are a lot of differences; it is more an ER-model that is similar to the dwh star schema.
Building up data this way for the data scientists' modelling is sensible. They will transform it again into something better suited for the statistical programs: usually a single record composed of (modified) features.
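A minimal sketch of composing that single record per element, assuming hypothetical feature tables star.dim_account and star.dim_usage keyed on element_id around the fact table star.fact_element:

```sas
/* Join the feature groups around the element into one wide record. */
/* Table and column names are illustrative placeholders only.       */
proc sql;
   create table work.score_input as
   select f.element_id
        , f.event_dt
        , a.account_age
        , a.segment
        , u.usage_30d
        , u.usage_90d
   from star.fact_element     as f
   left join star.dim_account as a on a.element_id = f.element_id
   left join star.dim_usage   as u on u.element_id = f.element_id;
quit;
```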

Data is not completely consistent nor does it have complete integrity 🎭
Doing CDC (change data capture) from a source DBMS means retrieving data while avoiding any lock. Reading data that is being updated at that moment has a weird effect:
Getting features for an element that does not yet exist in the facts.
Selecting elements in the facts of which not all are valid to process at that moment.
With hindsight, evaluating the data flow, that is logical.
The surprise was learning this by analysing practical problems.
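A sketch of how such in-flight inconsistencies can be filtered out before processing, with the same illustrative table names as above; elements that are not yet complete on all sides are simply left for the next increment:

```sas
/* Keep only elements that are present on both the fact and the   */
/* feature side; half-arrived CDC updates wait for the next run.  */
proc sql;
   create table work.valid_elements as
   select f.element_id
   from star.fact_element as f
   where exists (select 1 from star.dim_account as a
                 where a.element_id = f.element_id)
     and exists (select 1 from star.dim_usage as u
                 where u.element_id = f.element_id);
quit;
```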
More data pipelines are needed 🎭
Developing with parallel teams and the Test, Acceptance, Production lines are all needed when a model goes through the lifecycle verifying all functionality. The scoring becomes a part of the operational value stream.
Such a full lifecycle is normally not a requirement for a dwh just used for reporting and dashboarding purposes.

Denormalised dimensions on topic are sufficient ⚙
The features, the dimensions, are very good to process when delivered this way for model building and model scoring. More normalisation adds waste by complexity.
In a classic dwh approach the assumption is: you never know what the next wanted report will be, so adding complexity will create value sometime in the future.

When doing partial updates, which objects have been touched ⚙
A near real time scoring on a huge population will run fine when only the most recent updates are scored and stored. This requires defining a working list of elements in the fact table that have been updated since the last run.
💡 This near real time update for scoring and retrieval: I have never seen this done with reporting or dashboarding based on a classic dwh.
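A sketch of such a working list, assuming a hypothetical control table ctrl.last_run holding the high-water mark of the previous run and an update_ts column on the fact table:

```sas
/* Retrieve the high-water mark of the previous run.           */
/* All dataset and column names are illustrative placeholders. */
proc sql noprint;
   select last_run_ts into :last_ts trimmed from ctrl.last_run;
quit;

/* The working list: only elements touched since the last run  */
/* get scored and their results updated.                       */
proc sql;
   create table work.worklist as
   select element_id
   from star.fact_element
   where update_ts > &last_ts;
quit;
```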


High variability but not that big.


Documenting and naming all elements that are used in data is a very basic activity of data governance. Bypassing this in favour of technical solutions will result in too little understanding of how the data flows. Do not start by pushing the data and information that is available; start at the information being requested.

Defining, naming all elements ⚖
Learning from the information that is present and then asking for new information to add is a fully circular approach. There are two questions to answer first.
  1. What information in what kind of data structure is the best at delivery, process output.
  2. What information in what kind of data structure is the best at input of the process.
(figure: data model, data life cycle)
The available information (input) has a fixed layout.
The output, the results having a goal, is in a relational format using columns.

Adding the information request for what is needed is the new mindset.

Conversions and transformations are the manufacturing and assembly steps.

Defining, data structure process output 🎭
Delivering information in a tabular format using columns with logical names is most practical for further processing.
Choosing the column orientation is vague; there can be multiple options. During the handover it was found that both options are valuable. 💡 The end result in delivery:
Defining, data structure process input 🎭
💡 Storing the data partitioned in yearly collections for every delivery type is most sensible for archiving, replacing and correcting.
💡 Storing the elements in a vertical way, not the usual column orientation but transposed, gives the best flexibility for all changes and new element types. This was an existing situation; only the naming and descriptions of all elements were technically complicated to solve. Both approaches are not normal dwh practices. The naming definitions with text and keys are to be added.

Defining, naming all elements ⚙
Starting the inventarisation it was noticed that there are a lot of the same elements for a different time period. The solution is in the naming convention: when preparing data for delivery these should go into separate rows with an adjusted time indicator key value.
The other ones are values for other periods:
  1. Realised (Accepted)
  2. Expectation differences (Balance)
  3. Corrected total value
The simple formula to verify is: A+B=C.
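A small sketch of verifying that rule per element and period, assuming the delivery rows are stored vertically with a kind indicator (A, B, C) and a value column; all names are illustrative:

```sas
/* List every element/period where A + B does not equal C.          */
/* A small rounding tolerance covers floating point representation. */
proc sql;
   create table work.check_abc as
   select element_id
        , period
        , sum(case when kind = 'A' then value else 0 end) as a_realised
        , sum(case when kind = 'B' then value else 0 end) as b_balance
        , sum(case when kind = 'C' then value else 0 end) as c_total
   from work.delivery_rows
   group by element_id, period
   having round(calculated a_realised + calculated b_balance
              - calculated c_total, 1e-8) ne 0;
quit;
```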

Transforming the information ⚙
Transposing data is not a normal data transformation; it doesn't even exist as a standard dwh data flow. Transposing the data fast and in a controllable way is what is needed in this approach. Windowing functions in SQL are an option; programming it in the old classic way is easy and very fast.
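In that old classic way, SAS has a dedicated procedure for it; a minimal sketch, assuming a vertical store with key columns delivery_id and period, an element_name and an element_value (all illustrative names):

```sas
/* Transpose the vertical storage back to a tabular delivery with */
/* one column per element name.                                   */
proc sort data=work.vertical_store out=work.sorted;
   by delivery_id period;
run;

proc transpose data=work.sorted out=work.tabular (drop=_name_);
   by delivery_id period;
   id element_name;      /* becomes the column name  */
   var element_value;    /* becomes the column value */
run;
```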

(figure: SIAR, not STAR or PDCA)

Evaluating results - technical.


The main technical issues are optimising data models and doing the processing in an easily understandable modular approach. In these projects I used SAS for programming. I am convinced it is also possible with other languages.

Naming conventions and a standard way of working are the key issue. They are not easily copied into another organisation.

Analytics in different contexts
The analytics usage of Computer System Investigations is very similar to business analytics. It got focused on only security operations as the single usage goal.
(figure: McAfee SIEM analytics)
The other usages are not getting the attention they deserve. There is a lot of room for improvement.

Maturity learning organisation, successes - failures
Even more weird is seeing the generic ETL and DWH that once were in place coming back as new trends. There is so much knowledge of "how to do it" getting lost and ignored.
The best results for a local organisation are achieved when the system is built appreciating that local context. 🎭 Ignoring the local organisation and only going for "using generic tools (cots)" brings hardly any improvement for the local organisation.

detailed descriptive information at ..
Words of thanks to my former colleagues
We had a lot of fun in stressful times working this out on the mainframe in the nineties, realising the uniqueness of it and the difficulties of porting it to somewhere else.
A few names:
Not always unhappy with what is going on
Ending bridge
This topic ends here. Back to devops sdlc.
👓 🔰
⚒   Intro   Soar CSI   Big data   ER-star ALC3   High-variability   Results evaluate   ⚒
  
⚒   System Performance   System Configuration   Flow load slicing   Flow monitor   dataops   Retrospective   ⚒

Technical Implementations.

System Performance - users, groups, business lines.


System performance, system resources.
Assuming the system has no speed limit and no interactions between system resources gives a lot of frustration in realisations. Some of those interactions pop up unexpectedly as an impossible impediment when working on solutions for an organisation.
(figure: basic performance, system resources)
Using standard tools to implement data flows and model scorings has dependencies on other tooling and the computer system. Know what the technical environment is and what limitations there are: decisions for technical design and coding should be a result of knowing this and understanding what the impact will be. Some limitations will change in the near future, others will get more troublesome. Duplication of data, just because a lot of machines are involved, is a complicated challenge.

⚠ Connecting (channel adapters - strings) storage
In the 80's the mainframe got better storage, DASD 3390. Several units of an, for that era, incredible 10Gb got connected. All attention was on the sizing.
We learned the hard way that the best performance was achieved by spreading the data over those four connections (striping) and servicing customers (multi tenancy) by time slicing. This tuning guideline from storage administration became well known.
⌛ ⏳
Working after 2016 on that big data low variability ER model, we got hit by performance issues. Analysing the issue and the hardware setup, we found four strings and the data not being spread (not striped). It was sold as IO-type storage: redundant, using mirroring and striping, with four storage units on each string. The striping was missed because all attention went to the sizing. What was the practical issue:
❓ Elapsed time for a process went from 10 minutes to 4 hours, same data, same code.
✅ Transfer sizes and buffers looked fine (64k unit size). Alignment for the file data is set. When using bulk load, the transfer block sizes are big. The drop in transfer block size comes back when analysing access to many files.
❓ Creating a test program doing modified record updates: it ran approximately 20 seconds on one machine and 10 minutes on another, same data, same code.
Alarming: nobody at the responsible parties reacted with an understanding of the issue.

⚠ Caching globally (OS) or in a subsystem DBMS
In the 80's the mainframe got IO system routines. Having not that much internal memory, the choice was about who did the caching of information and where it was done in the most effective way. Caching in areas and buffers at the DBMS, or specifying them for a named IO stream (JCL DD-statement), worked best with full control, avoiding system overload.
⌛ ⏳
Unix and Windows in recent years got so much internal memory that everything not used by some process is used for IO-caching. This happens without any notification or logging. What was the practical issue:
❓ Elapsed time for a process went up unexplainably by multiple times, same data, same code, when many big datasets were processed at the same moment.
❓ Creating a test program, it was not repeatable; only looking at the total load when the delay was experienced, and also when running smoothly, showed the correlation.
Calculating the size and the load, the system cache limit was exceeded by far, effectively overloading the system: waiting for IO, cleaning up the cache.
💡 Decreasing the size and doing time slicing avoids the issue.
Alarming: nobody at the responsible parties reacted with an understanding of the issue.

RACF, SMF System performance, system sizing.


SOC, the Security Operations Center, and Computer Operations.
The SOC, in a very simplified approach, only has the interface of security access.
Assuming the interface of a security tool is only about access is wrong.
(figure: basic SoC / CSI)
Menu settings, resource limits, accounting (cost management)
are often put in the same operational management interface.

An ancient figure from the 80's.

Logs are shared for different goals.

IT Operations Analytics, new in big data (just before 2020).
In the fields of information technology (IT) and systems management, IT operations analytics (ITOA) is an approach or method to retrieve, analyze, and report data for IT operations. ITOA may apply big data analytics to large datasets to produce business insights. In 2014, Gartner predicted its use might increase revenue or reduce costs. In 2016, Gartner coined the term AIOps, which stands for "Artificial Intelligence for IT Operations", as an industry category for machine learning analytics technology that enhances IT operations. By 2017, it predicted that 15% of enterprises will use IT operations analytics technologies. (wikipedia)
Nothing really new 🎭. The focus is on services and products to sell, not on improvement of the local business (how do they do it?).

⚠ Memory user setting (2010), not Security.
A disappointing example of the user interface mismatch: the new challenge is to explain to operational security staff to increase the setting from 4Mb to 64Mb, while at the desktop the memory size was already 4Gb.
⚠ Primary group user setting (2010), not Security.
Another disappointing example of the user interface mismatch: using the primary group as an indicator in an application makes a tool reusable by many groups. Defining it in a file or in registry settings are other options, but those add complexity at the user support staff side.
⌛ ⏳
Unix also uses a primary group set with the user. DAC (Posix) access rights set as an organisation-wide default do not make sense for business applications. Setting the group id and sticky bit (command: chmod 3750, 3770) is the real solution for business logic and separates the business data on folders. The owner is able to set those, and that is not done by central security staff. There are some more options using the hierarchical tree.

Big data low variability, ER model.


Running predictable planned processes ⚙
Planning an operational process is dictated by agreed interfaces. Those interfaces decouple responsibilities, needing actions both for what the goal is when all goes as intended and for what to do when one of the deadlines in the interfaces is missed by any party.
(figure: computer operations, scheduling)
The scheduling:

⚠ Network load
The full load is causing heavy network traffic. In this setup it is not isolated from end-users. Guideline: avoid that heavy traffic during office hours.
Not allowing the same flow to run in multiple instances avoids data corruption. It will cause delays, stacking up flows that wait for previous ones to finish.
❓ ⚠ Shamefully, this caused issues when a flow crashed and was not cleaned up well by the commercial scheduler - load balancer.
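A simple, not atomic, semaphore sketch for guarding against a second instance of the same flow; the dataset ctrl.flow_running is an illustrative assumption, and a crashed run leaves the semaphore behind on purpose so the situation stays visible:

```sas
%macro guard_single_instance;
   /* Stop when the semaphore of a running (or crashed) flow exists. */
   %if %sysfunc(exist(ctrl.flow_running)) %then %do;
      %put WARNING: flow already running or not cleaned up, stopping.;
      %abort cancel;
   %end;
   data ctrl.flow_running;            /* create the semaphore        */
      start_ts = datetime();
      format start_ts datetime20.;
   run;
%mend guard_single_instance;

%guard_single_instance

/* ... the flow itself ... */

proc delete data=ctrl.flow_running;   /* release at clean termination */
run;
```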

⚠ Processing many files (vtoc catalogue - inodes)
In the 80's the mainframe got IO system routines (BDAM, QSAM, VSAM) using catalogues for where to find data and the VTOC (volume table of contents) describing what is on a volume. For many small datasets a PDS / PDSE was used to avoid most of the overhead of catalogue and VTOC maintenance. Several experiences with the effect on open/close resulted in limiting the catalogue - VTOC administration for performance reasons.
⌛ ⏳
Running a program using "modify data in place": it ran acceptably on the development machine. Monitoring system resources, an unexpectedly heavy IO load was seen. Why that heavy load when all records are processed sequentially? Then we saw that the time stamps of the file were continuously updated; the block size for writes (IO) was very small. This is a programming design failure that is impossible to get escalated.
⌛ ⏳
Running a program using many small files in a Unix (Linux) environment. With many I mean above 32,000 files in one directory, 250,000 in a directory tree. In that case all kinds of performance issues were seen on the system, even going non-responsive.
This is unexpected system behaviour that is impossible to get solved by support; only reducing the load at a given moment (load balancing) is possible.
⌛ ⏳
Having assumed all temporary files were redirected to well sized and well performing locations, a new one popped up, in this case caused by the default location of the Java runtime environment.
This is a programming design failure impossible to get escalated.

⚠ CPU & memory load
The old idea of a single CPU with dedicated attached memory is not the state of the art anymore. When considering a chip with multiple cores, there is a question of whether the caches should be shared or local to each core. Implementing shared cache inevitably introduces more wiring and complexity. But then, having one cache per chip, rather than core, greatly reduces the amount of space needed, and thus one can include a larger cache. .. For example, an eight-core chip with three levels may include an L1 cache for each core, one intermediate L2 cache for each pair of cores, and one L3 cache shared between all cores. (wikipedia)
Just adding some CPUs may not be effective when they do not share enough of those caches. Another challenge when running into performance and load balancing issues.

Monitoring, Logging, tuning - Service Delivery.


Well behaving Information Systems 💡
A good information system is not achieved by assuming the hardware / cloud will solve it. No matter what it is about, understanding the information flow and the global impact of the load on the infrastructure is required.
There are global references in the "design data" chapter; some more technical details are in the paragraphs here.
The bottom line is using OS (operating system) tools for monitoring system resources. Anything else is using some OS tool in a hidden way.

With a focus on having used SAS intensively, a SAS practical paper: A previous SAS White Paper, "A Practical Approach to Solving Performance Problems with the SAS System," detailed the role of the FULLSTIMER option in diagnosing and solving performance problems. It introduced the usage of host-based performance monitors for further investigation. This paper continues with that approach, detailing the use of the most commonly available host-based performance monitors. It will discuss how to employ them in performance testing, interpret them with a SAS mindset, and reconcile them to FULLSTIMER output to determine problem causes.
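The corresponding option is a one-liner; a minimal example using the sashelp.class sample dataset so the extended statistics show up in the log for any following step:

```sas
/* Write elapsed time, cpu time, memory and IO statistics per step  */
/* to the SAS log, so they can be reconciled with the host monitor. */
options fullstimer;

proc sort data=sashelp.class out=work.class_sorted;
   by age;
run;
```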

Short recap lean design, data warehouse 3.0 💡
DWH 3.0: the inbound and outbound warehouse have a simple goal: solving the availability for a continuous flow in other parts of a request-delivery chain.
The classic dwh concept is not according to that warehouse goal when it does a lot of transformations.
E-L ➡ T
Decoupling the E (extract) and L (Load) from the T (transformation) is needed to simplify information flows.
Short recap lean design, Initialisation Termination 💡
(figure: WM initialisation termination control) Detailed control & balance, managing the Extract & Load, requires knowledge in an initialisation and termination logic tailored to an organisation's in-house processes.
E-L ➡ T
Those control steps are impossible in a generic technical approach. They require non-standard logic to be added to generic tools, classified as "configuration" (an undervaluation of its importance).
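A sketch of what such a non-generic termination control step can look like, assuming a hypothetical balance table ctrl.load_balance and a macro variable delivery_id set by the initialisation logic:

```sas
/* Termination control: record what was loaded so the balance can be */
/* verified against what the supplier announced for this delivery.   */
/* All dataset, column and macro names are illustrative placeholders.*/
proc sql noprint;
   select count(*) into :rows_loaded trimmed from stage.loaded_delivery;
quit;

proc sql;
   insert into ctrl.load_balance (delivery_id, load_ts, rows_loaded)
   values ("&delivery_id", %sysfunc(datetime()), &rows_loaded);
quit;

/* A mismatch between rows_loaded and the announced count is a        */
/* functional signal for the scheduler (see the condition code logic).*/
```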
Short recap lean design, full process design 💡
(figure: lean process, full connections) Any process, whether physical or in cyberspace, goes through a circle.
➡ request, IV ➡ III (pull)
(push) ➡ I ➡ II ➡ delivery
Two supporting processes are there: one for delivering the needed components and another for the machines being used.
🚧 ⚠
Changing focus from the "value stream" to using machines (tools, software, cots) is a common distraction in ICT.
Importance of Naming Conventions, Data Administration 💡
Data Administration (DA) or Master Data Management (MDM): the goal is a clear understanding of the meaning, context, intentions and goals. Where MDM is technology driven, I prefer DA as human context driven. Gartner: "Master data management (MDM) is a technology-enabled discipline in which business and IT work together to ensure the uniformity, accuracy, stewardship, semantic consistency and accountability of the enterprise's official shared master data assets. Master data is the consistent and uniform set of identifiers and extended attributes that describes the core entities of the enterprise including customers, prospects, citizens, suppliers, sites, hierarchies and chart of accounts."
🚧 ⚠
The importance of clear naming conventions, with the goal of clear understanding and using that for simple technical solutions, cannot be overstated.

Dataops, datamodel, ALC type3.


Becoming data driven, agile thinking 💡
In a data driven approach there is a cycle around the analyst. Structuring data is mentioned but is not the goal; the goal is having it automated for consumers.
(figure: datastorming, EL-T)

Avoiding the intermediate dwh data model can save a lot of unnecessary overhead. The figure used is from trifacta, promoting the EL - T decoupling.

Correcting information, Logical 💡
Logical correction of information is solvable, unless the interpretation of the information gets fraudulent. Presenting mainframe cost using PC devices is possible by adjusting numbers in spreadsheets, but very difficult to solve as exceptions in data flows (SDDB).

Correcting information, Technical 💡
Numbers are not absolute in the real floating point world. Rounding is often hidden in the technical connections by changing datatypes. Another shift can be caused by date representations.
Corrections: using coding guidelines is the only possible option to avoid mistakes.
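A tiny illustration of that guideline in a SAS data step: compare only after an explicit round, because the binary representation of decimal fractions differs:

```sas
/* 0.1 + 0.2 is not stored as exactly 0.3 in floating point.         */
data _null_;
   a = 0.1 + 0.2;
   if a = 0.3 then put 'equal';
   else put 'not equal, hidden difference: ' a= hex16.;
   /* the guideline: round explicitly before comparing or delivering */
   if round(a, 1e-9) = round(0.3, 1e-9) then put 'equal after rounding';
run;
```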

Monitoring information, Logical 💡
The information should be monitored for logical correctness. Technical correctness is only a matter of whether some code has run.
Cross validations: because it is about monitoring logical values, there are no standard technical solutions; only technical patterns can be developed.
(figure: SIAR, not STAR or PDCA)

Evaluating realisations.


EOL (end of life) for ETL.
🚧 There is a long history of doing ETL, delivering data marts from a specially developed data warehouse using a data model that is completely different from the operational information data model. That all started with the good reasons of limited machine capacity and a forced separation from operations.

💡 It is time for a change of those old habits. Dataops is the new buzzword.
ETL vs ELT: Decoupling ETL (trifacta)
Traditional ETL might be considered a bottleneck, but that doesn't mean it's invaluable. The same basic challenges that ETL tools and processes were designed to solve still exist, even if many of the surrounding factors have changed. For example, at a fundamental level, organizations still need to extract (E) data from legacy systems and load (L) it into their data lake. And they still need to transform (T) that data for use in analytics projects. "ETL" work needs to get done but what can change is the order in which it is achieved and new technologies that can support this work.

Instead of an ETL pipeline, many organizations are taking an "ELT" approach, or decoupling data movement (extracting and loading) from data preparation (transforming). This ELT approach follows a larger IT trend. Whereas IT architecture was historically built in monolithic silos, many organizations are decoupling the same components so that they function independently. Decoupled technologies means less work up front (stacks don't need to be deployed understanding all possible uses and outcomes) and more efficient maintenance. A clean separation between data movement and data preparation also comes with its own specific benefits:

Not always unhappy with what is going on
Doing a data pipeline as lean as possible was the way to do it. With dataops this is again becoming good practice. Control at III (and IV) on the left side, delivery at II (and I) on the right.
(figure: working with colleagues, full circle - controls for test automation in the release train (JST), the release train, data information security, and the data flow)
Working experiences shaped by some controls. The SIAR cycle is drawn using the corners 👓, with the topic cycle in the center.

SIAR:
S Situation
I Initiatives
A Actions
R Realisations

🔰 Lost here? Then back to: devops sdlc.


⚒   System Performance   System Configuration   Flow load slicing   Flow monitor   dataops   Retrospective   ⚒
  
⚒   Intro   Soar CSI   Big data   ER-star ALC3   High-variability   Results evaluate   ⚒
  
⚙   bpm   sdlc   bianl   data   meta   math   ⚙

© 2012,2020 J.A.Karman
👐 top    mid    bottom   👐
🎭 index references    elucidation    metier 🎭