Uncommon Descent Serving The Intelligent Design Community

On Active Information, search, Islands of Function and FSCO/I

Categories
ID Foundations
rhetoric
specified complexity
Share
Facebook
Twitter/X
LinkedIn
Flipboard
Print
Email

A current rhetorical tack of objections to the design inference has two facets:

(a) suggesting or implying that by moving research focus to Active Information needle in haystack search-challenge linked Specified Complexity has been “dispensed with” [thus,too, related concepts such as FSCO/I]; and

(b) setting out to dismiss Active Information, now considered in isolation.

Both of these rhetorical gambits are in error.

However, just because a rhetorical assertion or strategy is erroneous does not mean that it is unpersuasive; especially for those inclined that way in the first place.

So, there is a necessity for a corrective.

First, let us observe how Marks and Dembski began their 2010 paper, in its abstract:

Needle-in-the-haystack problems look for small targets in large spaces. In such cases, blind search stands no hope of success. Conservation of information dictates any search technique will work, on average, as well as blind search. Success requires an assisted search. But whence the assistance required for a search to be successful? To pose the question this way suggests that successful searches do not emerge spontaneously but need themselves to be discovered via a search. The question then naturally arises whether such a higher-level “search for a search” is any easier than the original search. We prove two results: (1) The Horizontal No Free Lunch Theorem, which shows that average relative performance of searches never exceeds unassisted or blind searches, and (2) The Vertical No Free Lunch Theorem, which shows that the difficulty of searching for a successful search increases exponentially with respect to the minimum allowable active information being sought.

That is, the context of active information and associated search for a good search, is exactly that of finding isolated targets Ti in large configuration spaces W, that then pose a needle in haystack search challenge. Or, as I have represented this so often here at UD:

csi_defnUpdating to reflect the bridge to the origin of life challenge:

islands_of_func_chall

In this model, we see how researchers on evolutionary computing typically confine their work to tractable cases where a dust of random walk searches with drift due to a presumably gentle slope on what looks like a fairly flat surface is indeed likely to converge on multiple zones of sharply rising function, which then allows identification of likely local peaks of function. The researcher in view then has a second tier search across peaks to achieve a global maximum.

This of course contrasts with the FSCO/I [= functionally specific, complex organisation and/or associated information] case where

a: due to a need for multiple well-matched parts that

b: must be correctly arranged and coupled together

c: per a functionally specific wiring diagram

d: to attain the particular interactions that achieve function, and so

e: will be tied to an information-rich wiring diagram that

f: may be described and quantified informationally by using

g: a structured list of y/n q’s forming a descriptive bit string

. . . we naturally see instead isolated zones of function Ti amidst a much larger sea of non-functional clustered or scattered arrangements of parts.

This may be illustrated by an Abu 6500 C3 fishing reel exploded view assembly diagram:

abu_6500c3mag

. . . which may be compared to the organisation of a petroleum refinery:

Petroleum refinery block diagram illustrating FSCO/I in a process-flow system
Petroleum refinery block diagram illustrating FSCO/I in a process-flow system

. . . and to that of the cellular protein synthesis system:

Protein Synthesis (HT: Wiki Media)
Protein Synthesis (HT: Wiki Media)

. . . and onward the cellular metabolic process network (with the above being the small corner top left):

cell_metabolism

(NB: I insist on presenting this cluster of illustrations to demonstrate to all but the willfully obtuse, that FSCO/I is real, unavoidably familiar and pivotally relevant to origin of cell based life discussions, with implications onward for body plans that must unfold from an embryo or the like, OOL and OOBP.)

Now, in their 2013 paper on generalising their analysis, Marks, Dembski and Ewert begin:

All but the most trivial searches are needle-in-the-haystack problems. Yet many searches successfully locate needles in haystacks. How is this possible? A success-ful search locates a target in a manageable number of steps. According to conserva-tion of information, nontrivial searches can be successful only by drawing on existing external information, outputting no more information than was inputted [1]. In previous work, we made assumptions that limited the generality of conservation of information, such as assuming that the baseline against which search perfor-mance is evaluated must be a uniform probability distribution or that any query of the search space yields full knowledge of whether the candidate queried is inside or outside the target. In this paper, we remove such constraints and show that | conservation of information holds quite generally. We continue to assume that tar-gets are fixed. Search for fuzzy and moveable targets will be the topic of future research by the Evolutionary Informatics Lab.

In generalizing conservation of information, we first generalize what we mean by targeted search. The first three sections of this paper therefore develop a general approach to targeted search. The upshot of this approach is that any search may be represented as a probability distribution on the space being searched. Readers who are prepared to accept that searches may be represented in this way can skip to section 4 and regard the first three sections as stage-setting. Nonetheless, we sug-gest that readers study these first three sections, if only to appreciate the full gen-erality of the approach to search we are proposing and also to understand why attempts to circumvent conservation of information via certain types of searches fail. Indeed, as we shall see, such attempts to bypass conservation of information look to searches that fall under the general approach outlined here; moreover, conservation of information, as formalized here, applies to all these cases . . .

So, again, the direct relevance of FSCO/I and linked needle in haystack search challenge continues.

Going further, we may now focus:

is_ o_func2_activ_info

In short, active information is a bridge that allows us to pass to relevant zones of FSCO/I, Ti, and to cross plateaus and intervening valleys in an island of function that does not exhibit a neatly behaved objective function. And, it is reasonable to measure it’s impact based on search improvement, in informational terms. (Where, it may only need to give a hint, try here and scratch around a bit: warmer/colder/hot-hot-hot. AI itself does not have to give the sort of detailed wiring diagram description associated with FSCO/I.)

It must be deeply understood, that the dominant aspect of the situation is resource sparseness confronting a blind needle in haystack search. A reasonably random blind search will not credibly outperform the overwhelmingly likely failure of the yardstick, flat random search. Too much stack, too few search resources, too little time. And a drastically improved search, a golden search if you will, itself has to be found before it becomes relevant.

That means, searching for a good search.

Where, a search on a configuration space W, is a sample of its subsets. That is, it is a member of the power set of W, which has cardinality 2^W. Thus it is plausible that such a search will be much harder than a direct fairly random search.  (And yes, one may elaborate an analysis to address that point, but it is going to come back to much the same conclusion.)

Further, consider the case where the pictured zones are like sandy barrier islands, shape-shifting and able to move. That is, they are dynamic.

This will not affect the dominant challenge, which is to get to an initial Ti for OOL then onwards to get to further islands Tj etc for OOBP.  That is doubtless a work in progress over at the Evolutionary Informatics Lab, but is already patent from the challenge in the main.

To give an outline idea, let me clip a summary of the needle-to-stack challenge:

Our observed cosmos has in it some 10^80 atoms, and a good atomic-level clock-tick is a fast chem rxn rate of perhaps 10^-14 s. 13.7 bn y ~10^17 s. The number of atom-scale events in that span in the observed cosmos is thus of order 10^111.

The number of configs for 1,000 coins (or, bits) is 2^1,000 ~ 1.07*10^301.

That is, if we were to give each atom of the observed cosmos a tray of 1,000 coins, and toss and observe then process 10^14 times per second, the resources of the observed cosmos would sample up to 1 in 10^190 of the set of possibilities.

It is reasonable to deem such a blind search, whether contiguous or a dust, as far too sparse to have any reasonable likelihood of finding any reasonably isolated “needles” in the haystack of possibilities. A rough calc suggests that the ratio is comparable to a single straw drawn from a cubical haystack ~ 2 * 10^45 LY across. (Our observed cosmos may be ~ 10^11 LY across, i.e. the imaginary haystack would swallow up our observed cosmos.)

Of course, as posts in this thread amply demonstrate the “miracle” of intelligently directed configuration allows us to routinely produce cases of functionally specific complex organisation and/or associated information well beyond such a threshold. For an ASCII text string 1,000 bits is about 143 characters, the length of a Twitter post.

As just genomes for OOL  start out at 100 – 1,000 k bases and those for OOBP credibly run like 10 – 100+ mn bases, this is a toy illustration of the true magnitude of the problem.

The context and challenge addressed by the active information concept is blind needle in haystack search challenge, and so also FSCO/I. The only actually observed adequate cause of FSCO/I is intelligently directed configuration, aka design. And per further experience, design works by injecting active information coming from a self-moved agent cause capable of rational contemplation and creative synthesis.

So, FSCO/I remains as best explained on design. In fact, per a trillion member base of observations, it is a reliable sign of it. Which has very direct implications for our thought on OOL and OOBP.

Or, it should. END

Comments
179 Winston EwertMay 4, 2015 at 1:56 pm
Didn’t you read what WE had to say about it? And what about Dembski, Meyer, Behe, Marks et al.? Do you think they even consider FSCO/I? FSCO/I just dead and never lived.
What I said was that I wasn’t familiar with it, and suggested that it should be pursued via publication in a venue like a paper or conference rather than in blog posts. I have no opinion on the merits of FSCO/I. I don’t know enough about it to have an opioion.
According to Google FSCO/I has been metioned at UD more than 1000 times. I.e., more often than Dembski's original CSI which although much older was mentioned 918 times according to Google. Maybe it's my English skills but your statements
I’ve seen posts about it. I’m not inclined to take it seriously until I see it published some place more serious then a blog.
You really think a comment on a blog that quotes other people and calls them idea-roots for FSCO/I qualifies as a serious presentation of the idea of FSCO/I?
don't really sound as if you don't have an opinion on FSCO/I. BTW, is the founder of UD even aware of FSCO/I? Did you ever discuss FSCO/I with Dr. Dembski or Dr. Marks in the past or in the since it was mentioned in the previous thread? If not I guess everybody here would be happy if you discuss FSCO/I with him and Dr. Marks and publish a summary of the discussion here.sparc
May 4, 2015
May
05
May
4
04
2015
08:54 PM
8
08
54
PM
PDT
Z, you are simply showing just how difficult the real problem is. KFkairosfocus
May 4, 2015
May
05
May
4
04
2015
07:15 PM
7
07
15
PM
PDT
In the case of Dawkin’s Weasel program, the algorithm doesn’t know the answer. So? The weasel function requires a string as input since the algorithm doesn’t have a built-in solution. So? WeaselFunction(pointer to string) Why a string? Why not a pointer to a peanut butter and jelly sandwich? Let me tell you why. Because the programmer knew that the desired solution was a string of a certain length composed of specific letters in a specific order and found a way to code that into the program. The program didn't magically conceive that on it's own. We say GA's are designed because they are designed. And then people say that's irrelevant. Har.Mung
May 4, 2015
May
05
May
4
04
2015
04:07 PM
4
04
07
PM
PDT
No word is an island.Mung
May 4, 2015
May
05
May
4
04
2015
03:54 PM
3
03
54
PM
PDT
Didn’t you read what WE had to say about it? And what about Dembski, Meyer, Behe, Marks et al.? Do you think they even consider FSCO/I? FSCO/I just dead and never lived.
What I said was that I wasn't familiar with it, and suggested that it should be pursued via publication in a venue like a paper or conference rather than in blog posts. I have no opinion on the merits of FSCO/I. I don't know enough about it to have an opioion.Winston Ewert
May 4, 2015
May
05
May
4
04
2015
12:56 PM
12
12
56
PM
PDT
No, Carpathian, the specification was the wavelength and type of wave- you know typical antenna stuff that engineers use. The GA emulated engineers. They all use actual selection as opposed to natural selection which is a process of elimination. Selecting the top 1% is much different from eliminating the bottom 1%. Selection drives towards a target. I know no one knows the solution. The point is GAs actively drive towards that solution.Joe
May 4, 2015
May
05
May
4
04
2015
11:59 AM
11
11
59
AM
PDT
Yes, the "specification" was the "environment" the antenna would be operating in. Sometimes we don't even know that. In the case of a Weasel type program generating "DNA snippets", we don't have a clue what they might be and we don't know we will like the result even if it survives. The point is, that the criteria for allowing that "DNA snippet" to survive is unknown before the organism is actually tested. This is a blind test to the farmer, the seed company and Dawkin's Weasel program. No one knows the "solution" which will simply be something the farmer likes.Carpathian
May 4, 2015
May
05
May
4
04
2015
11:51 AM
11
11
51
AM
PDT
The solution isn't known but what is wanted is known. The specification for the antenna was known but the solution, ie the actual antenna design, was not. Without the specification the antenna would never have been realized.Joe
May 4, 2015
May
05
May
4
04
2015
11:42 AM
11
11
42
AM
PDT
In Dawkin's Weasel algorithm, the input sentence is the "specification". You could replace the input string with a short "DNA sequence" and take the next randomly altered by Weasel "DNA sequence" and use it as a GMO modifier. If the corn or green beans turn out to be a better crop, you feed that short "DNA sequence" back into Weasel to reproduce with random changes. You would never actually compare any strings. The test would be better crops. This way there is no reason to say the solution is known since we would never compare it to any specific sequence.Carpathian
May 4, 2015
May
05
May
4
04
2015
11:35 AM
11
11
35
AM
PDT
They solve the problem by design. They are no different than thousands of qualified people all working independently to solve the problem. In Dawkins weasel if the answer wasn't provided the sentence would never be found. In the antenna GA if the specification for the antenna wasn't provided the solution would never be found. GAs do what humans do, they just do it faster and with virtual resources.Joe
May 4, 2015
May
05
May
4
04
2015
11:16 AM
11
11
16
AM
PDT
Joe: A designed algorithm does not mean the problems it solves are designed. Take "X + Y = Z". Given that numbers are infinite, there is no possible way that all solutions are known beforehand. In the case of Dawkin's Weasel program, the algorithm doesn't know the answer. (pointer to solution) = WeaselFunction((pointer to string)); The weasel function requires a string as input since the algorithm doesn't have a built-in solution.Carpathian
May 4, 2015
May
05
May
4
04
2015
10:57 AM
10
10
57
AM
PDT
No Carpathian, YOU are confused. GAs are intelligently designed and so are their solutions. GAs use selection- actual intelligent selection, to drive variants towards a goal.Joe
May 4, 2015
May
05
May
4
04
2015
10:37 AM
10
10
37
AM
PDT
Joe:
Evolutionary algorithms model intelligent design evolution. Nice own goal.
You are confusing the lab and the test. The tools we use may be designed, but that should not have an effect on what we are measuring. Sometimes the effects can be diminished and sometimes we just have to live with them. As an example, there is no reason we could not use designed math to model random probabilities resulting in a dealt hand in a card game.Carpathian
May 4, 2015
May
05
May
4
04
2015
10:32 AM
10
10
32
AM
PDT
sparc- Dembski wrote that biological specification refers to function. All FSCO/I do0es it make it much clearer, ie more specific, what is being discussed.Joe
May 4, 2015
May
05
May
4
04
2015
09:41 AM
9
09
41
AM
PDT
Zachriel:
This process is abstracted in an evolutionary algorithm.
Evolutionary algorithms model intelligent design evolution. Nice own goal.Joe
May 4, 2015
May
05
May
4
04
2015
09:33 AM
9
09
33
AM
PDT
Zachriel is a projectionist:
You won’t stake out a position so that you can disavow it later. There’s no point testing your position when you won’t cooperate in devising a suitable test.
Talk about not knowing one's place.
Then let’s consider longer words. Do you think 12-letter words are isolated islands?
Not for intelligently designed processes like word mutagen.Joe
May 4, 2015
May
05
May
4
04
2015
09:32 AM
9
09
32
AM
PDT
mike1962: So then, is all the code and data except the target string part of the replicators’ set of properties? There's a relationship between the sequences and the fitness landscape. That's what it means to evaluate the fitness. In natural biology, when a mutated sequence is expressed, there may be a variety of very complex changes in the organism. In terms of reproductive potential, this can be beneficial, detrimental, or neutral. We don't have to know the details of this process to observe whether a variation is advantageous or not in a given environment. This process is abstracted in an evolutionary algorithm.Zachriel
May 4, 2015
May
05
May
4
04
2015
09:19 AM
9
09
19
AM
PDT
Mung:
That a ’0? in the genome represents a ‘T’ in the phenome is a design decision. That a ’1? in the genome represents an ‘H’ in the phenome is a design decision. That ‘T’ and ‘H’ have relevance to a potential solution is a design decision.
This sounds to me more of a case of labeling, not design. You seem to be confusing the "lab" with the "Device Under Test". If I type that I prefer to use a pen instead of using a computer, I have not demonstrated that I am a hypocrite by typing this sentence. It is simply the case that this is the best and probably only way to get that information to you.Carpathian
May 4, 2015
May
05
May
4
04
2015
09:18 AM
9
09
18
AM
PDT
mike1962: So then, is all the code and data except the target string part of the replicators’ set of properties? Here's the usual steps: * Evaluate the individual fitness of each member of population; * Determine best fit individuals to become parents; * Breed new individuals through mutation and crossover to give birth to offspring;Zachriel
May 4, 2015
May
05
May
4
04
2015
09:10 AM
9
09
10
AM
PDT
Zechriel: The population of sequences are the replicates. The environment is the target string, “Methinks it is like a weasel”.
So then, is all the code and data except the target string part of the replicators' set of properties?mike1962
May 4, 2015
May
05
May
4
04
2015
08:50 AM
8
08
50
AM
PDT
kairosfocus: how you do it is your problem. You won't stake out a position so that you can disavow it later. There's no point testing your position when you won't cooperate in devising a suitable test. kairosfocus: By hoping to get away with extrapolating on short words that will be closer in Hamming space, you set yourself that burden. Then let's consider longer words. Do you think 12-letter words are isolated islands?Zachriel
May 4, 2015
May
05
May
4
04
2015
08:44 AM
8
08
44
AM
PDT
Z, how you do it is your problem. By hoping to get away with extrapolating on short words that will be closer in Hamming space, you set yourself that burden. On the cases I showed, they obviously compared a corpus of literature. KFkairosfocus
May 4, 2015
May
05
May
4
04
2015
08:40 AM
8
08
40
AM
PDT
kairosfocus: it is you who still have that burden to show. We could use a book of every possible phrase in the English language, but that would require quite a bit of memory. How about a partial phrase book? That would result in even larger gaps between valid sequences, and make the job of navigating it with an evolutionary algorithm considerably more difficult. The phrase book would include words and phrases, a phrase defined as any contiguous words found in a valid sentence. Does that work for a fitness landscape to show islands of function?Zachriel
May 4, 2015
May
05
May
4
04
2015
08:38 AM
8
08
38
AM
PDT
Z, it is you who still have that burden to show. And the target zone -- assuming use of the 128 ASCII character set, is 72 characters of coherently meaningful text strings. So far, as cited above and long since headlined here at UD, 19 - 24 characters is what's been done with W ~ 10^50, a factor of 10^100 short of the scope where FSCO/I is a material factor. KFkairosfocus
May 4, 2015
May
05
May
4
04
2015
08:30 AM
8
08
30
AM
PDT
mike1962: What part of the software and/or data is the replicator object and what part is the environment? The population of sequences are the replicates. The environment is the target string, "Methinks it is like a weasel". kairosfocus: Therefore, it is part of your own burden of proof. Let's start with the simpler case. Using a standard dictionary for the fitness landscape, are seven-letter words isolated islands? Ten-letter words?Zachriel
May 4, 2015
May
05
May
4
04
2015
07:50 AM
7
07
50
AM
PDT
Zechrial: A simple example is Weasel
Let's stick with Weasel for now: What part of the software and/or data is the replicator object and what part is the environment?mike1962
May 4, 2015
May
05
May
4
04
2015
07:27 AM
7
07
27
AM
PDT
sparc: You obviously are refusing to look at and take due note of the evidence right in front of your eyes. FSCO/I is a descriptive acronym, that actually builds on Orgel, Wicken and Thaxton et al. Dembski and Meyer speak to the same cluster of facts, phenomena, concepts and issues. That has been on record for decades. All around you the phenomenon is manifest. Whatever we may wish for by way of more research, those facts are already more than adequate to address seriously the functionally specific subset of complex specified information that is as blatantly in your face as the text s-t-r-i-n-g-s we are using to communicate with in this thread. It is as commonplace as the electrical and electronic circuits in the PCs we are using. It is in the coded info in the programs in those PCs. It is in the exploded view wiring diagrams behind ever so many machines, whether PCs or fishing reels or automobiles or airplanes. It is in the DNA of the cells, the ribosomes, the proteins, the von Neumann self replicating facilities in the cells. And, consistently, when we see the cause, it is intelligently directed configuration. With a very good reason as discussed in the OP above you seem to be studiously avoiding, namely that wiring diagram interactive functionality sharply constrains effective arrangements of parts, creating isolated islands of function in large config spaces. So, the means to bridge seas of non-function to reach such islands is by injection of active information that moves us beyond what we may reasonably expect of blind chance and mechanical necessity. KFkairosfocus
May 4, 2015
May
05
May
4
04
2015
07:07 AM
7
07
07
AM
PDT
kairosfocus @ 148 You mean Demsbki, Behe, Meyer et al. were referring to FSCO/I before you even coined the term? Dembski's and Marks's co-worker and co-author Winston Ewert was obviously reluctant to even consider FSCO/I. When asked directly
Is FSCO/I something you’ve heard of? If you have, do you (and as spokesman for DEM) endorse it?
he replied
I’ve seen posts about it. I’m not inclined to take it seriously until I see it published some place more serious then a blog.
BTW, nobody is mentioning Lewontin in this thread.sparc
May 4, 2015
May
05
May
4
04
2015
05:38 AM
5
05
38
AM
PDT
VS, welcome. KFkairosfocus
May 4, 2015
May
05
May
4
04
2015
05:21 AM
5
05
21
AM
PDT
Z:
kairosfocus: As a first answer, we can challenge you, Z, to provide an onward incrementalist functional at each step chance variation and selection based progress to something like [Z:} Provide us the fitness landscape, a way to determine whether something is a valid phrase in English, then we will provide the algorithm. A phrase book should contain these among other snips: “quick dog” “lazy fox”, “jumps over”, “the quick”, etc.
Z, it is you who by suggesting a stepwise task suggested such. Therefore, it is part of your own burden of proof. And had you observed the Million Monkeys task, you would see the obvious point, matching a large corpus of literature stored electronically as a way to address it. All of which goes to the absence of actual algorithms specifying step by step procedure that computationally implemented on a reasonable machine solve the problem in hand. KFkairosfocus
May 4, 2015
May
05
May
4
04
2015
05:20 AM
5
05
20
AM
PDT
1 2 3 4 5 6 10

Leave a Reply