Agreed.
Blind men & elephants come from designing & evaluating
at the same time.
I'm trying to figure out what major mechanisms are required
to achieve "essentially" optimal results w/o overdesigning.
E.g., I've been resisting allowing "massive" simultaneous
rearrangements, but they may be required so that
worst-case behavior doesn't dominate the average-case
behavior.
Why I'm doing this: I've been unhappy with the standard
RAM model, which assumes constant access time for every
memory access -- an assumption that is complete hogwash.
What I'm really trying to design/model here is a memory
"cache", but a bizarre type of cache in that every cache
line has a slightly different access time, with more
"remote" elements being monotonically slower than "closer"
elements. As in modern microprocessors, the fastest
cache lines may be 3 orders of magnitude faster than
the slowest ones. Furthermore, the
cost of a cache line is directly correlated with its
speed, so there are far more slower elements than faster
ones.
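
For concreteness, here is a minimal sketch of one candidate
access-cost function (the name access_cost, the power law, and
the alpha parameter are all placeholders, not commitments):

    # Index 0 is the fastest cache line; access cost grows
    # monotonically with the index.  With alpha = 0.5, position
    # 10**6 is about 1000x slower than position 0 -- the
    # "3 orders of magnitude" spread.
    def access_cost(i, alpha=0.5):
        return 1.0 + i ** alpha

    print(access_cost(0), access_cost(10**6))   # 1.0 vs 1001.0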
I haven't settled on a particular cost model yet,
because I want to play with several different ones.
As usual, we want most of the "work" to be done in
the closest/fastest cache lines, including any work
of rearranging the order of the cache lines. This
is because "small" rearrangements can be much faster
than "large" rearrangements.
These are all "exclusive" caches, meaning that a
particular item lives in exactly one cache line.
I think I'm close to deciding that some massive
rearrangements -- e.g., replacing the first N elements
of the fastest part of the cache with N sequential
remote elements -- will be a necessary built-in feature.
However, it is likely that the cost of such a massive
rearrangement will have to be correlated with the
"distance" as well as the number of items. Furthermore,
it may be necessary and/or convenient to have the
number of elements (N) involved in each transfer be
correlated with the distance. There is a tradeoff
here because a large transfer from "far" may bring
in something you need, but also a lot of other stuff
you didn't want -- thus "polluting" the fast cache.
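
One way to write that tradeoff down (every detail below is an
illustrative assumption, not a decision, again using
access_cost() from above): charge a block transfer a startup
cost that grows with the distance plus a per-element cost, and
tie the block size to the distance so the startup cost is
amortized -- at the price of dragging in unwanted items:

    # Swap the first n fast elements with n sequential elements
    # starting at position d (exclusive cache: the displaced fast
    # elements move out to where the remote block came from).
    def block_transfer_cost(d, n):
        return access_cost(d) + n   # distance startup + per-item work

    # Tie the transfer size to the distance: fetch more when
    # reaching farther, e.g. a transfer from d = 10**6 moves
    # about 1000 elements.
    def block_size_for_distance(d, beta=0.5):
        return max(1, int(d ** beta))

    d = 10**6
    n = block_size_for_distance(d)
    print(n, block_transfer_cost(d, n))   # 1000 2001.0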
This is reminiscent of 1950's/1960's style programming
where large blocks of data had to be brought in/out by
"channel operations"; some of these channels were even
smart enough to transfer data *within* main memory w/o
even touching an I/O device.
At 12:44 PM 4/20/2018, Tomas Rokicki wrote:
>I think we need some sort of explicit cost model here. Otherwise we are
>each blindly touching a different part of the elephant.